Javascript must be enabled to continue!
DETIRE: A Hybrid Deep Learning Model for identifying Viral Sequences from Metagenomes
View through CrossRef
Abstract
A metagenome contains all DNA sequences from an environmental sample, including viruses, bacteria, fungi, actinomycetes and so on. Since viruses are of huge abundance and have caused vast mortality and morbidity to human society in history as a kind of major pathogens, detecting viruses from metagenomes plays a crucial role in analysing the viral component of samples and is the very first step for clinical diagnosis. However, detecting viral fragments directly from the metagenomes is still a tough issue because of the existence of huge number of short sequences. In this paper, a hybrid Deep lEarning model for idenTifying vIral sequences fRom mEtagenomes (DETIRE), is proposed to solve the problem. Firstly, the graph-based nucleotide sequence embedding strategy is utilized to enrich the expression of DNA sequences by training an embedding matrix. Then the spatial and sequential features are extracted by trained CNN and BiLSTM networks respectively to improve the feature expression of short sequences. Finally, the two set of features are weighted combined for the final decision. Trained by 220,000 sequences of 500bp subsampled from the Virus and Host RefSeq genomes, DETIRE identifies more short viral sequences (<1,000bp) than three latest methods, DeepVirFinder, PPR-Meta and CHEER. DETIRE is freely available at
https://github.com/crazyinter/DETIRE
.
Title: DETIRE: A Hybrid Deep Learning Model for identifying Viral Sequences from Metagenomes
Description:
Abstract
A metagenome contains all DNA sequences from an environmental sample, including viruses, bacteria, fungi, actinomycetes and so on.
Since viruses are of huge abundance and have caused vast mortality and morbidity to human society in history as a kind of major pathogens, detecting viruses from metagenomes plays a crucial role in analysing the viral component of samples and is the very first step for clinical diagnosis.
However, detecting viral fragments directly from the metagenomes is still a tough issue because of the existence of huge number of short sequences.
In this paper, a hybrid Deep lEarning model for idenTifying vIral sequences fRom mEtagenomes (DETIRE), is proposed to solve the problem.
Firstly, the graph-based nucleotide sequence embedding strategy is utilized to enrich the expression of DNA sequences by training an embedding matrix.
Then the spatial and sequential features are extracted by trained CNN and BiLSTM networks respectively to improve the feature expression of short sequences.
Finally, the two set of features are weighted combined for the final decision.
Trained by 220,000 sequences of 500bp subsampled from the Virus and Host RefSeq genomes, DETIRE identifies more short viral sequences (<1,000bp) than three latest methods, DeepVirFinder, PPR-Meta and CHEER.
DETIRE is freely available at
https://github.
com/crazyinter/DETIRE
.
Related Results
Selection of Injectable Drug Product Composition using Machine Learning Models (Preprint)
Selection of Injectable Drug Product Composition using Machine Learning Models (Preprint)
BACKGROUND
As of July 2020, a Web of Science search of “machine learning (ML)” nested within the search of “pharmacokinetics or pharmacodynamics” yielded over 100...
Viral Hijacking of Host RNA-Binding Proteins: Implications for Viral Replication and Pathogenesis
Viral Hijacking of Host RNA-Binding Proteins: Implications for Viral Replication and Pathogenesis
In the intricate dance between viruses and host cells, RNA-binding proteins (RBPs) serve as crucial orchestrators of gene expression and cellular processes. We will delve into the ...
Deep convolutional neural network and IoT technology for healthcare
Deep convolutional neural network and IoT technology for healthcare
Background Deep Learning is an AI technology that trains computers to analyze data in an approach similar to the human brain. Deep learning algorithms can find complex patterns in ...
Cataloging Viral Diversity from Nonaxenic Terrestrial Cyanobacteria Cultures
Cataloging Viral Diversity from Nonaxenic Terrestrial Cyanobacteria Cultures
Abstract
Viruses are exceedingly common, but little is known about their diversity let alone how they behave in extreme environments, and whether viruses facilitate...
Bioinformatics analysis and collection of protein post-translational modification sites in human viruses
Bioinformatics analysis and collection of protein post-translational modification sites in human viruses
AbstractIn viruses, post-translational modifications (PTMs) are essential for their life cycle. Recognizing viral PTMs is very important for better understanding the mechanism of v...
Lapidary: Identifying and reporting amino acid sequences in metagenomes using sequence reads and Diamond
Lapidary: Identifying and reporting amino acid sequences in metagenomes using sequence reads and Diamond
Abstract
Genome and metagenome comparisons rely on identifying genetic elements that differ or are in common between samples. These genetic elements can be identifi...
Abstract 11844: Viral Gene Signatures in Patients With Cardiomyopathy After Chemotherapy
Abstract 11844: Viral Gene Signatures in Patients With Cardiomyopathy After Chemotherapy
Introduction:
Chemotherapy for hematologic malignancies can cause severe cardiomyopathy. Chemotherapeutic agents also induce immune suppression increasing the risk of o...
Generating viral metagenomes from the coral holobiont v1
Generating viral metagenomes from the coral holobiont v1
Reef-building corals comprise multipartite symbioses where the cnidarian animal is host to an array of eukaryotic and prokaryotic organisms, and the viruses that infect them. These...

