Javascript must be enabled to continue!
Deep Learning Applied to the SARS-CoV-2 Classification
View through CrossRef
Abstract
Purpose: The primary objective of this study was to develop and evaluate a deep neural network model based on convolutional neural networks (CNNs) for accurately classifying SARS-CoV-2 viral sequences and other subtypes within the Coronaviridae family. With the rapid evolution of viral genomes and the increasing need for timely classification, we aimed to provide a robust and efficient tool that could enhance the accuracy of viral identification and classification processes. By harnessing the power of deep learning, we sought to contribute to advancing viral genomics research and aid in surveilling emerging viral strains.
Methods: We designed and implemented a CNN-based deep neural network architecture capable of processing complete cDNA genomic sequences to achieve our goal. We used a dataset comprising diverse viral subtypes, including SARSCoV- 2, for training and testing. The dataset was partitioned using a 5-fold cross-validation strategy to ensure rigorous evaluation. Our model’s performance was assessed using various metrics, including accuracy, precision, sensitivity, specificity, F1-score, and AUROC. Additionally, artificial mutation tests were conducted to evaluate the model’s generalization ability across sequence variations. We also used the BLAST algorithm and conducted comprehensive processing time analyses for comparison.
Results: The developed CNN-based model demonstrated exceptional performance across various evaluation metrics. In the training phase, the model consistently achieved maximum values for accuracy, sensitivity, specificity, and other key metrics, indicating its robust learning ability. Notably, during testing on over 10,000 viral sequences, the model exhibited a sensitivity of over 99% for sequences with fewer than 2,000 mutations. The CNN-based model showcased superior accuracy and significantly reduced processing times compared to the BLAST algorithm. These findings underscore the model’s effectiveness in accurately classifying viral sequences and its potential to revolutionize viral genomics research.
Conclusion: This study introduces a CNN-based deep neural network model as a powerful tool for precisely classifying viral sequences, specifically focusing on SARS-CoV-2 and other Coronaviridae family subtypes. Our model’s superiority is evident through rigorous evaluation and comparison with existing methods, offering enhanced accuracy and efficiency. The application of artificial mutation testing demonstrated the model’s robustness in handling sequence variations. By harnessing deep learning capabilities, our model significantly contributes to viral classification and genomics research. As viral surveillance becomes increasingly critical, our model holds promise in aiding rapid and accurate identification of emerging viral strains.
Research Square Platform LLC
Title: Deep Learning Applied to the SARS-CoV-2 Classification
Description:
Abstract
Purpose: The primary objective of this study was to develop and evaluate a deep neural network model based on convolutional neural networks (CNNs) for accurately classifying SARS-CoV-2 viral sequences and other subtypes within the Coronaviridae family.
With the rapid evolution of viral genomes and the increasing need for timely classification, we aimed to provide a robust and efficient tool that could enhance the accuracy of viral identification and classification processes.
By harnessing the power of deep learning, we sought to contribute to advancing viral genomics research and aid in surveilling emerging viral strains.
Methods: We designed and implemented a CNN-based deep neural network architecture capable of processing complete cDNA genomic sequences to achieve our goal.
We used a dataset comprising diverse viral subtypes, including SARSCoV- 2, for training and testing.
The dataset was partitioned using a 5-fold cross-validation strategy to ensure rigorous evaluation.
Our model’s performance was assessed using various metrics, including accuracy, precision, sensitivity, specificity, F1-score, and AUROC.
Additionally, artificial mutation tests were conducted to evaluate the model’s generalization ability across sequence variations.
We also used the BLAST algorithm and conducted comprehensive processing time analyses for comparison.
Results: The developed CNN-based model demonstrated exceptional performance across various evaluation metrics.
In the training phase, the model consistently achieved maximum values for accuracy, sensitivity, specificity, and other key metrics, indicating its robust learning ability.
Notably, during testing on over 10,000 viral sequences, the model exhibited a sensitivity of over 99% for sequences with fewer than 2,000 mutations.
The CNN-based model showcased superior accuracy and significantly reduced processing times compared to the BLAST algorithm.
These findings underscore the model’s effectiveness in accurately classifying viral sequences and its potential to revolutionize viral genomics research.
Conclusion: This study introduces a CNN-based deep neural network model as a powerful tool for precisely classifying viral sequences, specifically focusing on SARS-CoV-2 and other Coronaviridae family subtypes.
Our model’s superiority is evident through rigorous evaluation and comparison with existing methods, offering enhanced accuracy and efficiency.
The application of artificial mutation testing demonstrated the model’s robustness in handling sequence variations.
By harnessing deep learning capabilities, our model significantly contributes to viral classification and genomics research.
As viral surveillance becomes increasingly critical, our model holds promise in aiding rapid and accurate identification of emerging viral strains.
Related Results
The Hidden Problem of Cross-Reactivity: Challenges in HIV Testing During the COVID-19 Era: A Systematic Review
The Hidden Problem of Cross-Reactivity: Challenges in HIV Testing During the COVID-19 Era: A Systematic Review
Abstract
Introduction
Human immunodeficiency virus (HIV) and Severe Acute Respiratory Syndrome Coronavirus 2 (SARS-CoV2) surface glycoproteins, including shared epitope motifs, sho...
From SARS and MERS CoVs to SARS‐CoV‐2: Moving toward more biased codon usage in viral structural and nonstructural genes
From SARS and MERS CoVs to SARS‐CoV‐2: Moving toward more biased codon usage in viral structural and nonstructural genes
AbstractBackgroundSevere acute respiratory syndrome coronavirus 2 (SARS‐CoV‐2) is an emerging disease with fatal outcomes. In this study, a fundamental knowledge gap question is to...
Performance characteristics of the VIDAS® SARS-COV-2 IgM and IgG serological assays
Performance characteristics of the VIDAS® SARS-COV-2 IgM and IgG serological assays
ABSTRACTThe COVID-19 pandemic, caused by the new severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2), continues to spread worldwide. Serological testing for SARS-CoV-2-spe...
Analyses of the Spike Proteins of Severe Acute Respiratory Syndrome-Related Coronaviruses
Analyses of the Spike Proteins of Severe Acute Respiratory Syndrome-Related Coronaviruses
Aim: To analyze spike proteins of Severe Acute Respiratory Syndrome (SARS)-related coronaviruses (CoVs) for their conserved motifs, Receptor-Binding Domain (RBD), Receptor Binding...
SARS-CoV-2 within-host diversity of human hosts and its implications for viral immune evasion
SARS-CoV-2 within-host diversity of human hosts and its implications for viral immune evasion
ABSTRACT
Severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) is continuously evolving, bringing great challenges to the control of the virus. In the...
The emerging SARS‐CoV‐2 papain‐like protease: Its relationship with recent coronavirus epidemics
The emerging SARS‐CoV‐2 papain‐like protease: Its relationship with recent coronavirus epidemics
AbstractThe papain‐like protease (PLpro) is an important enzyme for coronavirus polyprotein processing, as well as for virus‐host immune suppression. Previous studies reveal that a...
EPD Electronic Pathogen Detection v1
EPD Electronic Pathogen Detection v1
Electronic pathogen detection (EPD) is a non - invasive, rapid, affordable, point- of- care test, for Covid 19 resulting from infection with SARS-CoV-2 virus. EPD scanning techno...
Resistance of endothelial cells to SARS-CoV-2 infectionin vitro
Resistance of endothelial cells to SARS-CoV-2 infectionin vitro
AbstractRationaleThe secondary thrombotic/vascular clinical syndrome of COVID-19 suggests that SARS-CoV-2 infects not only respiratory epithelium but also the endothelium activatin...

