Javascript must be enabled to continue!
Deep Learning Applied to the SARS-CoV-2 Classification
View through CrossRef
Abstract
Purpose: The primary objective of this study was to develop and evaluate a deep neural network model based on convolutional neural networks (CNNs) for accurately classifying SARS-CoV-2 viral sequences and other subtypes within the Coronaviridae family. With the rapid evolution of viral genomes and the increasing need for timely classification, we aimed to provide a robust and efficient tool that could enhance the accuracy of viral identification and classification processes. By harnessing the power of deep learning, we sought to contribute to advancing viral genomics research and aid in surveilling emerging viral strains.
Methods: We designed and implemented a CNN-based deep neural network architecture capable of processing complete cDNA genomic sequences to achieve our goal. We used a dataset comprising diverse viral subtypes, including SARSCoV- 2, for training and testing. The dataset was partitioned using a 5-fold cross-validation strategy to ensure rigorous evaluation. Our model’s performance was assessed using various metrics, including accuracy, precision, sensitivity, specificity, F1-score, and AUROC. Additionally, artificial mutation tests were conducted to evaluate the model’s generalization ability across sequence variations. We also used the BLAST algorithm and conducted comprehensive processing time analyses for comparison.
Results: The developed CNN-based model demonstrated exceptional performance across various evaluation metrics. In the training phase, the model consistently achieved maximum values for accuracy, sensitivity, specificity, and other key metrics, indicating its robust learning ability. Notably, during testing on over 10,000 viral sequences, the model exhibited a sensitivity of over 99% for sequences with fewer than 2,000 mutations. The CNN-based model showcased superior accuracy and significantly reduced processing times compared to the BLAST algorithm. These findings underscore the model’s effectiveness in accurately classifying viral sequences and its potential to revolutionize viral genomics research.
Conclusion: This study introduces a CNN-based deep neural network model as a powerful tool for precisely classifying viral sequences, specifically focusing on SARS-CoV-2 and other Coronaviridae family subtypes. Our model’s superiority is evident through rigorous evaluation and comparison with existing methods, offering enhanced accuracy and efficiency. The application of artificial mutation testing demonstrated the model’s robustness in handling sequence variations. By harnessing deep learning capabilities, our model significantly contributes to viral classification and genomics research. As viral surveillance becomes increasingly critical, our model holds promise in aiding rapid and accurate identification of emerging viral strains.
Research Square Platform LLC
Title: Deep Learning Applied to the SARS-CoV-2 Classification
Description:
Abstract
Purpose: The primary objective of this study was to develop and evaluate a deep neural network model based on convolutional neural networks (CNNs) for accurately classifying SARS-CoV-2 viral sequences and other subtypes within the Coronaviridae family.
With the rapid evolution of viral genomes and the increasing need for timely classification, we aimed to provide a robust and efficient tool that could enhance the accuracy of viral identification and classification processes.
By harnessing the power of deep learning, we sought to contribute to advancing viral genomics research and aid in surveilling emerging viral strains.
Methods: We designed and implemented a CNN-based deep neural network architecture capable of processing complete cDNA genomic sequences to achieve our goal.
We used a dataset comprising diverse viral subtypes, including SARSCoV- 2, for training and testing.
The dataset was partitioned using a 5-fold cross-validation strategy to ensure rigorous evaluation.
Our model’s performance was assessed using various metrics, including accuracy, precision, sensitivity, specificity, F1-score, and AUROC.
Additionally, artificial mutation tests were conducted to evaluate the model’s generalization ability across sequence variations.
We also used the BLAST algorithm and conducted comprehensive processing time analyses for comparison.
Results: The developed CNN-based model demonstrated exceptional performance across various evaluation metrics.
In the training phase, the model consistently achieved maximum values for accuracy, sensitivity, specificity, and other key metrics, indicating its robust learning ability.
Notably, during testing on over 10,000 viral sequences, the model exhibited a sensitivity of over 99% for sequences with fewer than 2,000 mutations.
The CNN-based model showcased superior accuracy and significantly reduced processing times compared to the BLAST algorithm.
These findings underscore the model’s effectiveness in accurately classifying viral sequences and its potential to revolutionize viral genomics research.
Conclusion: This study introduces a CNN-based deep neural network model as a powerful tool for precisely classifying viral sequences, specifically focusing on SARS-CoV-2 and other Coronaviridae family subtypes.
Our model’s superiority is evident through rigorous evaluation and comparison with existing methods, offering enhanced accuracy and efficiency.
The application of artificial mutation testing demonstrated the model’s robustness in handling sequence variations.
By harnessing deep learning capabilities, our model significantly contributes to viral classification and genomics research.
As viral surveillance becomes increasingly critical, our model holds promise in aiding rapid and accurate identification of emerging viral strains.
Related Results
The Potential of Medicinal Plants and Bioactive Compounds in the Fight Against COVID-19
The Potential of Medicinal Plants and Bioactive Compounds in the Fight Against COVID-19
Severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2), a novel coronavirus , is causing a serious worldwide COVID-19 pandemic. The emergence of strains with rapid spread and...
The Hidden Problem of Cross-Reactivity: Challenges in HIV Testing During the COVID-19 Era: A Systematic Review
The Hidden Problem of Cross-Reactivity: Challenges in HIV Testing During the COVID-19 Era: A Systematic Review
Abstract
Introduction
Human immunodeficiency virus (HIV) and Severe Acute Respiratory Syndrome Coronavirus 2 (SARS-CoV2) surface glycoproteins, including shared epitope motifs, sho...
Kinetics of the humoral immune response to SARS-CoV-2: comparative analytical performance of seven commercial serology tests
Kinetics of the humoral immune response to SARS-CoV-2: comparative analytical performance of seven commercial serology tests
Abstract
Background
SARS-CoV-2 serology tests are clinically useful to document a prior SARS-CoV-2 infection in patients with n...
Mutations in SARS-CoV
Mutations in SARS-CoV
The coronavirus family is named for the large spike protein molecules found
on the pathogen exterior, which give the virus a crown-like appearance, the coronavirus
genome is the bi...
From SARS and MERS CoVs to SARS‐CoV‐2: Moving toward more biased codon usage in viral structural and nonstructural genes
From SARS and MERS CoVs to SARS‐CoV‐2: Moving toward more biased codon usage in viral structural and nonstructural genes
AbstractBackgroundSevere acute respiratory syndrome coronavirus 2 (SARS‐CoV‐2) is an emerging disease with fatal outcomes. In this study, a fundamental knowledge gap question is to...
Performance characteristics of the VIDAS® SARS-COV-2 IgM and IgG serological assays
Performance characteristics of the VIDAS® SARS-COV-2 IgM and IgG serological assays
ABSTRACTThe COVID-19 pandemic, caused by the new severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2), continues to spread worldwide. Serological testing for SARS-CoV-2-spe...
P-003 Impact of SARS-CoV-2 infection on circulating male sex hormones and human semen quality: a systematic review and meta-analysis
P-003 Impact of SARS-CoV-2 infection on circulating male sex hormones and human semen quality: a systematic review and meta-analysis
Abstract
Study question
Does SARS-CoV-2 infection alter human male sex steroid hormones and semen quality?
...
Modeling Coronavirus Spike Protein Dynamics: Implications for Immunogenicity and Immune Escape
Modeling Coronavirus Spike Protein Dynamics: Implications for Immunogenicity and Immune Escape
ABSTRACT
The ongoing COVID-19 pandemic is a global public health emergency requiring urgent development of efficacious vaccines. While concentrated research efforts...

