Javascript must be enabled to continue!
A Deep Learning Framework for Predicting Human Essential Genes by Integrating Sequence and Functional data
View through CrossRef
ABSTRACTEssential genes are necessary to the survival or reproduction of a living organism. The prediction and analysis of gene essentiality can advance our understanding to basic life and human diseases, and further boost the development of new drugs. Wet lab methods for identifying essential genes are often costly, time consuming, and laborious. As a complement, computational methods have been proposed to predict essential genes by integrating multiple biological data sources. Most of these methods are evaluated on model organisms. However, prediction methods for human essential genes are still limited and the relationship between human gene essentiality and different biological information still needs to be explored. In addition, exploring suitable deep learning techniques to overcome the limitations of traditional machine learning methods and improve the prediction accuracy is also important and interesting. We propose a deep learning based method, DeepSF, to predict human essential genes. DeepSF integrates sequence features derived from DNA and protein sequence data with features extracted or learned from different types of functional data, such as gene ontology, protein complex, protein domain, and protein-protein interaction network. More than 200 features from these biological data are extracted/learned which are integrated together to train a cost-sensitive deep neural network by utilizing multiple deep leaning techniques. The experimental results of 10-fold cross validation show that DeepSF can accurately predict human gene essentiality with an average AUC of 95.17%, the area under precision-recall curve (auPRC) of 92.21%, the accuracy of 91.59%, and the F1 measure about 78.71%. In addition, the comparison experimental results show that DeepSF significantly outperforms several popular traditional machine learning models (SVM, Random Forest, and Adaboost), and performs slightly better than a recent deep learning model (DeepHE). We have demonstrated that the proposed method, DeepSF, is effective for predicting human essential genes. Deep learning techniques are promising at both feature learning and classification levels for the task of essential gene prediction.
Title: A Deep Learning Framework for Predicting Human Essential Genes by Integrating Sequence and Functional data
Description:
ABSTRACTEssential genes are necessary to the survival or reproduction of a living organism.
The prediction and analysis of gene essentiality can advance our understanding to basic life and human diseases, and further boost the development of new drugs.
Wet lab methods for identifying essential genes are often costly, time consuming, and laborious.
As a complement, computational methods have been proposed to predict essential genes by integrating multiple biological data sources.
Most of these methods are evaluated on model organisms.
However, prediction methods for human essential genes are still limited and the relationship between human gene essentiality and different biological information still needs to be explored.
In addition, exploring suitable deep learning techniques to overcome the limitations of traditional machine learning methods and improve the prediction accuracy is also important and interesting.
We propose a deep learning based method, DeepSF, to predict human essential genes.
DeepSF integrates sequence features derived from DNA and protein sequence data with features extracted or learned from different types of functional data, such as gene ontology, protein complex, protein domain, and protein-protein interaction network.
More than 200 features from these biological data are extracted/learned which are integrated together to train a cost-sensitive deep neural network by utilizing multiple deep leaning techniques.
The experimental results of 10-fold cross validation show that DeepSF can accurately predict human gene essentiality with an average AUC of 95.
17%, the area under precision-recall curve (auPRC) of 92.
21%, the accuracy of 91.
59%, and the F1 measure about 78.
71%.
In addition, the comparison experimental results show that DeepSF significantly outperforms several popular traditional machine learning models (SVM, Random Forest, and Adaboost), and performs slightly better than a recent deep learning model (DeepHE).
We have demonstrated that the proposed method, DeepSF, is effective for predicting human essential genes.
Deep learning techniques are promising at both feature learning and classification levels for the task of essential gene prediction.
Related Results
DeepHE: Accurately Predicting Human Essential Genes based on Deep Learning
DeepHE: Accurately Predicting Human Essential Genes based on Deep Learning
AbstractMotivationAccurately predicting essential genes using computational methods can greatly reduce the effort in finding them via wet experiments at both time and resource scal...
DeEPsnap: human essential gene prediction by integrating multi-omics data
DeEPsnap: human essential gene prediction by integrating multi-omics data
Abstract
Essential genes are necessary for the survival or reproduction of a living organism. The prediction and analysis of gene essentiality can advance our understanding...
DeEPsnap: human essential gene prediction by integrating multi-omics data
DeEPsnap: human essential gene prediction by integrating multi-omics data
AbstractEssential genes are necessary for the survival or reproduction of a living organism. The prediction and analysis of gene essentiality can advance our understanding of basic...
Deep convolutional neural network and IoT technology for healthcare
Deep convolutional neural network and IoT technology for healthcare
Background Deep Learning is an AI technology that trains computers to analyze data in an approach similar to the human brain. Deep learning algorithms can find complex patterns in ...
Predicting essential genes in fungal genomes
Predicting essential genes in fungal genomes
Essential genes are required for an organism's viability, and the ability to identify these genes in pathogens is crucial to directed drug development. Predicting essential genes t...
Modeling Functional Genetic Alteration in Cancer Reveals New Candidate Driver Genes
Modeling Functional Genetic Alteration in Cancer Reveals New Candidate Driver Genes
AbstractCompiling the catalogue of genes actively involved in tumorigenesis (known as cancer drivers) is an ongoing endeavor, with profound implications to the understanding of tum...
Initial Experience with Pediatrics Online Learning for Nonclinical Medical Students During the COVID-19 Pandemic
Initial Experience with Pediatrics Online Learning for Nonclinical Medical Students During the COVID-19 Pandemic
Abstract
Background: To minimize the risk of infection during the COVID-19 pandemic, the learning mode of universities in China has been adjusted, and the online learning o...
SO004IDENTIFICATION OF HUB GENES, FOCUSED ON CHEMOKINES AND CHEMOKINE RECEPTORS, ASSOCIATED WITH THE DECLINE OF RENAL FUCTION OF DIABETIC KIDNEY DISEASE BY WEIGHTED GENE CO-EXPRESSION NETWORK ANALYSIS
SO004IDENTIFICATION OF HUB GENES, FOCUSED ON CHEMOKINES AND CHEMOKINE RECEPTORS, ASSOCIATED WITH THE DECLINE OF RENAL FUCTION OF DIABETIC KIDNEY DISEASE BY WEIGHTED GENE CO-EXPRESSION NETWORK ANALYSIS
Abstract
Background and Aims
The fact that activation of the innate immune system and chronic inflammation are closely involved ...


