Javascript must be enabled to continue!
Predicting gene knockout effects from expression data
View through CrossRef
AbstractBackgroundThe study of gene essentiality, which measures the importance of a gene for cell division and survival, is used for the identification of cancer drug targets and understanding of tissue-specific manifestation of genetic conditions. In this work, we analyze essentiality and gene expression data from over 900 cancer lines from the DepMap project to create predictive models of gene essentiality.MethodsWe developed machine learning algorithms to identify those genes whose essentiality levels are explained by the expression of a small set of “modifier genes”. To identify these gene sets, we developed an ensemble of statistical tests capturing linear and non-linear dependencies. We trained several regression models predicting the essentiality of each target gene, and used an automated model selection procedure to identify the optimal model and hyperparameters. Overall, we examined linear models, gradient boosted trees, Gaussian process regression models, and deep learning networks.ResultsWe identified nearly 3000 genes for which we accurately predict essentiality using gene expression data of a small set of modifier genes. We show that both in the number of genes we successfully make predictions for, as well as in the prediction accuracy, our model outperforms current state-of-the-art works.ConclusionsOur modeling framework avoids overfitting by identifying the small set of modifier genes, which are of clinical and genetic importance, and ignores the expression of noisy and irrelevant genes. Doing so improves the accuracy of essentiality prediction in various conditions and provides interpretable models. Overall, we present an accurate computational approach, as well as interpretable modeling of essentiality in a wide range of cellular conditions, thus contributing to a better understanding of the molecular mechanisms that govern tissue-specific effects of genetic disease and cancer.
Springer Science and Business Media LLC
Title: Predicting gene knockout effects from expression data
Description:
AbstractBackgroundThe study of gene essentiality, which measures the importance of a gene for cell division and survival, is used for the identification of cancer drug targets and understanding of tissue-specific manifestation of genetic conditions.
In this work, we analyze essentiality and gene expression data from over 900 cancer lines from the DepMap project to create predictive models of gene essentiality.
MethodsWe developed machine learning algorithms to identify those genes whose essentiality levels are explained by the expression of a small set of “modifier genes”.
To identify these gene sets, we developed an ensemble of statistical tests capturing linear and non-linear dependencies.
We trained several regression models predicting the essentiality of each target gene, and used an automated model selection procedure to identify the optimal model and hyperparameters.
Overall, we examined linear models, gradient boosted trees, Gaussian process regression models, and deep learning networks.
ResultsWe identified nearly 3000 genes for which we accurately predict essentiality using gene expression data of a small set of modifier genes.
We show that both in the number of genes we successfully make predictions for, as well as in the prediction accuracy, our model outperforms current state-of-the-art works.
ConclusionsOur modeling framework avoids overfitting by identifying the small set of modifier genes, which are of clinical and genetic importance, and ignores the expression of noisy and irrelevant genes.
Doing so improves the accuracy of essentiality prediction in various conditions and provides interpretable models.
Overall, we present an accurate computational approach, as well as interpretable modeling of essentiality in a wide range of cellular conditions, thus contributing to a better understanding of the molecular mechanisms that govern tissue-specific effects of genetic disease and cancer.
Related Results
Expression and polymorphism of genes in gallstones
Expression and polymorphism of genes in gallstones
ABSTRACT
Through the method of clinical case control study, to explore the expression and genetic polymorphism of KLF14 gene (rs4731702 and rs972283) and SR-B1 gene (rs...
Development of the Multiple Gene Knockout System with One-Step PCR in Thermoacidophilic Crenarchaeon Sulfolobus acidocaldarius
Development of the Multiple Gene Knockout System with One-Step PCR in Thermoacidophilic Crenarchaeon Sulfolobus acidocaldarius
Multiple gene knockout systems developed in the thermoacidophilic crenarchaeon Sulfolobus acidocaldarius are powerful genetic tools. However, plasmid construction typically require...
Variants of the vitamin D receptor gene and the expression of microRNA‑21, microRNA‑125a, microRNA‑125b and microRNA‑214 in coronary heart disease
Variants of the vitamin D receptor gene and the expression of microRNA‑21, microRNA‑125a, microRNA‑125b and microRNA‑214 in coronary heart disease
Background. The protective effects of vitamin D in relation to atherogenesis are realized by vitamin D receptors (VDR). Variants rs10735810, rs731236, rs1544410 and rs797532 of the...
Abstract 1836: Global gene expression profiles from bladder tumor FFPE samples
Abstract 1836: Global gene expression profiles from bladder tumor FFPE samples
Abstract
Cancer is a disease characterized by uncontrolled cell growth and proliferation. Recent advances in molecular medicine and cancer biology have changed the w...
Abstract P1-05-23: Utilities and challenges of RNA-Seq based expression and variant calling in a clinical setting
Abstract P1-05-23: Utilities and challenges of RNA-Seq based expression and variant calling in a clinical setting
Abstract
Introduction
Variant calling based on DNA samples has been the gold standard of clinical testing since the advent of Sanger sequencing. The u...
Use of CRISPR/Cas9 Gene Editing Methods to Investigate the Mechanism of Trem2-Dependent Gene Expression in Macrophages
Use of CRISPR/Cas9 Gene Editing Methods to Investigate the Mechanism of Trem2-Dependent Gene Expression in Macrophages
Triggering Receptor Expressed on Myeloid Cells 2 (TREM2) is a surface receptor expressed in macrophages during tissue injury. This receptor plays a role in driving phagocytosis and...
Effect of testosterone on within-sex gene expression across 40 human tissues
Effect of testosterone on within-sex gene expression across 40 human tissues
Abstract
Background
Variations in testosterone levels is associated with pronounced health risks, often in a discordant manner between males and females. While studies hav...
Abstract 1841: Dissecting gene expression programs that define tumor aggression and patient outcome in pancreatic cancer
Abstract 1841: Dissecting gene expression programs that define tumor aggression and patient outcome in pancreatic cancer
Abstract
By the year 2020, pancreatic cancer (PDAC) is projected to be the second leading cause of cancer deaths in the United States. Current systemic therapies off...

