Search engine for discovering works of Art, research articles, and books related to Art and Culture
ShareThis
Javascript must be enabled to continue!

A Fast Lasso-Based Method for Inferring Higher-Order Interactions

View through CrossRef
AbstractLarge-scale genotype-phenotype screens provide a wealth of data for identifying molecular alterations associated with a phenotype. Epistatic effects play an important role in such association studies. For example, siRNA perturbation screens can be used to identify combinatorial gene-silencing effects. In bacteria, epistasis has practical consequences in determining antimicrobial resistance as the genetic background of a strain plays an important role in determining resistance. Recently developed tools scale to human exome-wide screens for pairwise interactions, but none to date have included the possibility of three-way interactions. Expanding upon recent state-of-the art methods, we make a number of improvements to the performance on large-scale data, making consideration of three-way interactions possible. We demonstrate our proposed method, Pint, on both simulated and real data sets, including antibiotic resistance testing and siRNA perturbation screens. Pint outperforms known methods in simulated data, and identifies a number of biologically plausible gene effects in both the antibiotic and siRNA models. For example, we have identified a combination of known tumor suppressor genes that is predicted (using Pint) to cause a significant increase in cell proliferation.Author SummaryIn recent years, large-scale genetic datasets have become available for analysis. These large datasets often stretch the limits of classic computational methods, requiring too much memory or simply taking a prohibitively long time to run. Due to the enormous number of potential interactions, each gene or variation in the data is often modeled on its own, without considering interactions between them. Recently, methods have been developed to solve regression problems that include these interacting effects. Even the fastest of these cannot include threeway interactions, however. We improve upon one such method, developing an approach that is significantly faster than the current state of the art. Moreover, our method scales to three-way interactions among thousands of genes, while avoiding a number of the limitations of previous approaches. We analyse large-scale simulated data, antibiotic resistance, and gene-silencing datasets to demonstrate the accuracy and performance of our approach.
Title: A Fast Lasso-Based Method for Inferring Higher-Order Interactions
Description:
AbstractLarge-scale genotype-phenotype screens provide a wealth of data for identifying molecular alterations associated with a phenotype.
Epistatic effects play an important role in such association studies.
For example, siRNA perturbation screens can be used to identify combinatorial gene-silencing effects.
In bacteria, epistasis has practical consequences in determining antimicrobial resistance as the genetic background of a strain plays an important role in determining resistance.
Recently developed tools scale to human exome-wide screens for pairwise interactions, but none to date have included the possibility of three-way interactions.
Expanding upon recent state-of-the art methods, we make a number of improvements to the performance on large-scale data, making consideration of three-way interactions possible.
We demonstrate our proposed method, Pint, on both simulated and real data sets, including antibiotic resistance testing and siRNA perturbation screens.
Pint outperforms known methods in simulated data, and identifies a number of biologically plausible gene effects in both the antibiotic and siRNA models.
For example, we have identified a combination of known tumor suppressor genes that is predicted (using Pint) to cause a significant increase in cell proliferation.
Author SummaryIn recent years, large-scale genetic datasets have become available for analysis.
These large datasets often stretch the limits of classic computational methods, requiring too much memory or simply taking a prohibitively long time to run.
Due to the enormous number of potential interactions, each gene or variation in the data is often modeled on its own, without considering interactions between them.
Recently, methods have been developed to solve regression problems that include these interacting effects.
Even the fastest of these cannot include threeway interactions, however.
We improve upon one such method, developing an approach that is significantly faster than the current state of the art.
Moreover, our method scales to three-way interactions among thousands of genes, while avoiding a number of the limitations of previous approaches.
We analyse large-scale simulated data, antibiotic resistance, and gene-silencing datasets to demonstrate the accuracy and performance of our approach.

Related Results

Seagull: lasso, group lasso and sparse-group lasso regularization for linear regression models via proximal gradient descent
Seagull: lasso, group lasso and sparse-group lasso regularization for linear regression models via proximal gradient descent
Abstract Background Statistical analyses of biological problems in life sciences often lead to high-dimensional linear models. To solve the corresponding system of equations, penal...
seagull: lasso, group lasso and sparse-group lasso regularisation for linear regression models via proximal gradient descent
seagull: lasso, group lasso and sparse-group lasso regularisation for linear regression models via proximal gradient descent
SummaryStatistical analyses of biological problems in life sciences often lead to high-dimensional linear models. To solve the corresponding system of equations, penalisation appro...
Fast Fourier Transforms in Electromagnetics
Fast Fourier Transforms in Electromagnetics
This Chapter review the fast Fourier transform (FFT) technique and its application to computational electromagnetics, especially to the fast solver algorithms including the Conjuga...
Kernel-lasso feature expansion method: boosting the prediction ability of machine learning in heart attack
Kernel-lasso feature expansion method: boosting the prediction ability of machine learning in heart attack
Abstract Heart attack needs to cause a high degree in modern society, which is now at the top of the list of most common diseases. One-third of all deaths in the wor...
IMPLEMENTATION OF GEOGRAPHICALLY WEIGHTED LASSO (GWL) IN ANALYZING RICE PRODUCTION FACTORS IN INDONESIA
IMPLEMENTATION OF GEOGRAPHICALLY WEIGHTED LASSO (GWL) IN ANALYZING RICE PRODUCTION FACTORS IN INDONESIA
Geographically Weighted Lasso (GWL) is a combination of two regression methods, namely Geographically Weighted Regression (GWR) and Least Absolute Shringkage Selection Operator (LA...
Enhanced Fast Rerouting Mechanisms for Protected Traffic in MPLS Networks
Enhanced Fast Rerouting Mechanisms for Protected Traffic in MPLS Networks
Multiprotocol Label Switching (MPLS) fuses the intelligence of routing with the performance of switching and provides significant benefits to networks with a pure IP architecture a...
Transgenerational coexistence history attenuates negative direct interactions and strengthens facilitation
Transgenerational coexistence history attenuates negative direct interactions and strengthens facilitation
AbstractBackgroundInteractions among species are a fundamental aspect of biodiversity and driving ecosystem functioning and services. Species interactions include direct (pairwise)...

Back to Top