Search engine for discovering works of Art, research articles, and books related to Art and Culture
ShareThis
Javascript must be enabled to continue!

A Fast Lasso-Based Method for Inferring Higher-Order Interactions

View through CrossRef
A bstract Large-scale genotype-phenotype screens provide a wealth of data for identifying molecular alterations associated with a phenotype. Epistatic effects play an important role in such association studies. For example, siRNA perturbation screens can be used to identify combinatorial gene-silencing effects. In bacteria, epistasis has practical consequences in determining antimicrobial resistance as the genetic background of a strain plays an important role in determining resistance. Recently developed tools scale to human exome-wide screens for pairwise interactions, but none to date have included the possibility of three-way interactions. Expanding upon recent state-of-the art methods, we make a number of improvements to the performance on large-scale data, making consideration of three-way interactions possible. We demonstrate our proposed method, Pint , on both simulated and real data sets, including antibiotic resistance testing and siRNA perturbation screens. Pint outperforms known methods in simulated data, and identifies a number of biologically plausible gene effects in both the antibiotic and siRNA models. For example, we have identified a combination of known tumor suppressor genes that is predicted (using Pint ) to cause a significant increase in cell proliferation. A uthor S ummary In recent years, large-scale genetic datasets have become available for analysis. These large datasets often stretch the limits of classic computational methods, requiring too much memory or simply taking a prohibitively long time to run. Due to the enormous number of potential interactions, each gene or variation in the data is often modeled on its own, without considering interactions between them. Recently, methods have been developed to solve regression problems that include these interacting effects. Even the fastest of these cannot include threeway interactions, however. We improve upon one such method, developing an approach that is significantly faster than the current state of the art. Moreover, our method scales to three-way interactions among thousands of genes, while avoiding a number of the limitations of previous approaches. We analyse large-scale simulated data, antibiotic resistance, and gene-silencing datasets to demonstrate the accuracy and performance of our approach.
Title: A Fast Lasso-Based Method for Inferring Higher-Order Interactions
Description:
A bstract Large-scale genotype-phenotype screens provide a wealth of data for identifying molecular alterations associated with a phenotype.
Epistatic effects play an important role in such association studies.
For example, siRNA perturbation screens can be used to identify combinatorial gene-silencing effects.
In bacteria, epistasis has practical consequences in determining antimicrobial resistance as the genetic background of a strain plays an important role in determining resistance.
Recently developed tools scale to human exome-wide screens for pairwise interactions, but none to date have included the possibility of three-way interactions.
Expanding upon recent state-of-the art methods, we make a number of improvements to the performance on large-scale data, making consideration of three-way interactions possible.
We demonstrate our proposed method, Pint , on both simulated and real data sets, including antibiotic resistance testing and siRNA perturbation screens.
Pint outperforms known methods in simulated data, and identifies a number of biologically plausible gene effects in both the antibiotic and siRNA models.
For example, we have identified a combination of known tumor suppressor genes that is predicted (using Pint ) to cause a significant increase in cell proliferation.
A uthor S ummary In recent years, large-scale genetic datasets have become available for analysis.
These large datasets often stretch the limits of classic computational methods, requiring too much memory or simply taking a prohibitively long time to run.
Due to the enormous number of potential interactions, each gene or variation in the data is often modeled on its own, without considering interactions between them.
Recently, methods have been developed to solve regression problems that include these interacting effects.
Even the fastest of these cannot include threeway interactions, however.
We improve upon one such method, developing an approach that is significantly faster than the current state of the art.
Moreover, our method scales to three-way interactions among thousands of genes, while avoiding a number of the limitations of previous approaches.
We analyse large-scale simulated data, antibiotic resistance, and gene-silencing datasets to demonstrate the accuracy and performance of our approach.

Related Results

Seagull: lasso, group lasso and sparse-group lasso regularization for linear regression models via proximal gradient descent
Seagull: lasso, group lasso and sparse-group lasso regularization for linear regression models via proximal gradient descent
Abstract Background Statistical analyses of biological problems in life sciences often lead to high-dimensional linear models. To solve the corresponding system of equations, penal...
Méthodes quasi-Monte Carlo et Monte Carlo : application aux calculs des estimateurs Lasso et Lasso bayésien
Méthodes quasi-Monte Carlo et Monte Carlo : application aux calculs des estimateurs Lasso et Lasso bayésien
La thèse contient 6 chapitres. Le premier chapitre contient une introduction à la régression linéaire et aux problèmes Lasso et Lasso bayésien. Le chapitre 2 rappelle les algorithm...
Canal-LASSO: A sparse noise-resilient online linear regression model
Canal-LASSO: A sparse noise-resilient online linear regression model
Least absolute shrinkage and selection operator (LASSO) is one of the most commonly used methods for shrinkage estimation and variable selection. Robust variable selection methods ...
Variable Selection using Lasso Regression
Variable Selection using Lasso Regression
This study employs Lasso regression to analyze highdimensional genetic data for predicting flowering time in maize, specifically Days to Anthesis (DtoA). Lasso, or Least Absolute S...
A Smoothed LASSO Based DNN Sparsification Technique
A Smoothed LASSO Based DNN Sparsification Technique
Deep Neural Networks (DNNs) are increasingly being used in a variety of applications. However, DNNs have huge computational and memory requirements. One way to reduce these require...
A Smoothed LASSO Based DNN Sparsification Technique
A Smoothed LASSO Based DNN Sparsification Technique
<div>Deep Neural Networks (DNNs) are increasingly being used in a variety of applications. However, DNNs have huge computational and memory requirements. One way to reduce th...
Approches pénalisées pour les analyses en sous-groupes : application en épidémiologie
Approches pénalisées pour les analyses en sous-groupes : application en épidémiologie
Dans un contexte où les cancers et l'insécurité routière font partie des principales causes de décès en France et dans le monde, chercher à en étudier les risques et aider à la pri...

Back to Top