Javascript must be enabled to continue!
Discovering potential key features of genome wide profiling data using Decision Variable Analysis
View through CrossRef
The identification of key features related to phenotype of interest
(POI) from high dimensional data has been one of the important issues
for omics-data study, such as transcriptome or DNA methylome data.
However, these data are commonly contaminated by sources of unwanted
variation caused by platforms, batches or other types of biological
factors. Thus, the data can be considered as a combination of variation
derived from POI and other confounding factors. Not taking into
consideration for these factors could lead to spurious associations and
missing important signals. Based on this idea, we propose a novel
feature selection method called Decision Variable Analysis (DVA) to
extract the important features related to POI from the data containing
potential confounding factors. Using this method on the simulated data
and real data, respectively, we found DVA performed better in
identifying confounding factors comparing to other methods, including
linear regression and surrogate variable analysis. Especially, our
method is more efficient for the data in which there are much more
feature number than sample size. We show improvements of DVA across
high-dimensional datasets with smaller samples size compared to feature
number on different platforms. The results indicate that DVA is an
effective method to dissect sources of variation for omics-data with
potential confounding factors. DVA is freely available for use at
[https://github.com/xvon1/DVA](https://github.com/xvon1/DVA).
Title: Discovering potential key features of genome wide profiling data using Decision Variable Analysis
Description:
The identification of key features related to phenotype of interest
(POI) from high dimensional data has been one of the important issues
for omics-data study, such as transcriptome or DNA methylome data.
However, these data are commonly contaminated by sources of unwanted
variation caused by platforms, batches or other types of biological
factors.
Thus, the data can be considered as a combination of variation
derived from POI and other confounding factors.
Not taking into
consideration for these factors could lead to spurious associations and
missing important signals.
Based on this idea, we propose a novel
feature selection method called Decision Variable Analysis (DVA) to
extract the important features related to POI from the data containing
potential confounding factors.
Using this method on the simulated data
and real data, respectively, we found DVA performed better in
identifying confounding factors comparing to other methods, including
linear regression and surrogate variable analysis.
Especially, our
method is more efficient for the data in which there are much more
feature number than sample size.
We show improvements of DVA across
high-dimensional datasets with smaller samples size compared to feature
number on different platforms.
The results indicate that DVA is an
effective method to dissect sources of variation for omics-data with
potential confounding factors.
DVA is freely available for use at
[https://github.
com/xvon1/DVA](https://github.
com/xvon1/DVA).
Related Results
Autonomy on Trial
Autonomy on Trial
Photo by CHUTTERSNAP on Unsplash
Abstract
This paper critically examines how US bioethics and health law conceptualize patient autonomy, contrasting the rights-based, individualist...
Optimising primary molecular profiling in NSCLC
Optimising primary molecular profiling in NSCLC
AbstractIntroductionMolecular profiling of NSCLC is essential for optimising treatment decisions, but often incomplete. We assessed the efficacy of protocolised molecular profiling...
The Effect of Product Quality and Service Quality on Customer Satisfaction at SLV Room Boutique
The Effect of Product Quality and Service Quality on Customer Satisfaction at SLV Room Boutique
The purpose of the study was to determine the effect of product quality and service quality on customer satisfaction at the SLV Room Boutique. The population in this study were con...
The relationship between money supply and inflation: analysis with PANELVAR approach
The relationship between money supply and inflation: analysis with PANELVAR approach
Purpose- Central banks serve as institutions responsible for executing monetary policy in countries, with the primary objective of managing the money supply and ensuring price stab...
Genetic Variation of Adenoviruses Causing Conjunctivitis
Genetic Variation of Adenoviruses Causing Conjunctivitis
Human adenovirus type 8 (HAdV-8) is the most common causative agent of a highly contagious eye disease known as epidemic keratoconjunctivitis (EKC). HAdV-8 strains have been classi...
Evolution of the core and pan-genome of Streptococcus: positive selection, recombination, and genome composition
Evolution of the core and pan-genome of Streptococcus: positive selection, recombination, and genome composition
Abstract
Background
The genus Streptococcus is one of the most diverse and important human and agricultural pathogens. This study em...
Dynamic information aggregation decision-making methods based on variable precision rough set and grey clustering
Dynamic information aggregation decision-making methods based on variable precision rough set and grey clustering
Purpose
– The purpose of this paper is to construct a dynamic information aggregation decision-making model based on variable precision rough set.
...
IHPP: An In-Depth Profiling Tool for Advanced Performance Analysis
IHPP: An In-Depth Profiling Tool for Advanced Performance Analysis
IHPP (In-Depth High-Precision Profiler) is a special- ized profiling tool designed to provide detailed performance analysis and insights into program execution. It addresses limita...

