Javascript must be enabled to continue!
A likelihood-free estimator of population structure bridging admixture models and principal components analysis
View through CrossRef
Abstract
We introduce a simple and computationally efficient method for fitting the admixture model of genetic population structure, called
ALStructure
. The strategy of
ALStructure
is to first estimate the low-dimensional linear subspace of the population admixture components and then search for a model within this subspace that is consistent with the admixture model’s natural probabilistic constraints. Central to this strategy is the observation that all models belonging to this constrained space of solutions are risk-minimizing and have equal likelihood, rendering any additional optimization unnecessary. The low-dimensional linear subspace is estimated through a recently introduced principal components analysis method that is appropriate for genotype data, thereby providing a solution that has both principal components and probabilistic admixture interpretations. Our approach differs fundamentally from other existing methods for estimating admixture, which aim to fit the admixture model directly by searching for parameters that maximize the likelihood function or the posterior probability. We observe that
ALStructure
typically outperforms existing methods both in accuracy and computational speed under a wide array of simulated and real human genotype datasets. Throughout this work we emphasize that the admixture model is a special case of a much broader class of models for which algorithms similar to
ALStructure
may be successfully employed.
Title: A likelihood-free estimator of population structure bridging admixture models and principal components analysis
Description:
Abstract
We introduce a simple and computationally efficient method for fitting the admixture model of genetic population structure, called
ALStructure
.
The strategy of
ALStructure
is to first estimate the low-dimensional linear subspace of the population admixture components and then search for a model within this subspace that is consistent with the admixture model’s natural probabilistic constraints.
Central to this strategy is the observation that all models belonging to this constrained space of solutions are risk-minimizing and have equal likelihood, rendering any additional optimization unnecessary.
The low-dimensional linear subspace is estimated through a recently introduced principal components analysis method that is appropriate for genotype data, thereby providing a solution that has both principal components and probabilistic admixture interpretations.
Our approach differs fundamentally from other existing methods for estimating admixture, which aim to fit the admixture model directly by searching for parameters that maximize the likelihood function or the posterior probability.
We observe that
ALStructure
typically outperforms existing methods both in accuracy and computational speed under a wide array of simulated and real human genotype datasets.
Throughout this work we emphasize that the admixture model is a special case of a much broader class of models for which algorithms similar to
ALStructure
may be successfully employed.
Related Results
Generalized Estimator of Population Variance utilizing Auxiliary Information in Simple Random Sampling Scheme
Generalized Estimator of Population Variance utilizing Auxiliary Information in Simple Random Sampling Scheme
In this study, using the Simple Random Sampling without Replacement (SRSWOR) method, we propose a generalized estimator of population variance of the primary variable. Up to the fi...
Frequency of Common Chromosomal Abnormalities in Patients with Idiopathic Acquired Aplastic Anemia
Frequency of Common Chromosomal Abnormalities in Patients with Idiopathic Acquired Aplastic Anemia
Objective: To determine the frequency of common chromosomal aberrations in local population idiopathic determine the frequency of common chromosomal aberrations in local population...
Inference of recent admixture using genotype data
Inference of recent admixture using genotype data
Abstract
The inference of biogeographic ancestry (BGA) has become a focus of forensic genetics. Misinference of BGA can have profound unwanted consequences for inve...
Revealing the range of maximum likelihood estimates in the admixture model
Revealing the range of maximum likelihood estimates in the admixture model
Abstract
Many ancestry inference tools, including STRUCTURE and ADMIXTURE, rely on the admixture model to infer both, allele frequencies
...
Evaluation of Model Fit of Inferred Admixture Proportions
Evaluation of Model Fit of Inferred Admixture Proportions
Abstract
Model based methods for genetic clustering of individuals such as those implemented in
structure
or ...
A New Efficient Difference-Type Estimator for Estimating Population Mean using Dual Auxiliary Information under Non-Response
A New Efficient Difference-Type Estimator for Estimating Population Mean using Dual Auxiliary Information under Non-Response
In this paper, the problem of estimating the finite population mean by using dual auxiliary information under non-response. This paper proposed a difference-type estimator of popul...
Ecological genomics of divergence, admixture, and fitness in the smallmouth bass (Micropterus dolomieu)
Ecological genomics of divergence, admixture, and fitness in the smallmouth bass (Micropterus dolomieu)
The Smallmouth Bass (Micropterus dolomieu) is one of the most highly targeted sport fishes in the world. Anglers vie for the opportunity to catch Smallmouth Bass recreationally and...
Almost Unbiased Liu Estimator in Bell Regression Model
Almost Unbiased Liu Estimator in Bell Regression Model
Abstract
In this research, we propose a novel regression estimator as an alternative to the Liu estimator for addressing multicollinearity in the Bell regression model, ref...

