Javascript must be enabled to continue!
Evaluation of Model Fit of Inferred Admixture Proportions
View through CrossRef
Abstract
Model based methods for genetic clustering of individuals such as those implemented in
structure
or ADMIXTURE allow to infer individual ancestries and study population structure. The underlying model makes several assumptions about the demographic history that shaped the analysed genetic data. One assumption is that all individuals are a result of K homogeneous ancestral populations that are all well represented in the data, while another assumption is that no drift happened after the admixture event. The histories of many real world populations do not conform to that model, and in that case taking the inferred admixture proportions at face value might be misleading. We propose a method to evaluate the fit of admixture models based on estimating the correlation of the residual difference between the true genotypes and the genotypes predicted by the model. When the model assumptions are not violated, the residuals from a pair of individuals are not correlated. In case of a bad fit, individuals with similar demographic histories have a positive correlation of their residuals. Using simulated and real data, we show how the method is able to detect a bad fit of inferred admixture proportions due to using an insufficient number of clusters K or to demographic histories that deviate significantly from the admixture model assumptions, such as admixture from ghost populations, drift after admixture events and non-discrete ancestral populations. We have implemented the method as an open source software that can be applied to both unphased genotypes and next generation sequencing data.
Title: Evaluation of Model Fit of Inferred Admixture Proportions
Description:
Abstract
Model based methods for genetic clustering of individuals such as those implemented in
structure
or ADMIXTURE allow to infer individual ancestries and study population structure.
The underlying model makes several assumptions about the demographic history that shaped the analysed genetic data.
One assumption is that all individuals are a result of K homogeneous ancestral populations that are all well represented in the data, while another assumption is that no drift happened after the admixture event.
The histories of many real world populations do not conform to that model, and in that case taking the inferred admixture proportions at face value might be misleading.
We propose a method to evaluate the fit of admixture models based on estimating the correlation of the residual difference between the true genotypes and the genotypes predicted by the model.
When the model assumptions are not violated, the residuals from a pair of individuals are not correlated.
In case of a bad fit, individuals with similar demographic histories have a positive correlation of their residuals.
Using simulated and real data, we show how the method is able to detect a bad fit of inferred admixture proportions due to using an insufficient number of clusters K or to demographic histories that deviate significantly from the admixture model assumptions, such as admixture from ghost populations, drift after admixture events and non-discrete ancestral populations.
We have implemented the method as an open source software that can be applied to both unphased genotypes and next generation sequencing data.
Related Results
Study on Engineering Properties of Concrete Containing Marble Powder as Admixture
Study on Engineering Properties of Concrete Containing Marble Powder as Admixture
The construction industry relies heavily on concrete for its operations in the development of houses and other infrastructural facilities due to its structural stability and streng...
Origin and Evolutionary History of the Malagasy
Origin and Evolutionary History of the Malagasy
Abstract
The uniqueness of Malagasy people comes from a balanced admixture between deep‐rooted branches of the human evolutionary history, the S...
Overview of Admixture Mapping
Overview of Admixture Mapping
AbstractAdmixture mapping is a powerful method of gene mapping for diseases or traits that show differential risk by ancestry. Admixture mapping has been applied most often to Amer...
Local Fit Evaluation of Structural Equation Models Using Graphical Criteria
Local Fit Evaluation of Structural Equation Models Using Graphical Criteria
Evaluation of model fit is critically important for every structural equation model and sophisticated methods have been developed for this task. Among them are the χ2 goodness-of-f...
Person-work Environment Fit Perceptions on employee performance in Civil Service Sector employees in Addis Ababa and Dire Dawa
Person-work Environment Fit Perceptions on employee performance in Civil Service Sector employees in Addis Ababa and Dire Dawa
Scholars of organizational behaviour have long been interested in understanding the interactions between employees and their environments, and how these interactions can influence ...
Admixture mapping to identify breast cancer susceptibility loci in African American women.
Admixture mapping to identify breast cancer susceptibility loci in African American women.
Abstract
Abstract #2090
Background: The incidence of breast cancer in young women is higher in African American (AAW) compared to Caucasian (CW) and i...
Pengaruh Person-job Fit terhadap Komitmen Organisasi pada Karyawan Ritel Kota Bandung
Pengaruh Person-job Fit terhadap Komitmen Organisasi pada Karyawan Ritel Kota Bandung
Abstract. This study aims to examine the influence of person-job fit on organizational commitment among retail employees in Bandung City. The background of this research is the hig...
Ancestry-informative markers for African Americans based on the Affymetrix Pan-African genotyping array
Ancestry-informative markers for African Americans based on the Affymetrix Pan-African genotyping array
Genetic admixture has been utilized as a tool for identifying loci associated with complex traits and diseases in recently admixed populations such as African Americans. In particu...

