Javascript must be enabled to continue!
The signal of admixture can decay rapidly when using clustering-based methods
View through CrossRef
Gene flow shapes evolutionary trajectories by introducing novel alleles, facilitating or retarding adaptation, or eroding divergence among populations. Studies commonly infer gene flow through estimates of genetic admixture from methods that cluster individuals by allele-frequency similarity. However, the ability of these methods to reliably detect historical admixture remains poorly understood, particularly under spatially restricted dispersal and across a range of migration rates. Here we evaluate how signals of admixture arise and decay during and after gene flow across a range of within- and between-population dispersal rates, using three common inference methods: ADMIXTURE, sNMF, and PopCluster. Using forward-time simulations in a two-dimensional landscape, we modelled two metapopulations that diverged in isolation, experienced a pulse of secondary contact, and subsequently returned to isolation. All three methods systematically underestimated admixture, with the bias increasing once gene flow ceased: inferred admixture averaged 56% of the true value at the end of gene flow, declined to 23% after 1,000 generations, and fell below 5% by 2,000 generations, even where true admixture exceeded 20%. This underestimation was accompanied by corresponding overestimation of barrier strength, assessed using a metric analogous to cline width. Both patterns reflect the rapid re-establishment of Hardy–Weinberg and linkage equilibrium following secondary contact, which erodes the allele-frequency differences upon which these methods depend. Consequently, the absence of inferred admixture in present-day samples should not be taken as evidence against historical gene flow.
Title: The signal of admixture can decay rapidly when using clustering-based methods
Description:
Gene flow shapes evolutionary trajectories by introducing novel alleles, facilitating or retarding adaptation, or eroding divergence among populations.
Studies commonly infer gene flow through estimates of genetic admixture from methods that cluster individuals by allele-frequency similarity.
However, the ability of these methods to reliably detect historical admixture remains poorly understood, particularly under spatially restricted dispersal and across a range of migration rates.
Here we evaluate how signals of admixture arise and decay during and after gene flow across a range of within- and between-population dispersal rates, using three common inference methods: ADMIXTURE, sNMF, and PopCluster.
Using forward-time simulations in a two-dimensional landscape, we modelled two metapopulations that diverged in isolation, experienced a pulse of secondary contact, and subsequently returned to isolation.
All three methods systematically underestimated admixture, with the bias increasing once gene flow ceased: inferred admixture averaged 56% of the true value at the end of gene flow, declined to 23% after 1,000 generations, and fell below 5% by 2,000 generations, even where true admixture exceeded 20%.
This underestimation was accompanied by corresponding overestimation of barrier strength, assessed using a metric analogous to cline width.
Both patterns reflect the rapid re-establishment of Hardy–Weinberg and linkage equilibrium following secondary contact, which erodes the allele-frequency differences upon which these methods depend.
Consequently, the absence of inferred admixture in present-day samples should not be taken as evidence against historical gene flow.
Related Results
Inference of recent admixture using genotype data
Inference of recent admixture using genotype data
Abstract
The inference of biogeographic ancestry (BGA) has become a focus of forensic genetics. Misinference of BGA can have profound unwanted consequences for inve...
Axisymmetric antidynamo theorems in compressible non-uniform conducting fluids
Axisymmetric antidynamo theorems in compressible non-uniform conducting fluids
Abstract
This paper considers magnetic fields generated by dynamo action in electrically conducting fluids under axisymmetric but otherwise very general non-stead...
Evaluation of decay times in coupled spaces: Bayesian decay model selection
Evaluation of decay times in coupled spaces: Bayesian decay model selection
This paper applies Bayesian probability theory to determination of the decay times in coupled spaces. A previous paper [N. Xiang and P. M. Goggans, J. Acoust. Soc. Am. 110, 1415–14...
The Kernel Rough K-Means Algorithm
The Kernel Rough K-Means Algorithm
Background:
Clustering is one of the most important data mining methods. The k-means
(c-means ) and its derivative methods are the hotspot in the field of clustering research in re...
Evaluation of Model Fit of Inferred Admixture Proportions
Evaluation of Model Fit of Inferred Admixture Proportions
Abstract
Model based methods for genetic clustering of individuals such as those implemented in
structure
or ...
Genetic ancestry, admixture, and population structure in rural Dominica
Genetic ancestry, admixture, and population structure in rural Dominica
The Caribbean is a genetically diverse region with heterogeneous admixture compositions influenced by local island ecologies, migrations, colonial conflicts, and demographic histor...
Research on a microseismic signal picking algorithm based on GTOA clustering
Research on a microseismic signal picking algorithm based on GTOA clustering
Abstract. Clustering is one of the challenging problems in machine learning. Adopting clustering methods for the picking of microseismic signals has emerged as a new approach. Howe...
Loter: A software package to infer local ancestry for a wide range of species
Loter: A software package to infer local ancestry for a wide range of species
Abstract
Admixture between populations provides opportunity to study biological adaptation and phenotypic variation. Admixture studies rely on lo...

