Search engine for discovering works of Art, research articles, and books related to Art and Culture
ShareThis
Javascript must be enabled to continue!

Timesweeper: Accurately Identifying Selective Sweeps Using Population Genomic Time Series

View through CrossRef
ABSTRACT Despite decades of research, identifying selective sweeps, the genomic footprints of positive selection, remains a core problem in population genetics. Of the myriad methods that have been developed to tackle this task, few are designed to leverage the potential of genomic time-series data. This is because in most population genetic studies of natural populations only a single period of time can be sampled. Recent advancements in sequencing technology, including improvements in extracting and sequencing ancient DNA, have made repeated samplings of a population possible, allowing for more direct analysis of recent evolutionary dynamics. Serial sampling of organisms with shorter generation times has also become more feasible due to improvements in the cost and throughput of sequencing. With these advances in mind, here we present Timesweeper, a fast and accurate convolutional neural network-based tool for identifying selective sweeps in data consisting of multiple genomic samplings of a population over time. Timesweeper population genomic time-series data by first simulating training data under a demographic model appropriate for the data of interest, training a one-dimensional Convolutional Neural Network on said simulations, and inferring which polymorphisms in this serialized dataset were the direct target of a completed or ongoing selective sweep. We show that Timesweeper is accurate under multiple simulated demographic and sampling scenarios, identifies selected variants with high resolution, and estimates selection coefficients more accurately than existing methods. In sum, we show that more accurate inferences about natural selection are possible when genomic time-series data are available; such data will continue to proliferate in coming years due to both the sequencing of ancient samples and repeated samplings of extant populations with faster generation times, as well as experimentally evolved populations where time-series data are often generated. Methodological advances such as Timesweeper thus have the potential to help resolve the controversy over the role of positive selection in the genome. We provide Timesweeper as a Python package for use by the community.
Title: Timesweeper: Accurately Identifying Selective Sweeps Using Population Genomic Time Series
Description:
ABSTRACT Despite decades of research, identifying selective sweeps, the genomic footprints of positive selection, remains a core problem in population genetics.
Of the myriad methods that have been developed to tackle this task, few are designed to leverage the potential of genomic time-series data.
This is because in most population genetic studies of natural populations only a single period of time can be sampled.
Recent advancements in sequencing technology, including improvements in extracting and sequencing ancient DNA, have made repeated samplings of a population possible, allowing for more direct analysis of recent evolutionary dynamics.
Serial sampling of organisms with shorter generation times has also become more feasible due to improvements in the cost and throughput of sequencing.
With these advances in mind, here we present Timesweeper, a fast and accurate convolutional neural network-based tool for identifying selective sweeps in data consisting of multiple genomic samplings of a population over time.
Timesweeper population genomic time-series data by first simulating training data under a demographic model appropriate for the data of interest, training a one-dimensional Convolutional Neural Network on said simulations, and inferring which polymorphisms in this serialized dataset were the direct target of a completed or ongoing selective sweep.
We show that Timesweeper is accurate under multiple simulated demographic and sampling scenarios, identifies selected variants with high resolution, and estimates selection coefficients more accurately than existing methods.
In sum, we show that more accurate inferences about natural selection are possible when genomic time-series data are available; such data will continue to proliferate in coming years due to both the sequencing of ancient samples and repeated samplings of extant populations with faster generation times, as well as experimentally evolved populations where time-series data are often generated.
Methodological advances such as Timesweeper thus have the potential to help resolve the controversy over the role of positive selection in the genome.
We provide Timesweeper as a Python package for use by the community.

Related Results

Allelic gene conversion softens selective sweeps
Allelic gene conversion softens selective sweeps
Abstract The prominence of positive selection, in which beneficial mutations are favored by natural selection and rapidly increase in frequency, ...
Behavior-dependent spatial maps enable efficient theta phase coding
Behavior-dependent spatial maps enable efficient theta phase coding
Abstract Navigation through space involves learning and representing relationships between past, current and future locations. In mammals, this might rely on the hi...
VolcanoFinder: genomic scans for adaptive introgression
VolcanoFinder: genomic scans for adaptive introgression
AbstractRecent research shows that introgression between closely-related species is an important source of adaptive alleles for a wide range of taxa. Typically, detection of adapti...
Deciphering genomic signatures in different Indian milch cattle breeds
Deciphering genomic signatures in different Indian milch cattle breeds
In the present study, the population genomic data of different cattle breeds were explored to decipher the genomic regions affected due to selective events and reflected in the pro...
Repeated global adaptation across plant species
Repeated global adaptation across plant species
AbstractGlobal adaptation occurs when all populations of a species undergo selection toward a common optimum. This can occur by a hard selective sweep with the emergence of a new g...
diploS/HIC: an updated approach to classifying selective sweeps
diploS/HIC: an updated approach to classifying selective sweeps
Abstract Identifying selective sweeps in populations that have complex demographic histories remains a difficult problem in population genetics. ...
Accuracy and computational efficiency of genomic selection with high-density SNP and whole-genome sequence data.
Accuracy and computational efficiency of genomic selection with high-density SNP and whole-genome sequence data.
Abstract The prediction of complex or quantitative traits from single nucleotide polymorphism (SNP) genotypes has transformed livestock and plant breeding, and is also pl...

Back to Top