Search engine for discovering works of Art, research articles, and books related to Art and Culture
ShareThis
Javascript must be enabled to continue!

The Ancestry of a Sample of Sequences Subject to Recombination

View through CrossRef
Abstract In this article we discuss the ancestry of sequences sampled from the coalescent with recombination with constant population size 2N. We have studied a number of variables based on simulations of sample histories, and some analytical results are derived. Consider the leftmost nucleotide in the sequences. We show that the number of nucleotides sharing a most recent common ancestor (MRCA) with the leftmost nucleotide is ≈log(1 + 4N Lr)/4Nr when two sequences are compared, where L denotes sequence length in nucleotides, and r the recombination rate between any two neighboring nucleotides per generation. For larger samples, the number of nucleotides sharing MRCA with the leftmost nucleotide decreases and becomes almost independent of 4N Lr. Further, we show that a segment of the sequences sharing a MRCA consists in mean of 3/8Nr nucleotides, when two sequences are compared, and that this decreases toward 1/4Nr nucleotides when the whole population is sampled. A measure of the correlation between the genealogies of two nucleotides on two sequences is introduced. We show analytically that even when the nucleotides are separated by a large genetic distance, but share MRCA, the genealogies will show only little correlation. This is surprising, because the time until the two nucleotides shared MRCA is reciprocal to the genetic distance. Using simulations, the mean time until all positions in the sample have found a MRCA increases logarithmically with increasing sequence length and is considerably lower than a theoretically predicted upper bound. On the basis of simulations, it turns out that important properties of the coalescent with recombinations of the whole population are reflected in the properties of a sample of low size.
Oxford University Press (OUP)
Title: The Ancestry of a Sample of Sequences Subject to Recombination
Description:
Abstract In this article we discuss the ancestry of sequences sampled from the coalescent with recombination with constant population size 2N.
We have studied a number of variables based on simulations of sample histories, and some analytical results are derived.
Consider the leftmost nucleotide in the sequences.
We show that the number of nucleotides sharing a most recent common ancestor (MRCA) with the leftmost nucleotide is ≈log(1 + 4N Lr)/4Nr when two sequences are compared, where L denotes sequence length in nucleotides, and r the recombination rate between any two neighboring nucleotides per generation.
For larger samples, the number of nucleotides sharing MRCA with the leftmost nucleotide decreases and becomes almost independent of 4N Lr.
Further, we show that a segment of the sequences sharing a MRCA consists in mean of 3/8Nr nucleotides, when two sequences are compared, and that this decreases toward 1/4Nr nucleotides when the whole population is sampled.
A measure of the correlation between the genealogies of two nucleotides on two sequences is introduced.
We show analytically that even when the nucleotides are separated by a large genetic distance, but share MRCA, the genealogies will show only little correlation.
This is surprising, because the time until the two nucleotides shared MRCA is reciprocal to the genetic distance.
Using simulations, the mean time until all positions in the sample have found a MRCA increases logarithmically with increasing sequence length and is considerably lower than a theoretically predicted upper bound.
On the basis of simulations, it turns out that important properties of the coalescent with recombinations of the whole population are reflected in the properties of a sample of low size.

Related Results

Abstract 1166: Leveraging existing data to identify ancestry-associated features across multiple cancer types
Abstract 1166: Leveraging existing data to identify ancestry-associated features across multiple cancer types
Abstract People with different ancestries inherit different risks and encounter different environmental exposures resulting in different somatic profiles. A lack of ...
Tekstualni subjekt u poeziji Marije Stepanove od 2001. do 2017. godine
Tekstualni subjekt u poeziji Marije Stepanove od 2001. do 2017. godine
Maria Stepanova (b. 1972) is a contemporary Russian poet who has emerged in recent decades as one of the most original and complex voices on the poetically highly heterogeneous and...
Abstract 1599: Determining patient ancestry based on targeted tumor comprehensive genomic profiling
Abstract 1599: Determining patient ancestry based on targeted tumor comprehensive genomic profiling
Abstract Background. Cancer gene mutations exhibit mutation patterns of prevalence that vary across different ancestry groups. For example, EGFR variants are more fr...
Data from Blood Vitamin D Levels in Relation to Genetic Estimation of African Ancestry
Data from Blood Vitamin D Levels in Relation to Genetic Estimation of African Ancestry
<div>Abstract<p><b>Background:</b> African-Americans generally have lower circulating levels of 25 hydroxyvitamin D [25(OH)D] than Whites, attributed to ski...
Data from Blood Vitamin D Levels in Relation to Genetic Estimation of African Ancestry
Data from Blood Vitamin D Levels in Relation to Genetic Estimation of African Ancestry
<div>Abstract<p><b>Background:</b> African-Americans generally have lower circulating levels of 25 hydroxyvitamin D [25(OH)D] than Whites, attributed to ski...
Phylogenetic Classification of Feline Immunodeficiency Virus
Phylogenetic Classification of Feline Immunodeficiency Virus
Background: The feline immunodeficiency virus (FIV) is responsible for a retroviral disease that affects domestic and wild cats worldwide, causing Feline Acquired Immunodeficiency ...
Jk DNA GAGA MOTIFS ARE REQUIRED FOR LOCAL NUCLEOSOME REMODELING AND Vk-Jk RECOMBINATION
Jk DNA GAGA MOTIFS ARE REQUIRED FOR LOCAL NUCLEOSOME REMODELING AND Vk-Jk RECOMBINATION
Abstract Immunoreceptor gene recombination requires complementary 12 bp and 23 bp recombination signal sequences (RSSs). In addition, the RSSs th...

Back to Top