Javascript must be enabled to continue!
SPEARS: Standard Performance Evaluation of Ancestral haplotype Reconstruction through Simulation
View through CrossRef
Abstract
Motivation
Ancestral haplotype maps provide useful information about genomic variation and insights into biological processes. Reconstructing the descendent haplotype structure of homologous chromosomes, particularly for large numbers of individuals, can help with characterizing the recombination landscape, elucidating genotype-to-phenotype relationships, improving genomic predictions and more. Inferring haplotype maps from sparse genotype data is an efficient approach to whole-genome haplotyping, but this is a non-trivial problem. A standardized approach is needed to validate whether haplotype reconstruction software, conceived population designs and existing data for a given population provides accurate haplotype information for further inference.
Results
We introduce SPEARS, a pipeline for the simulation-based appraisal of genome-wide haplotype maps constructed from sparse genotype data. Using a specified pedigree, the pipeline generates virtual genotypes (known data) with genotyping errors and missing data structure. It then proceeds to mimic analysis in practice, capturing sources of error due to genotyping, imputation and haplotype inference. Standard metrics allow researchers to assess different population designs and which features of haplotype structure or regions of the genome are sufficiently accurate for analysis. Haplotype maps for 1000 outcross progeny from a multi-parent population of maize are used to demonstrate SPEARS.
Availabilityand implementation
SPEARS, the protocol and suite of scripts, are publicly available under an MIT license at GitHub (https://github.com/maizeatlas/spears).
Supplementary information
Supplementary data are available at Bioinformatics online.
Title: SPEARS: Standard Performance Evaluation of Ancestral haplotype Reconstruction through Simulation
Description:
Abstract
Motivation
Ancestral haplotype maps provide useful information about genomic variation and insights into biological processes.
Reconstructing the descendent haplotype structure of homologous chromosomes, particularly for large numbers of individuals, can help with characterizing the recombination landscape, elucidating genotype-to-phenotype relationships, improving genomic predictions and more.
Inferring haplotype maps from sparse genotype data is an efficient approach to whole-genome haplotyping, but this is a non-trivial problem.
A standardized approach is needed to validate whether haplotype reconstruction software, conceived population designs and existing data for a given population provides accurate haplotype information for further inference.
Results
We introduce SPEARS, a pipeline for the simulation-based appraisal of genome-wide haplotype maps constructed from sparse genotype data.
Using a specified pedigree, the pipeline generates virtual genotypes (known data) with genotyping errors and missing data structure.
It then proceeds to mimic analysis in practice, capturing sources of error due to genotyping, imputation and haplotype inference.
Standard metrics allow researchers to assess different population designs and which features of haplotype structure or regions of the genome are sufficiently accurate for analysis.
Haplotype maps for 1000 outcross progeny from a multi-parent population of maize are used to demonstrate SPEARS.
Availabilityand implementation
SPEARS, the protocol and suite of scripts, are publicly available under an MIT license at GitHub (https://github.
com/maizeatlas/spears).
Supplementary information
Supplementary data are available at Bioinformatics online.
Related Results
SPEARS: Standard Performance Evaluation of Ancestral Reconstruction through Simulation
SPEARS: Standard Performance Evaluation of Ancestral Reconstruction through Simulation
Motivation
Ancestral haplotype maps provide useful information about genomic variation and biological processes. Reconstructing the descendent haplotype structure o...
Haplotype Matching with GBWT for Pangenome Graphs
Haplotype Matching with GBWT for Pangenome Graphs
Traditionally, variations from a linear reference genome were used to represent large sets of haplotypes compactly. In the linear reference genome based paradigm, the positional Bu...
Ancestral sequence alignment under optimal conditions
Ancestral sequence alignment under optimal conditions
Abstract
Background
Multiple genome alignment is an important problem in bioinformatics. An important subproblem used by many multiple alignment app...
Non-Recommended Publishing Lists: Strategies for Detecting Deceitful Journals
Non-Recommended Publishing Lists: Strategies for Detecting Deceitful Journals
Abstract
The rapid growth of open access publishing (OAP) has significantly improved the accessibility and dissemination of scientific knowledge. However, this expansion has also c...
Haplotype Reconstruction in Connected Tetraploid F1 Populations
Haplotype Reconstruction in Connected Tetraploid F1 Populations
Abstract
In diploid species, many multi-parental populations have been developed to increase genetic diversity and quantitative trait loci (QTL) mapping resolution....
Abstract P6-09-10: Informational needs among women considering breast reconstruction post-mastectomy.
Abstract P6-09-10: Informational needs among women considering breast reconstruction post-mastectomy.
Abstract
For many women, the complexity of processing and learning about their breast cancer diagnosis is further complicated by decisions to be made about breast re...
mtDNA haplotype network analysis: Exploring genetic relationships and diversity in dog haplogroups
mtDNA haplotype network analysis: Exploring genetic relationships and diversity in dog haplogroups
The genetic diversity and relationships of dog haplogroups were studied by analyzing the HV1 region of mitochondrial DNA. Previous studies have found six distinct haplogroups (A, B...
Minimal Positional Substring Cover: A Haplotype Threading Alternative to Li & Stephens Model
Minimal Positional Substring Cover: A Haplotype Threading Alternative to Li & Stephens Model
Abstract
The Li & Stephens (LS) hidden Markov model (HMM) models the process of reconstructing a haplotype as a mosaic copy of haplotypes in ...

