A massively parallel strategy for STR marker development, capture, and genotyping
Abstract
Short tandem repeat (STR, or microsatellite) variants are highly polymorphic markers that facilitate powerful, high-precision population genetic analyses. STRs are especially valuable in conservation and ecological genetic research, yielding detailed information on population structure and short-term demographic flux. However, STR marker development and analysis by conventional PCR-based methods impose a workflow bottleneck and are suboptimal for noninvasive sampling strategies such as fecal DNA recovery. While massively parallel sequencing has not previously been leveraged for scalable, efficient STR recovery, here we present a pipeline for developing STR markers directly from high-throughput shotgun sequencing data without requiring a reference genome assembly, and a methodological approach for highly parallel recovery of enriched STR loci. We first employed our approach to design and capture a panel of 5,000 STR loci from a test group of diademed sifakas (Propithecus diadema, n=3), endangered Malagasy rainforest lemurs, and we report extremely efficient recovery of targeted loci: 97.3-99.6% of STRs were characterized with ≥10x non-redundant coverage. Second, we tested our STR capture strategy on a P. diadema fecal DNA preparation, and we report robust initial results and methodological suggestions for future implementations. In addition to STR targets, this approach also generates large, genome-wide single nucleotide polymorphism (SNP) panels from regions flanking the STR loci. Our method provides a cost-effective and highly scalable solution for rapid recovery of large STR and SNP datasets in any species without need for a reference genome, and it can be used even with suboptimal DNA, which is more easily acquired in conservation and ecological genetic studies.
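As a rough illustration of what identifying STRs directly in shotgun reads can look like, the sketch below scans a single read for tandem repeats with a regular expression. It is not the BaitSTR algorithm; the function name `find_strs`, the 2-6 bp motif range, and the five-copy minimum are assumptions chosen only for this example.

```python
import re


def find_strs(read, min_unit=2, max_unit=6, min_repeats=5):
    """Return (start, motif, copies) for each tandem repeat found in `read`.

    Hypothetical helper for illustration only; thresholds are assumptions,
    not values taken from the paper or from BaitSTR.
    """
    # Group 1 is the shortest candidate motif; the backreference \1 requires
    # at least (min_repeats - 1) further copies immediately after it.
    pattern = re.compile(
        r"([ACGT]{%d,%d}?)\1{%d,}" % (min_unit, max_unit, min_repeats - 1)
    )
    hits = []
    for match in pattern.finditer(read):
        motif = match.group(1)
        # Skip homopolymer runs such as "AAAAAAAAAA", which would otherwise
        # be reported with the motif "AA".
        if len(set(motif)) == 1:
            continue
        copies = len(match.group(0)) // len(motif)
        hits.append((match.start(), motif, copies))
    return hits


if __name__ == "__main__":
    read = "TTG" + "AC" * 6 + "GGT"
    for start, motif, copies in find_strs(read):
        print(start, motif, copies)
```

A real marker-development pipeline would also need to handle sequencing errors, interrupted repeats, and enough flanking sequence to design capture baits, none of which this sketch attempts.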
Data Deposition
Raw sequencing data are available under Study Accession numbers SRP073167 (genomic shotgun data for Oberon and Tatiana) and SRP076225 (targeted re-sequencing data) from the NCBI Sequence Read Archive. BaitSTR software is available at GitHub (core BaitSTR programs: https://github.com/aakrosh/BaitSTR; BaitSTR_type.pl companion script for genotyping and block manipulation: https://github.com/lkistler/BaitSTR_type).

