Javascript must be enabled to continue!
rDNAcaller: a fast and robust pipeline to call ribosomal DNA variants
View through CrossRef
Ribosomal DNA (rDNA) genes are essential components of the ribosome, organized as tandem repeats on the mammalian genomes. Extensive genetic and copy number variation in rDNA has been reported both within and between individuals, contributing to phenotypic diversity. However, previous rDNA variant calling strategies have relied on methods designed for diploid regions and have not been systematically benchmarked. With the recent availability of a telomere-to-telomere (T2T) genome assembly, rDNA regions have been fully assembled for the first time, enabling benchmarking and optimization of rDNA variant calling strategies. We developed a customized simulator that replicates real intra- and inter-individual rDNA variation based on the T2T assembly to benchmark the performance of commonly used variant callers, including GATK, Mutect2, and Lofreq. Additionally, we optimized the preprocessing and mapping steps to remove pseudogenes and significantly improve accuracy. Based on these optimizations, we introduce rDNAcaller, a novel pipeline for accurate rDNA variant detection using short-read whole-genome sequencing. rDNAcaller integrates optimized preprocessing with the top-performing variant caller and achieves 94% precision on experimental data from the T2T cell line. Applying our pipeline to data from the 1000 Genomes Project, we identify 5,607 novel rDNA variant positions across human populations, with African individuals showing the highest number of variants. Overall, rDNAcaller is a robust and versatile tool for analyzing rDNA variation, addressing the limitations of existing methods in handling high ploidies. By enabling accurate detection of rDNA variants, it facilitates deeper exploration of rDNA's role in phenotypic diversity, supporting future genomic studies and broadening our understanding of rDNA biology in health and disease.
Cold Spring Harbor Laboratory
Title: rDNAcaller: a fast and robust pipeline to call ribosomal DNA variants
Description:
Ribosomal DNA (rDNA) genes are essential components of the ribosome, organized as tandem repeats on the mammalian genomes.
Extensive genetic and copy number variation in rDNA has been reported both within and between individuals, contributing to phenotypic diversity.
However, previous rDNA variant calling strategies have relied on methods designed for diploid regions and have not been systematically benchmarked.
With the recent availability of a telomere-to-telomere (T2T) genome assembly, rDNA regions have been fully assembled for the first time, enabling benchmarking and optimization of rDNA variant calling strategies.
We developed a customized simulator that replicates real intra- and inter-individual rDNA variation based on the T2T assembly to benchmark the performance of commonly used variant callers, including GATK, Mutect2, and Lofreq.
Additionally, we optimized the preprocessing and mapping steps to remove pseudogenes and significantly improve accuracy.
Based on these optimizations, we introduce rDNAcaller, a novel pipeline for accurate rDNA variant detection using short-read whole-genome sequencing.
rDNAcaller integrates optimized preprocessing with the top-performing variant caller and achieves 94% precision on experimental data from the T2T cell line.
Applying our pipeline to data from the 1000 Genomes Project, we identify 5,607 novel rDNA variant positions across human populations, with African individuals showing the highest number of variants.
Overall, rDNAcaller is a robust and versatile tool for analyzing rDNA variation, addressing the limitations of existing methods in handling high ploidies.
By enabling accurate detection of rDNA variants, it facilitates deeper exploration of rDNA's role in phenotypic diversity, supporting future genomic studies and broadening our understanding of rDNA biology in health and disease.
Related Results
Installation Analysis of Matterhorn Pipeline Replacement
Installation Analysis of Matterhorn Pipeline Replacement
Abstract
The paper describes the installation analysis for the Matterhorn field pipeline replacement, located in water depths between 800-ft to 1200-ft in the Gul...
Genome wide hypomethylation and youth-associated DNA gap reduction promoting DNA damage and senescence-associated pathogenesis
Genome wide hypomethylation and youth-associated DNA gap reduction promoting DNA damage and senescence-associated pathogenesis
Abstract
Background: Age-associated epigenetic alteration is the underlying cause of DNA damage in aging cells. Two types of youth-associated DNA-protection epigenetic mark...
Echinococcus granulosus in Environmental Samples: A Cross-Sectional Molecular Study
Echinococcus granulosus in Environmental Samples: A Cross-Sectional Molecular Study
Abstract
Introduction
Echinococcosis, caused by tapeworms of the Echinococcus genus, remains a significant zoonotic disease globally. The disease is particularly prevalent in areas...
A Fluid-pipe-soil Approach to Stability Design of Submarine Pipelines
A Fluid-pipe-soil Approach to Stability Design of Submarine Pipelines
Abstract
The conventional approach to submarine pipeline stability design considers interactions between water and pipeline (fluid-pipe) and pipeline and seabed (...
Pipeline Resistance
Pipeline Resistance
Pipeline resistance is where an often abstract and wonky climate movement meets the bravery and boldness of Indigenous and other frontline defenders of land and water who inspire d...
Clinical Implications of Germline Predisposition Gene Variants in Patients with Refractory or Relapsed B Acute Lymphoblastic Leukemia
Clinical Implications of Germline Predisposition Gene Variants in Patients with Refractory or Relapsed B Acute Lymphoblastic Leukemia
Objectives:Gene variants are important factors in prognosis of the patients with hematological malignancies. In current study, our team investigate the relationship between blood a...
Seismic Vulnerability of the Subsea Pipeline
Seismic Vulnerability of the Subsea Pipeline
Abstract
Unburied marine pipeline vulnerability under seismic impact, a new approach of investigation, and conclusion / recommendations for certain analyzed cases...
MARS-seq2.0: an experimental and analytical pipeline for indexed sorting combined with single-cell RNA sequencing v1
MARS-seq2.0: an experimental and analytical pipeline for indexed sorting combined with single-cell RNA sequencing v1
Human tissues comprise trillions of cells that populate a complex space of molecular phenotypes and functions and that vary in abundance by 4–9 orders of magnitude. Relying solely ...

