Search engine for discovering works of Art, research articles, and books related to Art and Culture
ShareThis
Javascript must be enabled to continue!

Dysgu: efficient structural variant calling using short or long reads

View through CrossRef
AbstractStructural variation (SV) plays a fundamental role in genome evolution and can underlie inherited or acquired diseases such as cancer. Long-read sequencing technologies have led to improvements in the characterization of structural variants (SVs), although paired-end sequencing offers better scalability. Here, we present dysgu, which calls SVs or indels using paired-end or long reads. Dysgu detects signals from alignment gaps, discordant and supplementary mappings, and generates consensus contigs, before classifying events using machine learning. Additional SVs are identified by remapping of anomalous sequences. Dysgu outperforms existing state-of-the-art tools using paired-end or long-reads, offering high sensitivity and precision whilst being among the fastest tools to run. We find that combining low coverage paired-end and long-reads is competitive in terms of performance with long-reads at higher coverage values.
Cold Spring Harbor Laboratory
Title: Dysgu: efficient structural variant calling using short or long reads
Description:
AbstractStructural variation (SV) plays a fundamental role in genome evolution and can underlie inherited or acquired diseases such as cancer.
Long-read sequencing technologies have led to improvements in the characterization of structural variants (SVs), although paired-end sequencing offers better scalability.
Here, we present dysgu, which calls SVs or indels using paired-end or long reads.
Dysgu detects signals from alignment gaps, discordant and supplementary mappings, and generates consensus contigs, before classifying events using machine learning.
Additional SVs are identified by remapping of anomalous sequences.
Dysgu outperforms existing state-of-the-art tools using paired-end or long-reads, offering high sensitivity and precision whilst being among the fastest tools to run.
We find that combining low coverage paired-end and long-reads is competitive in terms of performance with long-reads at higher coverage values.

Related Results

Abstract P1-05-23: Utilities and challenges of RNA-Seq based expression and variant calling in a clinical setting
Abstract P1-05-23: Utilities and challenges of RNA-Seq based expression and variant calling in a clinical setting
Abstract Introduction Variant calling based on DNA samples has been the gold standard of clinical testing since the advent of Sanger sequencing. The u...
GraphK-LR: Enhancing Long-read Metagenomic Binning with Read-overlap Graphs Across Microbial Kingdoms
GraphK-LR: Enhancing Long-read Metagenomic Binning with Read-overlap Graphs Across Microbial Kingdoms
Abstract Background: Metagenomics, the study of genetic material from environmental samples, relies on binning - the process of grouping DNA sequences from the same organis...
Frequency and Diversity of Variant Philadelphia Chromosome In Chronic Myeloid Leukemia Patients
Frequency and Diversity of Variant Philadelphia Chromosome In Chronic Myeloid Leukemia Patients
Abstract Abstract 4903 The Philadelphia chromosome (Ph), t(9;22), is detected in around 90% of the chronic myeloid leukemia (CML) patients, but in the...
ProbAlign: a re-alignment method for long sequencing reads
ProbAlign: a re-alignment method for long sequencing reads
AbstractThe incorrect alignments are a severe problem in variant calling, and remain as a challenge computational issue in Bioinformatics field. Although there have been some metho...
DEEP-LONG: A Fast and Accurate Aligner for Long RNA-Seq
DEEP-LONG: A Fast and Accurate Aligner for Long RNA-Seq
Abstract BackgroundIn recent years, because of the development of sequencing technology, long reads were widely used in many studies, include transcriptomics studies. Obvio...
Aldy 4: An efficient genotyper and star-allele caller for pharmacogenomics
Aldy 4: An efficient genotyper and star-allele caller for pharmacogenomics
AbstractHigh-throughput sequencing provides sufficient means for determining genotypes of clinically important pharmacogenes that can be used to tailor medical decisions to individ...

Back to Top