Javascript must be enabled to continue!
ProbAlign: a re-alignment method for long sequencing reads
View through CrossRef
AbstractThe incorrect alignments are a severe problem in variant calling, and remain as a challenge computational issue in Bioinformatics field. Although there have been some methods utilizing the re-alignment approach to tackle the misalignments, a standalone re-alignment tool for long sequencing reads is lacking. Hence, we present a standalone tool to correct the misalignments, called ProbAlign. It can be integrated into the pipelines of not only variant calling but also other genomic applications. We demonstrate the use of re-alignment in two diverse and important genomics fields: variant calling and viral quasispecies reconstruction. First, variant calling results in the Pacific Biosciences SMRT re-sequencing data of NA12878 show that false positives can be reduced by 43.5%, and true positives can be increased by 24.8% averagely, after re-alignment. Second, results in reconstructing a 5-virus-mix show that the viral population can be completely unraveled, and also the estimation of quasispecies frequencies has been improved, after re-alignment. ProbAlign is freely available in the PyroTools toolkit (https://github.com/homopolymer/PyroTools).
Title: ProbAlign: a re-alignment method for long sequencing reads
Description:
AbstractThe incorrect alignments are a severe problem in variant calling, and remain as a challenge computational issue in Bioinformatics field.
Although there have been some methods utilizing the re-alignment approach to tackle the misalignments, a standalone re-alignment tool for long sequencing reads is lacking.
Hence, we present a standalone tool to correct the misalignments, called ProbAlign.
It can be integrated into the pipelines of not only variant calling but also other genomic applications.
We demonstrate the use of re-alignment in two diverse and important genomics fields: variant calling and viral quasispecies reconstruction.
First, variant calling results in the Pacific Biosciences SMRT re-sequencing data of NA12878 show that false positives can be reduced by 43.
5%, and true positives can be increased by 24.
8% averagely, after re-alignment.
Second, results in reconstructing a 5-virus-mix show that the viral population can be completely unraveled, and also the estimation of quasispecies frequencies has been improved, after re-alignment.
ProbAlign is freely available in the PyroTools toolkit (https://github.
com/homopolymer/PyroTools).
Related Results
MARS-seq2.0: an experimental and analytical pipeline for indexed sorting combined with single-cell RNA sequencing v1
MARS-seq2.0: an experimental and analytical pipeline for indexed sorting combined with single-cell RNA sequencing v1
Human tissues comprise trillions of cells that populate a complex space of molecular phenotypes and functions and that vary in abundance by 4–9 orders of magnitude. Relying solely ...
GraphK-LR: Enhancing Long-read Metagenomic Binning with Read-overlap Graphs Across Microbial Kingdoms
GraphK-LR: Enhancing Long-read Metagenomic Binning with Read-overlap Graphs Across Microbial Kingdoms
Abstract
Background: Metagenomics, the study of genetic material from environmental samples, relies on binning - the process of grouping DNA sequences from the same organis...
DEEP-LONG: A Fast and Accurate Aligner for Long RNA-Seq
DEEP-LONG: A Fast and Accurate Aligner for Long RNA-Seq
Abstract
BackgroundIn recent years, because of the development of sequencing technology, long reads were widely used in many studies, include transcriptomics studies. Obvio...
Ontology Alignment Techniques
Ontology Alignment Techniques
Sometimes the use of a single ontology is not sufficient to cover different vocabularies for the same domain, and it becomes necessary to use several ontologies in order to encompa...
Effects of Waterlogging on Soybean Rhizosphere Microbial Community Profiled Using Illumina MiSeq, LoopSeq, and PacBio 16S rRNA Genes Sequences
Effects of Waterlogging on Soybean Rhizosphere Microbial Community Profiled Using Illumina MiSeq, LoopSeq, and PacBio 16S rRNA Genes Sequences
Abstract
Background: Waterlogging on the global environment has led to a significant decline in crop yields. However, the response of plant-associated microbes to waterlogg...
Abstract 1360: Understanding genetic variation in cancer using targeted nanopore long read sequencing
Abstract 1360: Understanding genetic variation in cancer using targeted nanopore long read sequencing
Abstract
Structural variations (SV), a hallmark of genomic instability in cancer can either activate oncogenes or inactivate tumor suppressor genes. SVs tend to be r...
A Polar Moving-Base Alignment Based on Backtracking Scheme
A Polar Moving-Base Alignment Based on Backtracking Scheme
In the polar region, the gravity vector and Earth's rotation vector tend to be in the same direction, leading to slower convergence speed and longer alignment time of the moving ba...
Fast Noisy Long Read Alignment with Multi-Level Parallelism
Fast Noisy Long Read Alignment with Multi-Level Parallelism
Abstract
Background:
The advent of Single Molecule Real-Time (SMRT) sequencing has overcome many limitations of second-generation sequencing, such as limited read lengths, ...

