Javascript must be enabled to continue!
ProbAlign: a re-alignment method for long sequencing reads
View through CrossRef
AbstractThe incorrect alignments are a severe problem in variant calling, and remain as a challenge computational issue in Bioinformatics field. Although there have been some methods utilizing the re-alignment approach to tackle the misalignments, a standalone re-alignment tool for long sequencing reads is lacking. Hence, we present a standalone tool to correct the misalignments, called ProbAlign. It can be integrated into the pipelines of not only variant calling but also other genomic applications. We demonstrate the use of re-alignment in two diverse and important genomics fields: variant calling and viral quasispecies reconstruction. First, variant calling results in the Pacific Biosciences SMRT re-sequencing data of NA12878 show that false positives can be reduced by 43.5%, and true positives can be increased by 24.8% averagely, after re-alignment. Second, results in reconstructing a 5-virus-mix show that the viral population can be completely unraveled, and also the estimation of quasispecies frequencies has been improved, after re-alignment. ProbAlign is freely available in the PyroTools toolkit (https://github.com/homopolymer/PyroTools).
Title: ProbAlign: a re-alignment method for long sequencing reads
Description:
AbstractThe incorrect alignments are a severe problem in variant calling, and remain as a challenge computational issue in Bioinformatics field.
Although there have been some methods utilizing the re-alignment approach to tackle the misalignments, a standalone re-alignment tool for long sequencing reads is lacking.
Hence, we present a standalone tool to correct the misalignments, called ProbAlign.
It can be integrated into the pipelines of not only variant calling but also other genomic applications.
We demonstrate the use of re-alignment in two diverse and important genomics fields: variant calling and viral quasispecies reconstruction.
First, variant calling results in the Pacific Biosciences SMRT re-sequencing data of NA12878 show that false positives can be reduced by 43.
5%, and true positives can be increased by 24.
8% averagely, after re-alignment.
Second, results in reconstructing a 5-virus-mix show that the viral population can be completely unraveled, and also the estimation of quasispecies frequencies has been improved, after re-alignment.
ProbAlign is freely available in the PyroTools toolkit (https://github.
com/homopolymer/PyroTools).
Related Results
MARS-seq2.0: an experimental and analytical pipeline for indexed sorting combined with single-cell RNA sequencing v1
MARS-seq2.0: an experimental and analytical pipeline for indexed sorting combined with single-cell RNA sequencing v1
Human tissues comprise trillions of cells that populate a complex space of molecular phenotypes and functions and that vary in abundance by 4–9 orders of magnitude. Relying solely ...
Next Generation Sequencing Technologies and Their Applications
Next Generation Sequencing Technologies and Their Applications
Abstract
The advances in next generation sequencing (NGS) technologies have tremendous impacts on the studies of structural and f...
GraphK-LR: Enhancing Long-read Metagenomic Binning with Read-overlap Graphs Across Microbial Kingdoms
GraphK-LR: Enhancing Long-read Metagenomic Binning with Read-overlap Graphs Across Microbial Kingdoms
Abstract
Background: Metagenomics, the study of genetic material from environmental samples, relies on binning - the process of grouping DNA sequences from the same organis...
Pipeline for species-resolved full-length16S rRNA amplicon nanopore sequencing analysis of low-complexity bacterial microbiota
Pipeline for species-resolved full-length16S rRNA amplicon nanopore sequencing analysis of low-complexity bacterial microbiota
Abstract
16S rRNA amplicon sequencing is a fundamental tool for characterizing prokaryotic microbial communities. While short-read 16S rRNA sequencing is a proven s...
DEEP-LONG: A Fast and Accurate Aligner for Long RNA-Seq
DEEP-LONG: A Fast and Accurate Aligner for Long RNA-Seq
Abstract
BackgroundIn recent years, because of the development of sequencing technology, long reads were widely used in many studies, include transcriptomics studies. Obvio...
Ontology Alignment Techniques
Ontology Alignment Techniques
Sometimes the use of a single ontology is not sufficient to cover different vocabularies for the same domain, and it becomes necessary to use several ontologies in order to encompa...
Effects of Waterlogging on Soybean Rhizosphere Microbial Community Profiled Using Illumina MiSeq, LoopSeq, and PacBio 16S rRNA Genes Sequences
Effects of Waterlogging on Soybean Rhizosphere Microbial Community Profiled Using Illumina MiSeq, LoopSeq, and PacBio 16S rRNA Genes Sequences
Abstract
Background: Waterlogging on the global environment has led to a significant decline in crop yields. However, the response of plant-associated microbes to waterlogg...
Abstract 1360: Understanding genetic variation in cancer using targeted nanopore long read sequencing
Abstract 1360: Understanding genetic variation in cancer using targeted nanopore long read sequencing
Abstract
Structural variations (SV), a hallmark of genomic instability in cancer can either activate oncogenes or inactivate tumor suppressor genes. SVs tend to be r...

