Search engine for discovering works of Art, research articles, and books related to Art and Culture
ShareThis
Javascript must be enabled to continue!

ProbAlign: a re-alignment method for long sequencing reads

View through CrossRef
AbstractThe incorrect alignments are a severe problem in variant calling, and remain as a challenge computational issue in Bioinformatics field. Although there have been some methods utilizing the re-alignment approach to tackle the misalignments, a standalone re-alignment tool for long sequencing reads is lacking. Hence, we present a standalone tool to correct the misalignments, called ProbAlign. It can be integrated into the pipelines of not only variant calling but also other genomic applications. We demonstrate the use of re-alignment in two diverse and important genomics fields: variant calling and viral quasispecies reconstruction. First, variant calling results in the Pacific Biosciences SMRT re-sequencing data of NA12878 show that false positives can be reduced by 43.5%, and true positives can be increased by 24.8% averagely, after re-alignment. Second, results in reconstructing a 5-virus-mix show that the viral population can be completely unraveled, and also the estimation of quasispecies frequencies has been improved, after re-alignment. ProbAlign is freely available in the PyroTools toolkit (https://github.com/homopolymer/PyroTools).
Title: ProbAlign: a re-alignment method for long sequencing reads
Description:
AbstractThe incorrect alignments are a severe problem in variant calling, and remain as a challenge computational issue in Bioinformatics field.
Although there have been some methods utilizing the re-alignment approach to tackle the misalignments, a standalone re-alignment tool for long sequencing reads is lacking.
Hence, we present a standalone tool to correct the misalignments, called ProbAlign.
It can be integrated into the pipelines of not only variant calling but also other genomic applications.
We demonstrate the use of re-alignment in two diverse and important genomics fields: variant calling and viral quasispecies reconstruction.
First, variant calling results in the Pacific Biosciences SMRT re-sequencing data of NA12878 show that false positives can be reduced by 43.
5%, and true positives can be increased by 24.
8% averagely, after re-alignment.
Second, results in reconstructing a 5-virus-mix show that the viral population can be completely unraveled, and also the estimation of quasispecies frequencies has been improved, after re-alignment.
ProbAlign is freely available in the PyroTools toolkit (https://github.
com/homopolymer/PyroTools).

Related Results

MARS-seq2.0: an experimental and analytical pipeline for indexed sorting combined with single-cell RNA sequencing v1
MARS-seq2.0: an experimental and analytical pipeline for indexed sorting combined with single-cell RNA sequencing v1
Human tissues comprise trillions of cells that populate a complex space of molecular phenotypes and functions and that vary in abundance by 4–9 orders of magnitude. Relying solely ...
GraphK-LR: Enhancing Long-read Metagenomic Binning with Read-overlap Graphs Across Microbial Kingdoms
GraphK-LR: Enhancing Long-read Metagenomic Binning with Read-overlap Graphs Across Microbial Kingdoms
Abstract Background: Metagenomics, the study of genetic material from environmental samples, relies on binning - the process of grouping DNA sequences from the same organis...
DEEP-LONG: A Fast and Accurate Aligner for Long RNA-Seq
DEEP-LONG: A Fast and Accurate Aligner for Long RNA-Seq
Abstract BackgroundIn recent years, because of the development of sequencing technology, long reads were widely used in many studies, include transcriptomics studies. Obvio...
Ontology Alignment Techniques
Ontology Alignment Techniques
Sometimes the use of a single ontology is not sufficient to cover different vocabularies for the same domain, and it becomes necessary to use several ontologies in order to encompa...
Abstract 1360: Understanding genetic variation in cancer using targeted nanopore long read sequencing
Abstract 1360: Understanding genetic variation in cancer using targeted nanopore long read sequencing
Abstract Structural variations (SV), a hallmark of genomic instability in cancer can either activate oncogenes or inactivate tumor suppressor genes. SVs tend to be r...
A Polar Moving-Base Alignment Based on Backtracking Scheme
A Polar Moving-Base Alignment Based on Backtracking Scheme
In the polar region, the gravity vector and Earth's rotation vector tend to be in the same direction, leading to slower convergence speed and longer alignment time of the moving ba...
Fast Noisy Long Read Alignment with Multi-Level Parallelism
Fast Noisy Long Read Alignment with Multi-Level Parallelism
Abstract Background: The advent of Single Molecule Real-Time (SMRT) sequencing has overcome many limitations of second-generation sequencing, such as limited read lengths, ...

Back to Top