
Pheniqs: Fast and flexible quality-aware sequence demultiplexing

Abstract

Motivation: Output from high throughput sequencing instruments often exceeds what is necessary to assay a single sample. To better utilize this capacity, multiple samples are independently tagged with a unique “barcode” sequence and are then pooled, or “multiplexed”, and sequenced together. Classifying, or “demultiplexing”, the reads involves decoding the barcode sequence. Although instruments estimate the probability of incorrectly calling each nucleobase, available demultiplexers do not consult those estimates or report classification error probabilities.

Results: We present Pheniqs, a fast and flexible sequence demultiplexer and quality analyzer. In addition to providing an efficient implementation of the widespread minimum distance decoder, Pheniqs introduces a novel Phred-adjusted maximum likelihood decoder that consults base calling quality scores and estimates the probability of a barcode decoding error. Setting an upper bound on the permissible error provides an intuitive way to control demultiplexing confidence and directly influence precision and recall. Pheniqs supports FASTQ and multiple Sequence Alignment/Map formats and uses auxiliary SAM tags to report both library classification and demultiplexing error probability. Evaluation on both real and semi-synthetic data indicates that Pheniqs is faster than existing demultiplexers, substantially when demultiplexing longer reads, and achieves greater accuracy by correctly reflecting quality measurements.

Availability and Implementation: Implemented in multithreaded C++ and available under the terms of the AGPL-3.0 license agreement at http://github.com/biosails/pheniqs. Manual and examples are available at http://biosails.github.io/pheniqs.
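The contrast between the two decoding strategies named in the abstract can be illustrated with a small Python sketch. This is not the Pheniqs implementation (which is written in C++) and it does not reproduce the paper's exact model. It assumes a uniform prior over barcodes, independent base calls, and that a miscalled base is equally likely to be any of the three other nucleotides. All function names and the `max_error` parameter are hypothetical and chosen only for illustration.

```python
def phred_to_error(q):
    """Convert a Phred quality score to a base-calling error probability."""
    return 10 ** (-q / 10)


def hamming(a, b):
    """Count mismatching positions between two equal-length sequences."""
    return sum(x != y for x, y in zip(a, b))


def min_distance_decode(read, barcodes, max_distance=1):
    """Minimum distance decoding: return the unique closest barcode,
    or None when the best match is tied or too far away."""
    distances = sorted((hamming(read, bc), bc) for bc in barcodes)
    best_d, best = distances[0]
    tied = len(distances) > 1 and distances[1][0] == best_d
    if tied or best_d > max_distance:
        return None
    return best


def likelihood(read, quals, barcode):
    """P(observed read | true barcode), assuming independent base calls
    and errors spread evenly over the three alternative nucleotides."""
    p = 1.0
    for base, q, expected in zip(read, quals, barcode):
        e = phred_to_error(q)
        p *= (1 - e) if base == expected else e / 3
    return p


def quality_aware_decode(read, quals, barcodes, max_error=0.01):
    """Pick the most likely barcode under a uniform prior (an assumption)
    and report the estimated probability that the call is wrong.
    The call is rejected (None) when that estimate exceeds max_error."""
    likes = {bc: likelihood(read, quals, bc) for bc in barcodes}
    total = sum(likes.values())
    best = max(likes, key=likes.get)
    error = 1 - likes[best] / total
    return (best if error <= max_error else None), error


if __name__ == "__main__":
    barcodes = ["AAAA", "AATT"]
    read, quals = "AAAT", [40, 40, 40, 5]
    # The read is one mismatch away from both barcodes, so distance alone ties.
    print(min_distance_decode(read, barcodes))
    # The mismatch against AAAA falls on a low-quality base, so the
    # quality-aware decoder favours AAAA and reports its error estimate.
    print(quality_aware_decode(read, quals, barcodes))
```

In this toy case the read is equidistant from both barcodes, so distance alone cannot classify it, while the quality scores indicate which mismatch is more plausibly a sequencing error. Lowering `max_error` trades recall for precision, which mirrors the upper bound on permissible error described in the abstract.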

