Javascript must be enabled to continue!
Fast Noisy Long Read Alignment with Multi-Level Parallelism
View through CrossRef
Abstract
Background:
The advent of Single Molecule Real-Time (SMRT) sequencing has overcome many limitations of second-generation sequencing, such as limited read lengths, PCR amplification biases. However, longer reads increase data volume exponentially and high error rates make many existing alignment tools inapplicable. Additionally, a single CPU's performance bottleneck restricts the effectiveness of alignment algorithms for SMRT sequencing.
Methods:
To address these challenges, we introduce ParaHAT, a parallel alignment algorithm for noisy long reads. ParaHAT utilizes vector-level, thread-level, process-level, and heterogeneous parallelism. We redesign the dynamic programming matrices layouts to eliminate data dependency in the base-level alignment, enabling effective vectorization. We further enhance computational speed through heterogeneous parallel technology and implement the algorithm for multi-node computing using MPI, overcoming the computational limits of a single node.
Conclusion:
Performance evaluations show that ParaHAT got a 5.39x speedup in base-level alignment, with a parallel acceleration ratio and weak scalability metric of 94.61 and 98.98% on 128 nodes, respectively.
Springer Science and Business Media LLC
Title: Fast Noisy Long Read Alignment with Multi-Level Parallelism
Description:
Abstract
Background:
The advent of Single Molecule Real-Time (SMRT) sequencing has overcome many limitations of second-generation sequencing, such as limited read lengths, PCR amplification biases.
However, longer reads increase data volume exponentially and high error rates make many existing alignment tools inapplicable.
Additionally, a single CPU's performance bottleneck restricts the effectiveness of alignment algorithms for SMRT sequencing.
Methods:
To address these challenges, we introduce ParaHAT, a parallel alignment algorithm for noisy long reads.
ParaHAT utilizes vector-level, thread-level, process-level, and heterogeneous parallelism.
We redesign the dynamic programming matrices layouts to eliminate data dependency in the base-level alignment, enabling effective vectorization.
We further enhance computational speed through heterogeneous parallel technology and implement the algorithm for multi-node computing using MPI, overcoming the computational limits of a single node.
Conclusion:
Performance evaluations show that ParaHAT got a 5.
39x speedup in base-level alignment, with a parallel acceleration ratio and weak scalability metric of 94.
61 and 98.
98% on 128 nodes, respectively.
Related Results
ПОЛІТИЧНИЙ ПАРАЛЕЛІЗМ: УКРАЇНСЬКИЙ КОНТЕКСТ
ПОЛІТИЧНИЙ ПАРАЛЕЛІЗМ: УКРАЇНСЬКИЙ КОНТЕКСТ
<p><em>The study uses an empirical method which involves free finding of right material to study the origin and genesis of political parallelism as an integral characte...
Ontology Alignment Techniques
Ontology Alignment Techniques
Sometimes the use of a single ontology is not sufficient to cover different vocabularies for the same domain, and it becomes necessary to use several ontologies in order to encompa...
The Concept of Parallelism in the Lyrical Poems of Besarani (1641- 1702)
The Concept of Parallelism in the Lyrical Poems of Besarani (1641- 1702)
The following study is about “The Concept of Parallelism in the Lyrical Poems of Besarani (1641- 1702)”, who wrote his lyrics in the Kurdish dialect Gorani (Hawrami). Parallelism (...
On the technique of artistic parallelism in Dargin quatrains
On the technique of artistic parallelism in Dargin quatrains
The subject of this research is the quatrains – a variety of lyrical songs, one of widespread and popular poetic genres in the folklore of the peoples of Dagestan. The ob...
Relation Graph Alignment for Learning with Noisy Labels
Relation Graph Alignment for Learning with Noisy Labels
Abstract
Noisy label learning is crucial for improving model performance on datasets with unreliable labels, which enables models to better tolerate inaccurate data. Existi...
Love the one you’re with: replicate viral adaptations converge on the same phenotypic change
Love the one you’re with: replicate viral adaptations converge on the same phenotypic change
Parallelism is important because it reveals how inherently stochastic adaptation is. Even as we come to better understand evolutionary forces, stochasticity limits how well we can ...
Reference-Based Speech Enhancement via Feature Alignment and Fusion Network
Reference-Based Speech Enhancement via Feature Alignment and Fusion Network
Speech enhancement aims at recovering a clean speech from a noisy input, which can be classified into single speech enhancement and personalized speech enhancement. Personalized sp...
Discussion of Mask Alignment Accuracy for EUV Lithography
Discussion of Mask Alignment Accuracy for EUV Lithography
The shorter lithography printing light source wavelength is, the more accurate alignment is required for resist printing of high packing density integrated circuits (ICs). As to lo...

