Execution Performance Analysis of the ABySS Genome Sequence Assembler using Scalasca on the K Computer
This paper describes a performance analysis of the ABySS genome sequence assembler (ABYSS-P) executing on the K computer with up to 8192 compute nodes. The analysis identified issues that limited scalability to fewer than 1024 compute nodes and required prohibitive message buffer memory with 16384 or more compute nodes. The open-source Scalasca toolset was employed to analyse executions, revealing the impact of massive amounts of MPI point-to-point communication, used particularly for master/worker process coordination, and of inefficient parallel file operations that manifest as waiting time at later MPI collective synchronisations and communications. Initial remediation via collective communication operations and alternative strategies for parallel file handling shows large performance and scalability improvements, with partial executions validated on the full 82,944 compute nodes of the K computer.

