Javascript must be enabled to continue!
IonCRAM: a reference-based compression tool for ion torrent sequence files
View through CrossRef
Abstract
Background
Ion Torrent is one of the major next generation sequencing (NGS) technologies and it is frequently used in medical research and diagnosis. The built-in software for the Ion Torrent sequencing machines delivers the sequencing results in the BAM format. In addition to the usual SAM/BAM fields, the Ion Torrent BAM file includes technology-specific
flow signal
data. The flow signals occupy a big portion of the BAM file (about 75% for the human genome). Compressing SAM/BAM into CRAM format significantly reduces the space needed to store the NGS results. However, the tools for generating the CRAM formats are not designed to handle the flow signals. This missing feature has motivated us to develop a new program to improve the compression of the Ion Torrent files for long term archiving.
Results
In this paper, we present IonCRAM, the first reference-based compression tool to compress Ion Torrent BAM files for long term archiving. For the BAM files, IonCRAM could achieve a space saving of about 43%. This space saving is superior to what achieved with the CRAM format by about 8–9%.
Conclusions
Reducing the space consumption of NGS data reduces the cost of storage and data transfer. Therefore, developing efficient compression software for clinical NGS data goes beyond the computational interest; as it ultimately contributes to the overall cost reduction of the clinical test. The space saving achieved by our tool is a practical step in this direction. The tool is open source and available at Code Ocean, github, and
http://ioncram.saudigenomeproject.com
.
Title: IonCRAM: a reference-based compression tool for ion torrent sequence files
Description:
Abstract
Background
Ion Torrent is one of the major next generation sequencing (NGS) technologies and it is frequently used in medical research and diagnosis.
The built-in software for the Ion Torrent sequencing machines delivers the sequencing results in the BAM format.
In addition to the usual SAM/BAM fields, the Ion Torrent BAM file includes technology-specific
flow signal
data.
The flow signals occupy a big portion of the BAM file (about 75% for the human genome).
Compressing SAM/BAM into CRAM format significantly reduces the space needed to store the NGS results.
However, the tools for generating the CRAM formats are not designed to handle the flow signals.
This missing feature has motivated us to develop a new program to improve the compression of the Ion Torrent files for long term archiving.
Results
In this paper, we present IonCRAM, the first reference-based compression tool to compress Ion Torrent BAM files for long term archiving.
For the BAM files, IonCRAM could achieve a space saving of about 43%.
This space saving is superior to what achieved with the CRAM format by about 8–9%.
Conclusions
Reducing the space consumption of NGS data reduces the cost of storage and data transfer.
Therefore, developing efficient compression software for clinical NGS data goes beyond the computational interest; as it ultimately contributes to the overall cost reduction of the clinical test.
The space saving achieved by our tool is a practical step in this direction.
The tool is open source and available at Code Ocean, github, and
http://ioncram.
saudigenomeproject.
com
.
Related Results
Differential Diagnosis of Neurogenic Thoracic Outlet Syndrome: A Review
Differential Diagnosis of Neurogenic Thoracic Outlet Syndrome: A Review
Abstract
Thoracic outlet syndrome (TOS) is a complex and often overlooked condition caused by the compression of neurovascular structures as they pass through the thoracic outlet. ...
Provocative Tests in Diagnosis of Thoracic Outlet Syndrome: A Narrative Review
Provocative Tests in Diagnosis of Thoracic Outlet Syndrome: A Narrative Review
Abstract
Thoracic outlet syndrome (TOS) is a group of conditions caused by the compression of the neurovascular bundle within the thoracic outlet. It is classified into three main ...
Optimising tool wear and workpiece condition monitoring via cyber-physical systems for smart manufacturing
Optimising tool wear and workpiece condition monitoring via cyber-physical systems for smart manufacturing
Smart manufacturing has been developed since the introduction of Industry 4.0. It consists of resource sharing and networking, predictive engineering, and material and data analyti...
Deep learning-based Point Cloud Compression
Deep learning-based Point Cloud Compression
Compression de nuages de points par apprentissage profond
Les nuages de points deviennent essentiels dans de nombreuses applications et les progrès des technologies...
X-Ray CT Measurement of All Solid State Lithium-Ion Battery Under High Pressure Condition
X-Ray CT Measurement of All Solid State Lithium-Ion Battery Under High Pressure Condition
All Solid-state lithium ion Battery (ASB) with sulfide Solid Electrolyte (SE) would achieve high energy density and high-speed charging with high ion conductivity and wide potentia...
Pediatric Rotary Files: Evolution to Revolution
Pediatric Rotary Files: Evolution to Revolution
The main goal of pulp therapy in primary dentition is to preserve the primary tooth thus protecting future normal occlusion. Routinely, pulp debridement and canal shaping ar...
Linear ion traps in mass spectrometry
Linear ion traps in mass spectrometry
Abstract
I.
Introduction
000
II.
Linear Multipoles
000
A. Multipole Fields
000
1. Multipole Potentials
000
2. Ion Motion in 2D Multipole Fields
000...
BT02 Artificial intelligence-ready skin cancer alchemy: transforming routine teledermatology data into metadata-embedded DICOM files
BT02 Artificial intelligence-ready skin cancer alchemy: transforming routine teledermatology data into metadata-embedded DICOM files
Abstract
Most skin artificial intelligence (AI) classifiers are trained only on images with diagnostic labels. However, the addition of clinical information can impr...

