Kun-peng: an ultra-memory-efficient, fast, and accurate pan-domain taxonomic classifier for all

Abstract

Comprehensive metagenomic sequence classification of diverse environmental samples faces significant computing memory challenges due to exponentially expanding genome databases. Here, we present Kun-peng, featuring a unique ordered 4 GB block database design for ultra-efficient resource management, faster processing, and higher accuracy. When benchmarked on mock communities (Amos HiLo, Mixed, and NIST) against Kraken2, Centrifuge, and Sylph, Kun-peng matched Sylph in achieving the highest precision and lowest false-positive rates while demonstrating superior time and memory efficiency among all tested tools. Furthermore, Kun-peng's efficient database architecture enables the practical use of large-scale reference databases that were previously computationally prohibitive. In comprehensive testing across 586 air, water, soil, and human metagenomic samples using an expansive pan-domain database (204,477 genomes, 4.3 TB), Kun-peng classified 69.78-94.29% of reads, achieving 38-43% higher classification rates than Kraken2 with the standard database. Unexpectedly, Sylph failed to classify any reads in air samples and left more than 99.85% of reads unclassified in water and soil samples. In terms of computational efficiency, Kun-peng processed each sample in 0.2 to 11.2 minutes using only 4.0 to 35.4 GB of peak memory. Remarkably, these processing times were comparable to those of Kraken2 using the standard database (81 GB, 5% of the pan-domain database). Memory-wise, Kun-peng required only 35.4 GB of peak memory with the pan-domain database, representing a 473-fold reduction compared to Kraken2. Compared to Sylph, Kun-peng processed samples up to 46.3 times faster while using up to 20.6 times less memory. Overall, Kun-peng offers an ultra-memory-efficient, fast, and accurate solution for pan-domain metagenomic classification.

Cold Spring Harbor Laboratory

Qiong Chen, Boliang Zhang, Chen Peng, Jiajun Huang, Xiaotao Shen, Chao Jiang

2025
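
The abstract attributes Kun-peng's low peak memory to an ordered block database design but does not describe the mechanism. The sketch below is one plausible reading, not the authors' implementation: a hash-to-taxon table is split into fixed-size blocks, queries are first grouped by the block that would hold them, and the blocks are then visited in order so that only one block needs to be resident at a time. The function names, the modulo partitioning scheme, and the in-memory dictionaries standing in for on-disk block files are all assumptions made for illustration.

```python
from collections import defaultdict


def block_of(key_hash, num_blocks):
    """Map a k-mer hash to a block index (assumed scheme: simple modulo)."""
    return key_hash % num_blocks


def build_blocks(entries, num_blocks):
    """Split one hash -> taxon table into num_blocks smaller tables."""
    blocks = [dict() for _ in range(num_blocks)]
    for key_hash, taxon in entries.items():
        blocks[block_of(key_hash, num_blocks)][key_hash] = taxon
    return blocks


def classify_by_block(queries, blocks):
    """Resolve (read_id, hash) queries while touching one block at a time."""
    num_blocks = len(blocks)

    # Pass 1: bucket every query by the block that would contain its hash.
    buckets = defaultdict(list)
    for read_id, key_hash in queries:
        buckets[block_of(key_hash, num_blocks)].append((read_id, key_hash))

    # Pass 2: visit blocks in order. Indexing the list stands in for loading
    # a single block file from disk, so peak memory tracks the block size
    # rather than the size of the whole database.
    hits = defaultdict(list)
    for idx in range(num_blocks):
        block = blocks[idx]
        for read_id, key_hash in buckets.get(idx, []):
            taxon = block.get(key_hash)
            if taxon is not None:
                hits[read_id].append(taxon)
    return dict(hits)
```

In this toy version, calling `build_blocks` on a small table and passing the result to `classify_by_block` returns the matched taxon IDs per read. Reads with no matching hash are simply absent from the result. A real classifier would additionally resolve the per-read hits to a single taxon, which this sketch leaves out.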
