Search engine for discovering works of Art, research articles, and books related to Art and Culture
ShareThis
Javascript must be enabled to continue!

Prediction of sporulating Firmicutes from uncultured gut microbiota using SpoMAG, an ensemble learning tool

View through CrossRef
Sporulation represents a key adaptive strategy among Firmicutes, facilitating bacterial persistence under environmental stress while mediating host colonization, transmission dynamics, and microbiome stability. Despite the recognized ecological and biomedical significance of spore-forming Bacilli and Clostridia, most taxa remain uncultivated, limiting phenotypic characterization of their sporulation capacity. To bridge this knowledge gap, we developed SpoMAG, an ensemble machine learning framework that predicts sporulation potential of metagenome-assembled genomes (MAGs) through supervised classification models trained on the presence/absence of 160 sporulation-associated genes. This R-based tool integrates Random Forest and support vector machine algorithms, achieving probabilistic predictions with high performance (AUC = 92.2%, F1-score = 88.2%). Application to fecal metagenomes from humans, cattle, poultry, and swine identified 63 putatively spore-forming MAGs exhibiting distinct host- and order-specific patterns. Bacilli MAGs from Bacillales and Paenibacillales orders showed high sporulation probabilities and gene richness, while Clostridia MAGs exhibited more heterogeneous profiles. Predictions included undercharacterized families in the spore-forming perspective, such as Acetivibrionaceae, Christensenellaceae, and UBA1381, expanding the known phylogenetic breadth of sporulation capacity. Nine genes were consistently present across all predicted spore-formers (namely pth, yaaT, spoIIAB, spoIIIAE, spoIIIAD, ctpB, ftsW, spoVD, and lgt), suggesting conserved genetic elements across uncultivated Firmicutes for future research. Average nucleotide identity (ANI) analysis revealed seven cases of species-level sharing (ANI value > 95%) among hosts, including a putative novel Acetivibrionaceae species, suggesting possible cross-host transmission facilitated by sporulation. In all 63 genomes predicted to sporulate, we identified nine genes across sporulation steps. In addition, SHapley Additive exPlanations (SHAP) analysis indicated 16 consensus genes consistently contributing to predictions (namely lytH, cotP, spoIIIAG, spoIIR, spoVAD, gerC, yabP, yqfD, gerD, spoVAA, gpr, ytaF, gdh, ypeB, spoVID, and ymfJ), bringing biologically meaningful features across sporulation stages. By combining gene annotation with interpretable machine learning, SpoMAG provides a reproducible and accessible framework to infer sporulation potential in uncultured microbial taxa. This tool enhances targeted investigations into microbial survival strategies and supports research in microbiome ecology, probiotic discovery, food safety, and public health surveillance. SpoMAG is freely available as an R package and expands current capabilities for functional inference in metagenomic datasets.
Title: Prediction of sporulating Firmicutes from uncultured gut microbiota using SpoMAG, an ensemble learning tool
Description:
Sporulation represents a key adaptive strategy among Firmicutes, facilitating bacterial persistence under environmental stress while mediating host colonization, transmission dynamics, and microbiome stability.
Despite the recognized ecological and biomedical significance of spore-forming Bacilli and Clostridia, most taxa remain uncultivated, limiting phenotypic characterization of their sporulation capacity.
To bridge this knowledge gap, we developed SpoMAG, an ensemble machine learning framework that predicts sporulation potential of metagenome-assembled genomes (MAGs) through supervised classification models trained on the presence/absence of 160 sporulation-associated genes.
This R-based tool integrates Random Forest and support vector machine algorithms, achieving probabilistic predictions with high performance (AUC = 92.
2%, F1-score = 88.
2%).
Application to fecal metagenomes from humans, cattle, poultry, and swine identified 63 putatively spore-forming MAGs exhibiting distinct host- and order-specific patterns.
Bacilli MAGs from Bacillales and Paenibacillales orders showed high sporulation probabilities and gene richness, while Clostridia MAGs exhibited more heterogeneous profiles.
Predictions included undercharacterized families in the spore-forming perspective, such as Acetivibrionaceae, Christensenellaceae, and UBA1381, expanding the known phylogenetic breadth of sporulation capacity.
Nine genes were consistently present across all predicted spore-formers (namely pth, yaaT, spoIIAB, spoIIIAE, spoIIIAD, ctpB, ftsW, spoVD, and lgt), suggesting conserved genetic elements across uncultivated Firmicutes for future research.
Average nucleotide identity (ANI) analysis revealed seven cases of species-level sharing (ANI value > 95%) among hosts, including a putative novel Acetivibrionaceae species, suggesting possible cross-host transmission facilitated by sporulation.
In all 63 genomes predicted to sporulate, we identified nine genes across sporulation steps.
In addition, SHapley Additive exPlanations (SHAP) analysis indicated 16 consensus genes consistently contributing to predictions (namely lytH, cotP, spoIIIAG, spoIIR, spoVAD, gerC, yabP, yqfD, gerD, spoVAA, gpr, ytaF, gdh, ypeB, spoVID, and ymfJ), bringing biologically meaningful features across sporulation stages.
By combining gene annotation with interpretable machine learning, SpoMAG provides a reproducible and accessible framework to infer sporulation potential in uncultured microbial taxa.
This tool enhances targeted investigations into microbial survival strategies and supports research in microbiome ecology, probiotic discovery, food safety, and public health surveillance.
SpoMAG is freely available as an R package and expands current capabilities for functional inference in metagenomic datasets.

Related Results

IMPACT OF GUT MICROBIOTA ON POSTOPERATIVE RECOVERY AND WOUND HEALING
IMPACT OF GUT MICROBIOTA ON POSTOPERATIVE RECOVERY AND WOUND HEALING
The gut microbiota, made up of trillions of microorganisms that inhabit the gastrointestinal tract, plays a fundamental role in human health, influencing immunological and metaboli...
The Microbiota and Microbiome in COVID-19 in Adults and Children and Potential Therapeutic Interventions: A Review
The Microbiota and Microbiome in COVID-19 in Adults and Children and Potential Therapeutic Interventions: A Review
The work presented is a comprehensive review of the role of the human microbiota in the context of the COVID-19 pandemic. A diverse microbial community heavily colonizes the human ...
The gut bacterial microbiome of Nile tilapia (Oreochromis niloticus) from lakes across an altitudinal gradient
The gut bacterial microbiome of Nile tilapia (Oreochromis niloticus) from lakes across an altitudinal gradient
Abstract BackgroundMicroorganisms inhabiting the gut play a significant role in supporting fundamental physiological processes of the host, which contributes to their survi...
Comparative study of the gut microbiota in three captive Rhinopithecus species
Comparative study of the gut microbiota in three captive Rhinopithecus species
Abstract Background Snub-nosed monkeys are highly endangered primates and their population continues to decline with the habitat fragmentation. Arti...
Causal relationship between gut microbiota and malignant lymphoma:a two-way two-sample Mendelian randomization study
Causal relationship between gut microbiota and malignant lymphoma:a two-way two-sample Mendelian randomization study
Abstract Background The significance of gut microbiota in human health is gaining attention, leading to a rise in observational and clinical studies focused on understandi...
Diversity analysis of oral and gut microbiota in osteoporotic rats
Diversity analysis of oral and gut microbiota in osteoporotic rats
The oral and gut microbiota had been shown to control bone metabolism and have a strong correlation with osteoporosis. However, to reveal the oral and gut bacteria characteristics ...
Gut microbial profile of treatment-naive patients with primary biliary cholangitis
Gut microbial profile of treatment-naive patients with primary biliary cholangitis
Abstract Background and aims: The pathogenesis of primary biliary cholangitis (PBC) is associated with alterations of gut microbiota. We compared the gut microbiota of PBC...

Back to Top