Search engine for discovering works of Art, research articles, and books related to Art and Culture
ShareThis
Javascript must be enabled to continue!

Microbial networks in SPRING - Semi-parametric rank-based correlation and partial correlation estimation for quantitative microbiome data

View through CrossRef
Abstract High-throughput microbial sequencing techniques, such as targeted amplicon-based and metagenomic profiling, provide low-cost genomic survey data of microbial communities in their natural environment, ranging from marine ecosystems to host-associated habitats. While standard microbiome profiling data can provide sparse relative abundances of operational taxonomic units or genes, recent advances in experimental protocols give a more quantitative picture of microbial communities by pairing sequencing-based techniques with orthogonal measurements of microbial cell counts from the same sample. These tandem measurements provide absolute microbial count data albeit with a large excess of zeros due to limited sequencing depth. In this contribution we consider the fundamental statistical problem of estimating correlations and partial correlations from such quantitative microbiome data. To this end, we propose a semi-parametric rank-based approach to correlation estimation that can naturally deal with the excess zeros in the data. Combining this estimator with sparse graphical modeling techniques leads to the S emi- P arametric R ank-based approach for IN ference in G raphical model (SPRING). SPRING enables inference of statistical microbial association networks from quantitative microbiome data which can serve as high-level statistical summary of the underlying microbial ecosystem and can provide testable hypotheses for functional species-species interactions. Due to the absence of verified microbial associations we also introduce a novel quantitative microbiome data generation mechanism which mimics empirical marginal distributions of measured count data while simultaneously allowing user-specified dependencies among the variables. SPRING shows superior network recovery performance on a wide range of realistic benchmark problems with varying network topologies and is robust to misspecifications of the total cell count estimate. To highlight SPRING’s broad applicability we infer taxon-taxon associations from the American Gut Project data and genus-genus associations from a recent quantitative gut microbiome dataset. We believe that, as quantitative microbiome profiling data will become increasingly available, the semi-parametric estimators for correlation and partial correlation estimation introduced here provide an important tool for reliable statistical analysis of quantitative microbiome data.
Title: Microbial networks in SPRING - Semi-parametric rank-based correlation and partial correlation estimation for quantitative microbiome data
Description:
Abstract High-throughput microbial sequencing techniques, such as targeted amplicon-based and metagenomic profiling, provide low-cost genomic survey data of microbial communities in their natural environment, ranging from marine ecosystems to host-associated habitats.
While standard microbiome profiling data can provide sparse relative abundances of operational taxonomic units or genes, recent advances in experimental protocols give a more quantitative picture of microbial communities by pairing sequencing-based techniques with orthogonal measurements of microbial cell counts from the same sample.
These tandem measurements provide absolute microbial count data albeit with a large excess of zeros due to limited sequencing depth.
In this contribution we consider the fundamental statistical problem of estimating correlations and partial correlations from such quantitative microbiome data.
To this end, we propose a semi-parametric rank-based approach to correlation estimation that can naturally deal with the excess zeros in the data.
Combining this estimator with sparse graphical modeling techniques leads to the S emi- P arametric R ank-based approach for IN ference in G raphical model (SPRING).
SPRING enables inference of statistical microbial association networks from quantitative microbiome data which can serve as high-level statistical summary of the underlying microbial ecosystem and can provide testable hypotheses for functional species-species interactions.
Due to the absence of verified microbial associations we also introduce a novel quantitative microbiome data generation mechanism which mimics empirical marginal distributions of measured count data while simultaneously allowing user-specified dependencies among the variables.
SPRING shows superior network recovery performance on a wide range of realistic benchmark problems with varying network topologies and is robust to misspecifications of the total cell count estimate.
To highlight SPRING’s broad applicability we infer taxon-taxon associations from the American Gut Project data and genus-genus associations from a recent quantitative gut microbiome dataset.
We believe that, as quantitative microbiome profiling data will become increasingly available, the semi-parametric estimators for correlation and partial correlation estimation introduced here provide an important tool for reliable statistical analysis of quantitative microbiome data.

Related Results

Exploring Large Language Models Integration in the Histopathologic Diagnosis of Skin Diseases: A Comparative Study
Exploring Large Language Models Integration in the Histopathologic Diagnosis of Skin Diseases: A Comparative Study
Abstract Introduction The exact manner in which large language models (LLMs) will be integrated into pathology is not yet fully comprehended. This study examines the accuracy, bene...
Quantifying the impact of Human Leukocyte Antigen on the human gut microbiome
Quantifying the impact of Human Leukocyte Antigen on the human gut microbiome
AbstractObjectiveThe gut microbiome is affected by a number of factors, including the innate and adaptive immune system. The major histocompatibility complex (MHC), or the human le...
Impact of urbanization on gut microbiome mosaics across geographic and dietary contexts
Impact of urbanization on gut microbiome mosaics across geographic and dietary contexts
ABSTRACT This study provides a comprehensive assessment of how urban-rural divides influence gut microbial diversity and composit...
Learning a deep language model for microbiomes: the power of large scale unlabeled microbiome data
Learning a deep language model for microbiomes: the power of large scale unlabeled microbiome data
Abstract We use open source human gut microbiome data to learn a microbial “language” model by adapting techniques from Natural Language Processing (NLP). Our micro...
The Future of Microbiome Medicine – An Editor’s Perspective
The Future of Microbiome Medicine – An Editor’s Perspective
The microbiome field continues to grow at an exponential rate with sophisticated approaches that are pushing the frontiers of science and translating fast into clinical practice. T...
Interpolation of Microbiome Composition in Longitudinal Datasets
Interpolation of Microbiome Composition in Longitudinal Datasets
Abstract The human gut microbiome significantly impacts health, prompting a rise in longitudinal studies that capture microbiome samples at multiple time points. Su...
The Regional Diversity of Gut Microbiome Along the GI Tract
The Regional Diversity of Gut Microbiome Along the GI Tract
Abstract Background: The proliferation and survival of microbial organisms including intestinal microbes are determined by their surrounding environments. Contrary to popul...

Back to Top