Javascript must be enabled to continue!
Global Metagenomics Reveals Hidden Protist Diversity
View through CrossRef
Abstract
Protists, defined as eukaryotes distinct from animals, plants, and fungi, are a polyphyletic group that dominates the eukaryotic tree of life, exhibiting significant phylogenetic diversity and fulfilling critical ecological roles. Historically, research has prioritized protists associated with animals and plants, particularly those of medical significance, thereby overlooking the majority of protist diversity. Conventional molecular methods, such as 18S rRNA gene amplicon sequencing, frequently encounter limitations, including primer binding bias and PCR bias caused by gene length variations, resulting in a biased representation of protistan diversity. Further, most protist lineages are notoriously difficult to cultivate. Here, we analyzed over 30,000 metagenome assemblies and protist single-cell genomes, together with 21 long-amplicon 18S datasets, spanning marine, freshwater, and soil ecosystems. We recovered 157,956 18S rRNA gene sequences (≥800 bp), which clustered into 103,338 operational taxonomic units (OTUs) at 97% sequence identity and 24,438 OTUs at 85% identity. Notably, 69% of 13,238 non-singleton clusters at 85% identity consisted exclusively of environmental sequences, uncovering a wealth of novel, uncultivated, and unclassified protist diversity. A comprehensive taxonomic framework of eukaryotes based on concatenated 18S and 28S rRNA genes that incorporated most novel lineages revealed substantial underrepresentation of Amoebozoa, Discoba, and Rhizaria in reference databases, with many lacking isolate or genome sequence representation. Further, we identified 13 lineages that likely represent deeply branching diversity, including candidates at approximately class- to phylum-level depth, that lack representation in public databases. The corresponding 85% OTUs were primarily affiliated with Excavata, with some branching deeply in the eukaryotic tree. Comprehensive analysis of the global distribution of eukaryotes revealed uneven microbial eukaryotic diversity across supergroups and ecosystems, with notable novelty particularly in soil and marine environments. We then examined co-occurrence between protists and prokaryotes, predicting putative symbiotic or predator-prey relationships, particularly among understudied protist groups with bacteria such as Verrucomicrobia and Rickettsiales. Our results extend current reference coverage and provide a global, contig-based framework for protistan diversity and distribution, highlighting major gaps in curated databases and metabarcoding coverage and guiding targeted studies of these organisms’ ecological roles.
Title: Global Metagenomics Reveals Hidden Protist Diversity
Description:
Abstract
Protists, defined as eukaryotes distinct from animals, plants, and fungi, are a polyphyletic group that dominates the eukaryotic tree of life, exhibiting significant phylogenetic diversity and fulfilling critical ecological roles.
Historically, research has prioritized protists associated with animals and plants, particularly those of medical significance, thereby overlooking the majority of protist diversity.
Conventional molecular methods, such as 18S rRNA gene amplicon sequencing, frequently encounter limitations, including primer binding bias and PCR bias caused by gene length variations, resulting in a biased representation of protistan diversity.
Further, most protist lineages are notoriously difficult to cultivate.
Here, we analyzed over 30,000 metagenome assemblies and protist single-cell genomes, together with 21 long-amplicon 18S datasets, spanning marine, freshwater, and soil ecosystems.
We recovered 157,956 18S rRNA gene sequences (≥800 bp), which clustered into 103,338 operational taxonomic units (OTUs) at 97% sequence identity and 24,438 OTUs at 85% identity.
Notably, 69% of 13,238 non-singleton clusters at 85% identity consisted exclusively of environmental sequences, uncovering a wealth of novel, uncultivated, and unclassified protist diversity.
A comprehensive taxonomic framework of eukaryotes based on concatenated 18S and 28S rRNA genes that incorporated most novel lineages revealed substantial underrepresentation of Amoebozoa, Discoba, and Rhizaria in reference databases, with many lacking isolate or genome sequence representation.
Further, we identified 13 lineages that likely represent deeply branching diversity, including candidates at approximately class- to phylum-level depth, that lack representation in public databases.
The corresponding 85% OTUs were primarily affiliated with Excavata, with some branching deeply in the eukaryotic tree.
Comprehensive analysis of the global distribution of eukaryotes revealed uneven microbial eukaryotic diversity across supergroups and ecosystems, with notable novelty particularly in soil and marine environments.
We then examined co-occurrence between protists and prokaryotes, predicting putative symbiotic or predator-prey relationships, particularly among understudied protist groups with bacteria such as Verrucomicrobia and Rickettsiales.
Our results extend current reference coverage and provide a global, contig-based framework for protistan diversity and distribution, highlighting major gaps in curated databases and metabarcoding coverage and guiding targeted studies of these organisms’ ecological roles.
Related Results
The spatial variability of qualitative and quantitative structure of planktonic protist communities in the North Atlantic Current (the Nordic Seas)
The spatial variability of qualitative and quantitative structure of planktonic protist communities in the North Atlantic Current (the Nordic Seas)
We investigated the spatial variability of qualitative and quantitative planktonic protist community structure in the Nordic Seas in relation to the environmental factors. Our stud...
Developing a Phylogeny Based Machine Learning Algorithm for Metagenomics
Developing a Phylogeny Based Machine Learning Algorithm for Metagenomics
Metagenomics is the study of the totality of the complete genetic elements discovered from a defined environment. Different from traditional microbiology study, which only analyzes...
Comparison of targeted metagenomics and IS-Pro methods for analysing the lung microbiome
Comparison of targeted metagenomics and IS-Pro methods for analysing the lung microbiome
AbstractBackgroundTargeted metagenomics and IS-Pro method are two of the many methods that have been used to study the microbiome. The two methods target different regions of the 1...
METAGENOMICS CURRENT RESEARCH, APPLICATION AND COMPUTATIONAL ANALYSIS
METAGENOMICS CURRENT RESEARCH, APPLICATION AND COMPUTATIONAL ANALYSIS
Metagenomics is the combination of genomics branch and meta that means huge set of genomes from different organisms. Metagenomics is also called as environmental genomics or commun...
Spatial, temporal and behavioral patterns of marine protists = Patrons espaials, temporals i de comportament dels protistes marins
Spatial, temporal and behavioral patterns of marine protists = Patrons espaials, temporals i de comportament dels protistes marins
The oceans are microbial-dominated ecosystems, where protists (single-celled eukaryotes) play fundamental roles performing multiple functions as primary producers, consumers, decom...
Metagenomics provides broad detection of pathogens, antimicrobial resistance, and virulence genes in pig diarrhoea and complement conventional methods
Metagenomics provides broad detection of pathogens, antimicrobial resistance, and virulence genes in pig diarrhoea and complement conventional methods
Abstract
Background:
Post-weaning diarrhoea (PWD) remains a major cause of morbidity in pig production and is commonly associated with enterotox...
Making a pathogen? Evaluating the impact of protist predation on the evolution of virulence in
Serratia marcescens
Making a pathogen? Evaluating the impact of protist predation on the evolution of virulence in
Serratia marcescens
Abstract
Opportunistic pathogens are environmental microbes that are generally harmless and only occasionally cause disease. Unlike obligate path...
Making a Pathogen? Evaluating the Impact of Protist Predation on the Evolution of Virulence in Serratia marcescens
Making a Pathogen? Evaluating the Impact of Protist Predation on the Evolution of Virulence in Serratia marcescens
Abstract
Opportunistic pathogens are environmental microbes that are generally harmless and only occasionally cause disease. Unlike obligate pathogens, the growth an...

