Javascript must be enabled to continue!
Carbohydrate-active enzyme annotation in microbiomes using dbCAN
View through CrossRef
AbstractCAZymes or carbohydrate-active enzymes are critically important for human gut health, lignocellulose degradation, global carbon recycling, soil health, and plant disease. We developed dbCAN as a web server in 2012 and actively maintain it for automated CAZyme annotation. Considering data privacy and scalability, we provide run_dbcan as a standalone software package since 2018 to allow users perform more secure and scalable CAZyme annotation on their local servers. Here, we offer a comprehensive computational protocol on automated CAZyme annotation of microbiome sequencing data, covering everything from short read pre-processing to data visualization of CAZyme and glycan substrate occurrence and abundance in multiple samples. Using a real-world metagenomic sequencing dataset, this protocol describes commands for dataset and software preparation, metagenome assembly, gene prediction, CAZyme prediction, CAZyme gene cluster (CGC) prediction, glycan substrate prediction, and data visualization. The expected results include publication-quality plots for the abundance of CAZymes, CGCs, and substrates from multiple CAZyme annotation routes (individual sample assembly, co-assembly, and assembly-free). For the individual sample assembly route, this protocol takes ∼33h on a Linux computer with 40 CPUs, while other routes will be faster. This protocol does not require programming experience from users, but it does assume a familiarity with the Linux command-line interface and the ability to run Python scripts in the terminal. The target audience includes the tens of thousands of microbiome researchers who routinely use our web server. This protocol will encourage them to perform more secure, rapid, and scalable CAZyme annotation on their local computer servers.
Cold Spring Harbor Laboratory
Title: Carbohydrate-active enzyme annotation in microbiomes using dbCAN
Description:
AbstractCAZymes or carbohydrate-active enzymes are critically important for human gut health, lignocellulose degradation, global carbon recycling, soil health, and plant disease.
We developed dbCAN as a web server in 2012 and actively maintain it for automated CAZyme annotation.
Considering data privacy and scalability, we provide run_dbcan as a standalone software package since 2018 to allow users perform more secure and scalable CAZyme annotation on their local servers.
Here, we offer a comprehensive computational protocol on automated CAZyme annotation of microbiome sequencing data, covering everything from short read pre-processing to data visualization of CAZyme and glycan substrate occurrence and abundance in multiple samples.
Using a real-world metagenomic sequencing dataset, this protocol describes commands for dataset and software preparation, metagenome assembly, gene prediction, CAZyme prediction, CAZyme gene cluster (CGC) prediction, glycan substrate prediction, and data visualization.
The expected results include publication-quality plots for the abundance of CAZymes, CGCs, and substrates from multiple CAZyme annotation routes (individual sample assembly, co-assembly, and assembly-free).
For the individual sample assembly route, this protocol takes ∼33h on a Linux computer with 40 CPUs, while other routes will be faster.
This protocol does not require programming experience from users, but it does assume a familiarity with the Linux command-line interface and the ability to run Python scripts in the terminal.
The target audience includes the tens of thousands of microbiome researchers who routinely use our web server.
This protocol will encourage them to perform more secure, rapid, and scalable CAZyme annotation on their local computer servers.
Related Results
Immune-oncology-microbiome axis may result in AKP or anti-AKP effects in intratumor microbiomes
Immune-oncology-microbiome axis may result in AKP or anti-AKP effects in intratumor microbiomes
AbstractAn emerging consensus regarding the triangle relationship between tumor, immune cells, and microbiomes is the immune-oncology-microbiome (IOM) axis, which stipulates that m...
Phycobiliprotein production with cyanobacteria-rich cultures and microbiomes
Phycobiliprotein production with cyanobacteria-rich cultures and microbiomes
(English) Phycobiliproteins are pigments found in cyanobacteria, which are exploited in the food, cosmetic, and pharmaceutical industries. However, the large-scale production of th...
Lysogeny destabilizes computationally simulated microbiomes
Lysogeny destabilizes computationally simulated microbiomes
AbstractBackgroundThe Anna Karenina Principle predicts that stability in host-associated microbiomes correlates with health in the host. Microbiomes are ecosystems, and classical e...
Phylogenetic Measures of the Core Microbiome
Phylogenetic Measures of the Core Microbiome
Abstract
Background
A useful concept in microbial ecology is the ‘core microbiome.’ Typically, core microbiomes are defined as the microb...
Association Between a Low Carbohydrate Diet, Quality of Life, and Glycemic Control in Australian Adults Living With Type 1 Diabetes: Protocol for a Mixed Methods Pilot Study (Preprint)
Association Between a Low Carbohydrate Diet, Quality of Life, and Glycemic Control in Australian Adults Living With Type 1 Diabetes: Protocol for a Mixed Methods Pilot Study (Preprint)
BACKGROUND
Globally, the prevalence of type 1 diabetes mellitus (T1DM) is rising. In 2020, a total of 124,652 Australians had T1DM. Maintaining optimal glyc...
Designing function-specific minimal microbiomes from large microbial communities
Designing function-specific minimal microbiomes from large microbial communities
AbstractMotivationMicroorganisms thrive in large communities of diverse species, exhibiting various functionalities. The mammalian gut microbiome, for instance, has the functionali...
Supramolecular Approaches to the Study of Glycobiology
Supramolecular Approaches to the Study of Glycobiology
AbstractEvery mammalian cell is surrounded by carbohydrate conjugates forming a nano‐dimensioned layer termed “glycocalyx,” a name derived from the Greek words “glykos” for “sugar”...
Mining sequence annotation databanks for association patterns
Mining sequence annotation databanks for association patterns
Abstract
Motivation: Millions of protein sequences currently being deposited to sequence databanks will never be annotated manually. Similarity-based annotation gene...

