Javascript must be enabled to continue!
Reproducible Untargeted Metabolomics Data Analysis Workflow for Exhaustive MS/MS Annotation
View through CrossRef
Unknown features in untargeted metabolomics and non-targeted analysis (NTA) are identified using fragment ions from MS/MS spectra to predict the structures of the unknown compounds. The precursor ion selected for fragmentation is commonly performed using data dependent acquisition (DDA) strategies or following statistical analysis using targeted MS/MS approaches. However, the selected precursor ions from DDA only cover a biased subset of the peaks or features found in full scan data. In addition, different statistical analysis can select different precursor ions for MS/MS analysis, which make the post-hoc validation of ions selected by new statistical methods impossible for precursor ions selected by the original statistical method. Here we propose an automated, exhaustive, statistical model-free workflow: paired mass distance-dependent analysis (PMDDA), for untargeted mass spectrometry identification of unknown compounds. By removing redundant peaks and performing pseudo-targeted MS/MS analysis on independent peaks, we can comprehensively cover unknown compounds found in full scan analysis using a “one peak for one compound” workflow without a priori redundant peak information. We show that compared to DDA, PMDDA is more comprehensive and robust against samples' matrix effects. Further, more compounds were identified by database annotation using PMDDA compared with CAMERA and RAMClustR. Finally, compounds with signals in both positive and negative modes can be identified by the PMDDA workflow, to further reduce redundancies. The whole workflow is fully reproducible as a docker image xcmsrocker with both the original data and the data processing template.
Title: Reproducible Untargeted Metabolomics Data Analysis Workflow for Exhaustive MS/MS Annotation
Description:
Unknown features in untargeted metabolomics and non-targeted analysis (NTA) are identified using fragment ions from MS/MS spectra to predict the structures of the unknown compounds.
The precursor ion selected for fragmentation is commonly performed using data dependent acquisition (DDA) strategies or following statistical analysis using targeted MS/MS approaches.
However, the selected precursor ions from DDA only cover a biased subset of the peaks or features found in full scan data.
In addition, different statistical analysis can select different precursor ions for MS/MS analysis, which make the post-hoc validation of ions selected by new statistical methods impossible for precursor ions selected by the original statistical method.
Here we propose an automated, exhaustive, statistical model-free workflow: paired mass distance-dependent analysis (PMDDA), for untargeted mass spectrometry identification of unknown compounds.
By removing redundant peaks and performing pseudo-targeted MS/MS analysis on independent peaks, we can comprehensively cover unknown compounds found in full scan analysis using a “one peak for one compound” workflow without a priori redundant peak information.
We show that compared to DDA, PMDDA is more comprehensive and robust against samples' matrix effects.
Further, more compounds were identified by database annotation using PMDDA compared with CAMERA and RAMClustR.
Finally, compounds with signals in both positive and negative modes can be identified by the PMDDA workflow, to further reduce redundancies.
The whole workflow is fully reproducible as a docker image xcmsrocker with both the original data and the data processing template.
Related Results
Reproducible Untargeted Metabolomics Data Analysis Workflow for Exhaustive MS/MS Annotation
Reproducible Untargeted Metabolomics Data Analysis Workflow for Exhaustive MS/MS Annotation
Motivation Unknown features in untargeted metabolomics and non-targeted analysis (NTA) are identified using fragment ions from MS/MS spectra to predict the structures of the unknow...
High-Precision Automated Workflow for Urinary Untargeted Metabolomic Epidemiology
High-Precision Automated Workflow for Urinary Untargeted Metabolomic Epidemiology
Urine is a non-invasive biofluid that is rich in polar metabolites and well-suited for metabolomic epidemiology. However, due to individual variability in health and hydration stat...
Pivotal interplays between fecal metabolome and gut microbiome reveal functional signatures in cerebral ischemic stroke
Pivotal interplays between fecal metabolome and gut microbiome reveal functional signatures in cerebral ischemic stroke
Abstract
Background
Integrative analysis approaches of metagenomics and metabolomics have been widely developed to understand the association betwee...
A New Approach of Outlier-robust Missing Value Imputation for Metabolomics Data Analysis
A New Approach of Outlier-robust Missing Value Imputation for Metabolomics Data Analysis
Background:Metabolomics data generation and quantification are different from other types of molecular “omics” data in bioinformatics. Mass spectrometry (MS) based (gas chromatogra...
THE TRANSFORMATIVE FUTURE OF METABOLOMICS FROM DIAGNOSTICS TO THERAPEUTICS
THE TRANSFORMATIVE FUTURE OF METABOLOMICS FROM DIAGNOSTICS TO THERAPEUTICS
Metabolomics, or the study of metabolites produced during chemical reactions in living organisms, is a fast-growing topic within the "omics" sciences. It has proven valuable in var...
The Hitchhiker’s Guide to Untargeted Lipidomics Analysis: Practical Guidelines
The Hitchhiker’s Guide to Untargeted Lipidomics Analysis: Practical Guidelines
Lipidomics is a newly emerged discipline involving the identification and quantification of thousands of lipids. As a part of the omics field, lipidomics has shown rapid growth bot...
Optimizing Emergency Department Workflow Using Radio Frequency Identification Device (RFID) Data Analytics
Optimizing Emergency Department Workflow Using Radio Frequency Identification Device (RFID) Data Analytics
Emergency Department (ED) is a complex care delivery environment in a hospital that provides time sensitive urgent and lifesaving care [1]. Emergency medicine is an unscheduled pra...
Mining sequence annotation databanks for association patterns
Mining sequence annotation databanks for association patterns
Abstract
Motivation: Millions of protein sequences currently being deposited to sequence databanks will never be annotated manually. Similarity-based annotation gene...

