Search engine for discovering works of Art, research articles, and books related to Art and Culture
ShareThis
Javascript must be enabled to continue!

Reproducible Untargeted Metabolomics Data Analysis Workflow for Exhaustive MS/MS Annotation

View through CrossRef
Motivation Unknown features in untargeted metabolomics and non-targeted analysis (NTA) are identified using fragment ions from MS/MS spectra to predict the structures of the unknown compounds. The precursor ion selected for fragmentation is commonly performed using data dependent acquisition (DDA) strategies or following statistical analysis using targeted MS/MS approaches. However, the selected precursor ions from DDA only cover a biased subset of the peaks or features found in full scan data. In addition, different statistical analysis can select different precursor ions for MS/MS analysis, which make the post-hoc validation of ions selected by new statistical methods impossible for precursor ions selected by the original statistical method. By removing redundant peaks and performing pseudo-targeted MS/MS analysis on independent peaks, we can comprehensively cover unknown compounds found in full scan analysis using a “one peak for one compound” workflow without a priori redundant peak information. Here we propose an reproducible, automated, exhaustive, statistical model-free workflow: paired mass distance-dependent analysis (PMDDA), for untargeted mass spectrometry identification of unknown compounds found in MS1 full scan. Results More annotated compounds/molecular networks/spectrum were found using PMDDA compared with CAMERA and RAMClustR. Meanwhile, PMDDA can generate the preferred ions list for iterative DDA to cover more compounds when instruments support such functions. Availability and implementation The whole workflow is fully reproducible as a docker image xcmsrocker with both the original data and the data processing template. https://hub.docker.com/r/yufree/xcmsrocker A related R package is developed and released online: https://github.com/yufree/rmwf. R script, data files and links of GNPS annotation results including MS1 peaks list and MS2 MGF files were provided in supplementary information.
American Chemical Society (ACS)
Title: Reproducible Untargeted Metabolomics Data Analysis Workflow for Exhaustive MS/MS Annotation
Description:
Motivation Unknown features in untargeted metabolomics and non-targeted analysis (NTA) are identified using fragment ions from MS/MS spectra to predict the structures of the unknown compounds.
The precursor ion selected for fragmentation is commonly performed using data dependent acquisition (DDA) strategies or following statistical analysis using targeted MS/MS approaches.
However, the selected precursor ions from DDA only cover a biased subset of the peaks or features found in full scan data.
In addition, different statistical analysis can select different precursor ions for MS/MS analysis, which make the post-hoc validation of ions selected by new statistical methods impossible for precursor ions selected by the original statistical method.
By removing redundant peaks and performing pseudo-targeted MS/MS analysis on independent peaks, we can comprehensively cover unknown compounds found in full scan analysis using a “one peak for one compound” workflow without a priori redundant peak information.
Here we propose an reproducible, automated, exhaustive, statistical model-free workflow: paired mass distance-dependent analysis (PMDDA), for untargeted mass spectrometry identification of unknown compounds found in MS1 full scan.
Results More annotated compounds/molecular networks/spectrum were found using PMDDA compared with CAMERA and RAMClustR.
Meanwhile, PMDDA can generate the preferred ions list for iterative DDA to cover more compounds when instruments support such functions.
Availability and implementation The whole workflow is fully reproducible as a docker image xcmsrocker with both the original data and the data processing template.
https://hub.
docker.
com/r/yufree/xcmsrocker A related R package is developed and released online: https://github.
com/yufree/rmwf.
R script, data files and links of GNPS annotation results including MS1 peaks list and MS2 MGF files were provided in supplementary information.

Related Results

Reproducible Untargeted Metabolomics Data Analysis Workflow for Exhaustive MS/MS Annotation
Reproducible Untargeted Metabolomics Data Analysis Workflow for Exhaustive MS/MS Annotation
Unknown features in untargeted metabolomics and non-targeted analysis (NTA) are identified using fragment ions from MS/MS spectra to predict the structures of the unknown compounds...
High-Precision Automated Workflow for Urinary Untargeted Metabolomic Epidemiology
High-Precision Automated Workflow for Urinary Untargeted Metabolomic Epidemiology
Urine is a non-invasive biofluid that is rich in polar metabolites and well-suited for metabolomic epidemiology. However, due to individual variability in health and hydration stat...
A New Approach of Outlier-robust Missing Value Imputation for Metabolomics Data Analysis
A New Approach of Outlier-robust Missing Value Imputation for Metabolomics Data Analysis
Background:Metabolomics data generation and quantification are different from other types of molecular “omics” data in bioinformatics. Mass spectrometry (MS) based (gas chromatogra...
THE TRANSFORMATIVE FUTURE OF METABOLOMICS FROM DIAGNOSTICS TO THERAPEUTICS
THE TRANSFORMATIVE FUTURE OF METABOLOMICS FROM DIAGNOSTICS TO THERAPEUTICS
Metabolomics, or the study of metabolites produced during chemical reactions in living organisms, is a fast-growing topic within the "omics" sciences. It has proven valuable in var...
The Hitchhiker’s Guide to Untargeted Lipidomics Analysis: Practical Guidelines
The Hitchhiker’s Guide to Untargeted Lipidomics Analysis: Practical Guidelines
Lipidomics is a newly emerged discipline involving the identification and quantification of thousands of lipids. As a part of the omics field, lipidomics has shown rapid growth bot...
Optimizing Emergency Department Workflow Using Radio Frequency Identification Device (RFID) Data Analytics
Optimizing Emergency Department Workflow Using Radio Frequency Identification Device (RFID) Data Analytics
Emergency Department (ED) is a complex care delivery environment in a hospital that provides time sensitive urgent and lifesaving care [1]. Emergency medicine is an unscheduled pra...
Mining sequence annotation databanks for association patterns
Mining sequence annotation databanks for association patterns
Abstract Motivation: Millions of protein sequences currently being deposited to sequence databanks will never be annotated manually. Similarity-based annotation gene...

Back to Top