Search engine for discovering works of Art, research articles, and books related to Art and Culture
ShareThis
Javascript must be enabled to continue!

DuReS: An R package for denoising experimental tandem mass spectrometry-based metabolomics data

View through CrossRef
Abstract Mass spectrometry-based untargeted metabolomics is a powerful technique for profiling small molecules in biological samples, yet accurate metabolite identification remains challenging. One of the primary obstacles in processing tandem mass spectrometry data is the prevalence of random noise peaks, which can result in false annotations and necessitate labor-intensive verification. A common method for removing noise from MS/MS spectra is intensity thresholding, where low-intensity peaks are discarded based on a user-defined cutoff or by analyzing the top “N” most intense peaks. However, determining an optimal threshold is often dataset-specific and may retain many noisy peaks. In this study, we hypothesize that true signal peaks consistently recur across replicate MS/MS spectra generated from the same precursor ion, unlike random noise. An optimal recurrence frequency of 0.12 (95% CI: 0.087-0.15) was derived using an open-source metabolomics dataset, which enhanced the dot product score between the experimental and library spectra by 66% post-denoising and resulted in a median signal and noise reduction of 5.83% and 99.07%, respectively. Validated across multiple metabolomics datasets, our denoising workflow significantly improved spectral matching metrics, leading to more accurate annotations and fewer false positives. Available freely as an R package, Denoising Using Replicate Spectra (DuReS) ( https://github.com/BiosystemEngineeringLab-IITB/dures ) is designed to remove noise while retaining diagnostically significant peaks efficiently. It accepts mzML files and feature lists from standard global untargeted metabolomics analysis software as input, enabling users to seamlessly integrate the denoising pipeline into their workflow without additional data manipulation.
Title: DuReS: An R package for denoising experimental tandem mass spectrometry-based metabolomics data
Description:
Abstract Mass spectrometry-based untargeted metabolomics is a powerful technique for profiling small molecules in biological samples, yet accurate metabolite identification remains challenging.
One of the primary obstacles in processing tandem mass spectrometry data is the prevalence of random noise peaks, which can result in false annotations and necessitate labor-intensive verification.
A common method for removing noise from MS/MS spectra is intensity thresholding, where low-intensity peaks are discarded based on a user-defined cutoff or by analyzing the top “N” most intense peaks.
However, determining an optimal threshold is often dataset-specific and may retain many noisy peaks.
In this study, we hypothesize that true signal peaks consistently recur across replicate MS/MS spectra generated from the same precursor ion, unlike random noise.
An optimal recurrence frequency of 0.
12 (95% CI: 0.
087-0.
15) was derived using an open-source metabolomics dataset, which enhanced the dot product score between the experimental and library spectra by 66% post-denoising and resulted in a median signal and noise reduction of 5.
83% and 99.
07%, respectively.
Validated across multiple metabolomics datasets, our denoising workflow significantly improved spectral matching metrics, leading to more accurate annotations and fewer false positives.
Available freely as an R package, Denoising Using Replicate Spectra (DuReS) ( https://github.
com/BiosystemEngineeringLab-IITB/dures ) is designed to remove noise while retaining diagnostically significant peaks efficiently.
It accepts mzML files and feature lists from standard global untargeted metabolomics analysis software as input, enabling users to seamlessly integrate the denoising pipeline into their workflow without additional data manipulation.

Related Results

Mass spectrometry of oligosaccharides
Mass spectrometry of oligosaccharides
Abstract I. Introduction 162  II. CHARACTERISTICS OF TANDEM MASS SPECTRA OF CARBOHYDRATES 163 A.  Ionization of Carbohydrates 163     1.  Electrospray Ionization (E...
Breast Carcinoma within Fibroadenoma: A Systematic Review
Breast Carcinoma within Fibroadenoma: A Systematic Review
Abstract Introduction Fibroadenoma is the most common benign breast lesion; however, it carries a potential risk of malignant transformation. This systematic review provides an ove...
Enhancing bone scan image quality: an improved self-supervised denoising approach
Enhancing bone scan image quality: an improved self-supervised denoising approach
Abstract Objective. Bone scans play an important role in skeletal lesion assessment, but gamma cameras exhibit challenges with low sensitivity and...
Desmoid-Type Fibromatosis of The Breast: A Case Series
Desmoid-Type Fibromatosis of The Breast: A Case Series
Abstract IntroductionDesmoid-type fibromatosis (DTF), also called aggressive fibromatosis, is a rare, benign, locally aggressive condition. Mammary DTF originates from fibroblasts ...
A New Approach of Outlier-robust Missing Value Imputation for Metabolomics Data Analysis
A New Approach of Outlier-robust Missing Value Imputation for Metabolomics Data Analysis
Background:Metabolomics data generation and quantification are different from other types of molecular “omics” data in bioinformatics. Mass spectrometry (MS) based (gas chromatogra...
From Targeted Quantification to Untargeted Metabolomics
From Targeted Quantification to Untargeted Metabolomics
Metabolomics is an emerging and rapidly evolving technology tool, which involves quantitative and qualitative metabolite assessments science. It offers tremendous promise for diffe...
MSFC: A New Feature Construction Method for Accurate Diagnosis of Mass Spectrometry Data
MSFC: A New Feature Construction Method for Accurate Diagnosis of Mass Spectrometry Data
Abstract Background Mass spectrometry technology can realize dynamic detection of many complex matrix samples in a simple, rapid, compassionate, precise, and high-throughp...

Back to Top