Javascript must be enabled to continue!
DuReS: An R package for denoising experimental tandem mass spectrometry-based metabolomics data
View through CrossRef
Abstract
Mass spectrometry-based untargeted metabolomics is a powerful technique for profiling small molecules in biological samples, yet accurate metabolite identification remains challenging. One of the primary obstacles in processing tandem mass spectrometry data is the prevalence of random noise peaks, which can result in false annotations and necessitate labor-intensive verification. A common method for removing noise from MS/MS spectra is intensity thresholding, where low-intensity peaks are discarded based on a user-defined cutoff or by analyzing the top “N” most intense peaks. However, determining an optimal threshold is often dataset-specific and may retain many noisy peaks. In this study, we hypothesize that true signal peaks consistently recur across replicate MS/MS spectra generated from the same precursor ion, unlike random noise. An optimal recurrence frequency of 0.12 (95% CI: 0.087-0.15) was derived using an open-source metabolomics dataset, which enhanced the dot product score between the experimental and library spectra by 66% post-denoising and resulted in a median signal and noise reduction of 5.83% and 99.07%, respectively. Validated across multiple metabolomics datasets, our denoising workflow significantly improved spectral matching metrics, leading to more accurate annotations and fewer false positives. Available freely as an R package, Denoising Using Replicate Spectra (DuReS) (
https://github.com/BiosystemEngineeringLab-IITB/dures
) is designed to remove noise while retaining diagnostically significant peaks efficiently. It accepts mzML files and feature lists from standard global untargeted metabolomics analysis software as input, enabling users to seamlessly integrate the denoising pipeline into their workflow without additional data manipulation.
Title: DuReS: An R package for denoising experimental tandem mass spectrometry-based metabolomics data
Description:
Abstract
Mass spectrometry-based untargeted metabolomics is a powerful technique for profiling small molecules in biological samples, yet accurate metabolite identification remains challenging.
One of the primary obstacles in processing tandem mass spectrometry data is the prevalence of random noise peaks, which can result in false annotations and necessitate labor-intensive verification.
A common method for removing noise from MS/MS spectra is intensity thresholding, where low-intensity peaks are discarded based on a user-defined cutoff or by analyzing the top “N” most intense peaks.
However, determining an optimal threshold is often dataset-specific and may retain many noisy peaks.
In this study, we hypothesize that true signal peaks consistently recur across replicate MS/MS spectra generated from the same precursor ion, unlike random noise.
An optimal recurrence frequency of 0.
12 (95% CI: 0.
087-0.
15) was derived using an open-source metabolomics dataset, which enhanced the dot product score between the experimental and library spectra by 66% post-denoising and resulted in a median signal and noise reduction of 5.
83% and 99.
07%, respectively.
Validated across multiple metabolomics datasets, our denoising workflow significantly improved spectral matching metrics, leading to more accurate annotations and fewer false positives.
Available freely as an R package, Denoising Using Replicate Spectra (DuReS) (
https://github.
com/BiosystemEngineeringLab-IITB/dures
) is designed to remove noise while retaining diagnostically significant peaks efficiently.
It accepts mzML files and feature lists from standard global untargeted metabolomics analysis software as input, enabling users to seamlessly integrate the denoising pipeline into their workflow without additional data manipulation.
Related Results
Mass spectrometry of oligosaccharides
Mass spectrometry of oligosaccharides
Abstract
I.
Introduction
162
II.
CHARACTERISTICS OF TANDEM MASS SPECTRA OF CARBOHYDRATES
163
A. Ionization of Carbohydrates
163
1. Electrospray Ionization (E...
Breast Carcinoma within Fibroadenoma: A Systematic Review
Breast Carcinoma within Fibroadenoma: A Systematic Review
Abstract
Introduction
Fibroadenoma is the most common benign breast lesion; however, it carries a potential risk of malignant transformation. This systematic review provides an ove...
Enhancing bone scan image quality: an improved self-supervised denoising approach
Enhancing bone scan image quality: an improved self-supervised denoising approach
Abstract
Objective. Bone scans play an important role in skeletal lesion assessment, but gamma cameras exhibit challenges with low sensitivity and...
Desmoid-Type Fibromatosis of The Breast: A Case Series
Desmoid-Type Fibromatosis of The Breast: A Case Series
Abstract
IntroductionDesmoid-type fibromatosis (DTF), also called aggressive fibromatosis, is a rare, benign, locally aggressive condition. Mammary DTF originates from fibroblasts ...
A New Approach of Outlier-robust Missing Value Imputation for Metabolomics Data Analysis
A New Approach of Outlier-robust Missing Value Imputation for Metabolomics Data Analysis
Background:Metabolomics data generation and quantification are different from other types of molecular “omics” data in bioinformatics. Mass spectrometry (MS) based (gas chromatogra...
Comparison of flow injection analysis electrospray mass spectrometry and tandem mass spectrometry and electrospray high‐field asymmetric waveform ion mobility mass spectrometry and tandem mass spectrometry for the determination of underivatized amino acid
Comparison of flow injection analysis electrospray mass spectrometry and tandem mass spectrometry and electrospray high‐field asymmetric waveform ion mobility mass spectrometry and tandem mass spectrometry for the determination of underivatized amino acid
AbstractTwenty proteinogenic amino acids (AAs) were determined without derivatization using flow injection analysis followed by electrospray ionization mass spectrometry and tandem...
From Targeted Quantification to Untargeted Metabolomics
From Targeted Quantification to Untargeted Metabolomics
Metabolomics is an emerging and rapidly evolving technology tool, which involves quantitative and qualitative metabolite assessments science. It offers tremendous promise for diffe...
MSFC: A New Feature Construction Method for Accurate Diagnosis of Mass Spectrometry Data
MSFC: A New Feature Construction Method for Accurate Diagnosis of Mass Spectrometry Data
Abstract
Background
Mass spectrometry technology can realize dynamic detection of many complex matrix samples in a simple, rapid, compassionate, precise, and high-throughp...

