Javascript must be enabled to continue!
Biologically Informative NA Deconvolution (BIND) excavates hidden features of the proteome from missing values in large-scale datasets
View through CrossRef
AbstractThe fast-advancing mass spectrometry and related technologies have greatly extended the depth of coverage in large-scale proteomics studies, including single-cell applications. As sample numbers grow rapidly, it is often challenging to interpret the proteins with missing values that are often presented as “NA” (not available). It could be the evidence of no expression, low expression below the detection threshold, or false negative detection due to technical issues. Existing methods for missing values imputation, while generally useful, rarely consider the non-random NA values that inform biological significance. In the current study, we developedBiologicallyInformativeNADeconvolution (BIND) that applies an adaptive neighborhood-based modeling to deconvolve the nature of NAs as “biological” (low/no expression) or technical (experimental errors). Applying to multiple cell line datasets and human tissue extracellular vesicle datasets, BIND excavated the NAs that indicated “hallmark absence” of unique proteins. This led to improvements in protein-protein interaction analysis and the identification of novel disease biomarkers. To facilitate its public accessibility, we compiled BIND into a web server that features functional online operations and interactive visualizations. Furthermore, we demonstrated that the BIND server could deconvolve the NAs and improve the analyses of single-cell proteomics datasets. Overall, BIND delineates the biological significance of missing values rather than treating them as a burden, providing a critical perspective for understanding the complex proteome in various biological contexts.Graphical abstract
Cold Spring Harbor Laboratory
Title: Biologically Informative NA Deconvolution (BIND) excavates hidden features of the proteome from missing values in large-scale datasets
Description:
AbstractThe fast-advancing mass spectrometry and related technologies have greatly extended the depth of coverage in large-scale proteomics studies, including single-cell applications.
As sample numbers grow rapidly, it is often challenging to interpret the proteins with missing values that are often presented as “NA” (not available).
It could be the evidence of no expression, low expression below the detection threshold, or false negative detection due to technical issues.
Existing methods for missing values imputation, while generally useful, rarely consider the non-random NA values that inform biological significance.
In the current study, we developedBiologicallyInformativeNADeconvolution (BIND) that applies an adaptive neighborhood-based modeling to deconvolve the nature of NAs as “biological” (low/no expression) or technical (experimental errors).
Applying to multiple cell line datasets and human tissue extracellular vesicle datasets, BIND excavated the NAs that indicated “hallmark absence” of unique proteins.
This led to improvements in protein-protein interaction analysis and the identification of novel disease biomarkers.
To facilitate its public accessibility, we compiled BIND into a web server that features functional online operations and interactive visualizations.
Furthermore, we demonstrated that the BIND server could deconvolve the NAs and improve the analyses of single-cell proteomics datasets.
Overall, BIND delineates the biological significance of missing values rather than treating them as a burden, providing a critical perspective for understanding the complex proteome in various biological contexts.
Graphical abstract.
Related Results
Sparsity‐enhanced wavelet deconvolution
Sparsity‐enhanced wavelet deconvolution
ABSTRACTWe propose a three‐step bandwidth enhancing wavelet deconvolution process, combining linear inverse filtering and non‐linear reflectivity construction based on a sparseness...
Wave Scattering Deconvolution
Wave Scattering Deconvolution
ABSTRACT
The least-squares approach is commonly used for spiking and predictive deconvolution. An alternative approach is wave scattering deconvolution (WSD) prop...
Abstract 1554: Development of a deconvolution algorithm for tissue-based gene expression data
Abstract 1554: Development of a deconvolution algorithm for tissue-based gene expression data
Abstract
Tissue data provide substantially more information than cell-line data, and offer new opportunities to study cancer biology and evolution in its actual micr...
NLTD 2.0: A Nonlinear Framework for Robust and Customizable Color Deconvolution in Histopathology
NLTD 2.0: A Nonlinear Framework for Robust and Customizable Color Deconvolution in Histopathology
Abstract
Advancements in computational approaches have enabled robust utilization of histological tissue data. A crucial step in the development of computational tools ...
Long-range superharmonic Josephson current and spin-triplet pairing correlations in a junction with ferromagnetic bilayers
Long-range superharmonic Josephson current and spin-triplet pairing correlations in a junction with ferromagnetic bilayers
AbstractThe long-range spin-triplet supercurrent transport is an interesting phenomenon in the superconductor/ferromagnet ("Equation missing") heterostructure containing noncolline...
Targeting the High-Density Lipoprotein Proteome for the Treatment of Post-Acute Sequelae of SARS-CoV-2
Targeting the High-Density Lipoprotein Proteome for the Treatment of Post-Acute Sequelae of SARS-CoV-2
Here, we target the high-density lipoprotein (HDL) proteome in a case series of 16 patients with post-COVID-19 symptoms treated with HMG-Co-A reductase inhibitors (statin) plus ang...
Handling Missing Data in COVID-19 Incidence Estimation: Secondary Data Analysis
Handling Missing Data in COVID-19 Incidence Estimation: Secondary Data Analysis
Abstract
Background
The COVID-19 pandemic has revealed significant challenges in disease forecasting and in developing a public health response, ...
Uncovering the consequences of batch effect associated missing values in omics data analysis
Uncovering the consequences of batch effect associated missing values in omics data analysis
ABSTRACTStatistical analyses in high-dimensional omics data are often hampered by the presence of batch effects (BEs) and missing values (MVs), but the interaction between these tw...

