Javascript must be enabled to continue!
Bayesian Multi-Study Non-Negative Matrix Factorization for Mutational Signatures
View through CrossRef
A
bstract
Mutational signatures shed insight into the range of mutational processes giving rise to tumors and allow a better understanding of cancer origin. They are typically identified from high-throughput sequencing data of cancer genomes using non-negative matrix factorization (NMF), and many such techniques have been developed towards this aim. However, it is often of particular interest to compare mutational signatures across multiple conditions, e.g. to understand which signatures are present across different treatments, or to identify signatures that are shared or specific across cancer types. Existing techniques within the NMF context only allow decomposition within a single dataset, so that integrating results across multiple conditions requires running separate analyses on each dataset, followed by subjective and manual comparisons of the identified signatures. To address this issue, we propose a Bayesian multi-study NMF method that jointly decomposes multiple studies or conditions to identify signatures that are common, specific, or partially shared by any subset. We propose two models: a “discovery-only” model that estimates de novo signatures in a completely unsupervised manner, and a “recovery-discovery” model that builds informative priors from previously known signatures to both update the estimates of these signatures and identify any novel signatures. We then further extend these models to estimate the effects of sample-level covariates on the exposures to each signature, enforcing sparsity through a non-local spike-and-slab prior. We demonstrate our approach on a range of simulations, and apply our method to colorectal cancer samples to show its utility.
Title: Bayesian Multi-Study Non-Negative Matrix Factorization for Mutational Signatures
Description:
A
bstract
Mutational signatures shed insight into the range of mutational processes giving rise to tumors and allow a better understanding of cancer origin.
They are typically identified from high-throughput sequencing data of cancer genomes using non-negative matrix factorization (NMF), and many such techniques have been developed towards this aim.
However, it is often of particular interest to compare mutational signatures across multiple conditions, e.
g.
to understand which signatures are present across different treatments, or to identify signatures that are shared or specific across cancer types.
Existing techniques within the NMF context only allow decomposition within a single dataset, so that integrating results across multiple conditions requires running separate analyses on each dataset, followed by subjective and manual comparisons of the identified signatures.
To address this issue, we propose a Bayesian multi-study NMF method that jointly decomposes multiple studies or conditions to identify signatures that are common, specific, or partially shared by any subset.
We propose two models: a “discovery-only” model that estimates de novo signatures in a completely unsupervised manner, and a “recovery-discovery” model that builds informative priors from previously known signatures to both update the estimates of these signatures and identify any novel signatures.
We then further extend these models to estimate the effects of sample-level covariates on the exposures to each signature, enforcing sparsity through a non-local spike-and-slab prior.
We demonstrate our approach on a range of simulations, and apply our method to colorectal cancer samples to show its utility.
Related Results
Substitution mutational signatures across pan-squamous cell carcinomas
Substitution mutational signatures across pan-squamous cell carcinomas
Abstract
Background
Squamous cell carcinoma (SCC) is a highly heterogeneous and aggressive cancer type with significant g...
Predictors of False-Negative Axillary FNA Among Breast Cancer Patients: A Cross-Sectional Study
Predictors of False-Negative Axillary FNA Among Breast Cancer Patients: A Cross-Sectional Study
Abstract
Introduction
Fine-needle aspiration (FNA) is commonly used to investigate lymphadenopathy of suspected metastatic origin. The current study aims to find the association be...
Clonality and Mixed Mutational Signature in Aged Hematopoietic Stem Cells Via Single Cell Variant Analysis
Clonality and Mixed Mutational Signature in Aged Hematopoietic Stem Cells Via Single Cell Variant Analysis
Abstract
Genomic stability and integrity in Hematopoietic Stem Cells (HSCs) is maintained via DNA damage checkpoints, DNA proofreading and DNA repair (Moehrle et al....
Denoising Auto-Encoder-Enhanced Deep Non-Negative Matrix Factorization Clustering Model
Denoising Auto-Encoder-Enhanced Deep Non-Negative Matrix Factorization Clustering Model
Non-negative matrix factorization directly decomposes data features into a base matrix and community matrix, which are easily affected by noise. Multi-view datasets have multiple f...
Factorization structures, cones, and polytopes
Factorization structures, cones, and polytopes
Abstract
Factorization structures occur in toric differential and discrete geometry and can be viewed in multiple ways, e.g., as objects determining substantial classes of expli...
Sample-efficient Optimization Using Neural Networks
Sample-efficient Optimization Using Neural Networks
<p>The solution to many science and engineering problems includes identifying the minimum or maximum of an unknown continuous function whose evaluation inflicts non-negligibl...
Correlation of Mutational Signatures in Cancer Genes with General Signatures
Correlation of Mutational Signatures in Cancer Genes with General Signatures
The occurrence of various mutation patterns, such as changes in the DNA sequence and the loss of some sequences, is called a “mutational signature,” and they represent the molecula...
Figs S1-S9
Figs S1-S9
Fig. S1. Consensus phylogram (50 % majority rule) resulting from a Bayesian analysis of the ITS sequence alignment of sequences generated in this study and reference sequences from...

