Search engine for discovering works of Art, research articles, and books related to Art and Culture
ShareThis
Javascript must be enabled to continue!

Predicting regional somatic mutation rates using DNA motifs

View through CrossRef
Abstract How the locus-specificity of epigenetic modifications is regulated remains an unanswered question. A contributing mechanism is that epigenetic enzymes are recruited to specific loci by DNA binding factors recognizing particular sequence motifs (referred to as epi-motifs). Using these motifs to predict biological outputs depending on local epigenetic state such as somatic mutation rates would confirm their functionality. Here, we used DNA motifs including known TF motifs and epi-motifs as a surrogate of epigenetic signals to predict somatic mutation rates in 13 cancers at an average 23kbp resolution. We implemented an interpretable neural network model, called contextual regression, to successfully learn the universal relationship between mutations and DNA motifs, and uncovered motifs that are most impactful on the regional mutation rates such as TP53 and epi-motifs associated with H3K9me3. Furthermore, we identified genomic regions with significantly higher mutation rates than the expected values in each individual tumor and demonstrated that such cancer-specific regions can accurately predict cancer types. (The code is available from https://github.com/Wang-lab-UCSD/SomaticMutation ) Significance Statement The relationship between DNA motifs and somatic mutation rates in cancers is not fully understood, especially at high resolution. Here we developed an interpretable neural network model to successfully predict somatic mutation rates using DNA motifs in 13 diverse cancers and identified the most informative motifs. Furthermore, we showed that the genomic regions with significant higher mutation rates than the predicted values can be used for cancer classification.
Title: Predicting regional somatic mutation rates using DNA motifs
Description:
Abstract How the locus-specificity of epigenetic modifications is regulated remains an unanswered question.
A contributing mechanism is that epigenetic enzymes are recruited to specific loci by DNA binding factors recognizing particular sequence motifs (referred to as epi-motifs).
Using these motifs to predict biological outputs depending on local epigenetic state such as somatic mutation rates would confirm their functionality.
Here, we used DNA motifs including known TF motifs and epi-motifs as a surrogate of epigenetic signals to predict somatic mutation rates in 13 cancers at an average 23kbp resolution.
We implemented an interpretable neural network model, called contextual regression, to successfully learn the universal relationship between mutations and DNA motifs, and uncovered motifs that are most impactful on the regional mutation rates such as TP53 and epi-motifs associated with H3K9me3.
Furthermore, we identified genomic regions with significantly higher mutation rates than the expected values in each individual tumor and demonstrated that such cancer-specific regions can accurately predict cancer types.
(The code is available from https://github.
com/Wang-lab-UCSD/SomaticMutation ) Significance Statement The relationship between DNA motifs and somatic mutation rates in cancers is not fully understood, especially at high resolution.
Here we developed an interpretable neural network model to successfully predict somatic mutation rates using DNA motifs in 13 diverse cancers and identified the most informative motifs.
Furthermore, we showed that the genomic regions with significant higher mutation rates than the predicted values can be used for cancer classification.

Related Results

JIT 2023 - Jornadas de Jóvenes Investigadores Tecnológicos
JIT 2023 - Jornadas de Jóvenes Investigadores Tecnológicos
Es un honor presentar este libro que compila los trabajos de investigación y desarrollo presentados en las Jornadas de Jóvenes Investigadores Tecnológicos (JIT) 2023. Este evento s...
Genome wide hypomethylation and youth-associated DNA gap reduction promoting DNA damage and senescence-associated pathogenesis
Genome wide hypomethylation and youth-associated DNA gap reduction promoting DNA damage and senescence-associated pathogenesis
Abstract Background: Age-associated epigenetic alteration is the underlying cause of DNA damage in aging cells. Two types of youth-associated DNA-protection epigenetic mark...
Genome wide hypomethylation and youth-associated DNA gap reduction promoting DNA damage and senescence-associated pathogenesis
Genome wide hypomethylation and youth-associated DNA gap reduction promoting DNA damage and senescence-associated pathogenesis
Introduction: The United States currently faces two opioid crises, an evolved crisis currently manifesting as widespread abuse of illicit opioids, and a crisis in pain management l...
Echinococcus granulosus in Environmental Samples: A Cross-Sectional Molecular Study
Echinococcus granulosus in Environmental Samples: A Cross-Sectional Molecular Study
Abstract Introduction Echinococcosis, caused by tapeworms of the Echinococcus genus, remains a significant zoonotic disease globally. The disease is particularly prevalent in areas...
Cell cycle alterations associate with a redistribution of mutation rates across chromosomal domains in human cancers
Cell cycle alterations associate with a redistribution of mutation rates across chromosomal domains in human cancers
AbstractSomatic mutations in human cells have a highly heterogeneous genomic distribution, with increased burden in late-replication time (RT), heterochromatic domains of chromosom...
Abstract 4594: Detection of colorectal cancer related gene mutations from CTC and ctDNA
Abstract 4594: Detection of colorectal cancer related gene mutations from CTC and ctDNA
Abstract [Background] It is well known that emerging mutations which is not found in the primary tumor exist in metastatic tumor and molecular therapy induce emergin...
Expression of Specialized Error-Prone DNA Polymerases in Chronic Lymphocytic Leukemia (CLL).
Expression of Specialized Error-Prone DNA Polymerases in Chronic Lymphocytic Leukemia (CLL).
Abstract Specialized DNA polymerases are low fidelity enzymes required to bypass DNA lesions that are also involved in somatic hypermutation. Deregulation of these e...
Spatial control of protein binding with DNA nanostructures
Spatial control of protein binding with DNA nanostructures
<p dir="ltr">The physical and chemical properties of DNA, including its structure predictability thanks to Watson-Crick base pairing, make it into an obvious polymer of choic...

Back to Top