Javascript must be enabled to continue!
Abstract 1295: Deep-learning profiling of regulatory somatic variants across clonal cancer evolution.
View through CrossRef
Abstract
Somatic noncoding mutations are abundant in cancer genomes, yet their regulatory functional consequences remain poorly understood. To address this gap, we generated a pan-cancer atlas of variant regulatory effects by profiling >35M SNVs within +/-20 kb of transcription start sites across more than 8,000 TCGA samples. Chromatin impact was predicted with the Sei deep-learning model (>20,000 features across 40 sequence classes), and clonal architecture was inferred with PyClone-VI, enabling genome-wide, evolution aware annotation of proximal regulatory alterations. We then aggregated these variant-level functional predictions into gene-level and patient-level representations of regulatory dysregulation. Despite comparable mutation counts near COSMIC Tier1/2 and non-COSMIC genes, the atlas revealed substantially higher predicted regulatory impact in promoter-proximal mutations, uncovering a noncoding functional dimension not detectable from burden alone. Aggregating these predicted effects into a personalized somatic dysregulation representation across COSMIC Hallmark genes produced a patient-level promoter-proximal regulatory effect score. In a multivariable Cox model of progression-free interval stratified by cancer type and adjusted for log1p(FGA), HRD score, non-silent mutations per Mb, and age at diagnosis, this regulatory score was a strong and independent predictor of adverse progression (HR = 1.10, 95% CI 1.04-1.15, P < 0.005), whereas noncoding mutation count contributed no additional information. Partitioning regulatory burden by evolutionary timing revealed distinct processes: clonal promoter-proximal disruption consistently predicted worse pan-cancer outcomes and showed stronger within-cancer associations, whereas subclonal burden was prognostic only in specific cancers. For example, subclonal promoter-proximal disruption within Hallmark genes was significantly associated with progression in Uterine Corpus Endometrial Carcinoma (HR = 1.36, 95% CI 1.10-1.70, P = 0.01), possibly indicating late-arising regulatory programs contributing to tumor progression. Finally, deep-learning regulatory sequence classes mapped to tissue-specific vulnerabilities: lower-grade glioma showed disproportionate impact from the E3 brain/melanocyte enhancer class, whereas sarcoma was most strongly associated with PC4 polycomb/bivalent stem-cell-linked disruption. Together, these results show that functional promoter-proximal disruption, not mutational load, captures clinically relevant noncoding dysregulation, shaped by context-specific and evolution-linked regulatory programs. This atlas provides a scalable foundation for integrating noncoding regulatory functions into prognostic modeling and will be made available to the research community.
Citation Format:
Ksenia Sokolova, Vessela N. Kristensen, Olga G. Troyanskaya. Deep-learning profiling of regulatory somatic variants across clonal cancer evolution [abstract]. In: Proceedings of the American Association for Cancer Research Annual Meeting 2026; Part 1 (Regular Abstracts); 2026 Apr 17-22; San Diego, CA. Philadelphia (PA): AACR; Cancer Res 2026;86(7 Suppl):Abstract nr 1295.
American Association for Cancer Research (AACR)
Title: Abstract 1295: Deep-learning profiling of regulatory somatic variants across clonal cancer evolution.
Description:
Abstract
Somatic noncoding mutations are abundant in cancer genomes, yet their regulatory functional consequences remain poorly understood.
To address this gap, we generated a pan-cancer atlas of variant regulatory effects by profiling >35M SNVs within +/-20 kb of transcription start sites across more than 8,000 TCGA samples.
Chromatin impact was predicted with the Sei deep-learning model (>20,000 features across 40 sequence classes), and clonal architecture was inferred with PyClone-VI, enabling genome-wide, evolution aware annotation of proximal regulatory alterations.
We then aggregated these variant-level functional predictions into gene-level and patient-level representations of regulatory dysregulation.
Despite comparable mutation counts near COSMIC Tier1/2 and non-COSMIC genes, the atlas revealed substantially higher predicted regulatory impact in promoter-proximal mutations, uncovering a noncoding functional dimension not detectable from burden alone.
Aggregating these predicted effects into a personalized somatic dysregulation representation across COSMIC Hallmark genes produced a patient-level promoter-proximal regulatory effect score.
In a multivariable Cox model of progression-free interval stratified by cancer type and adjusted for log1p(FGA), HRD score, non-silent mutations per Mb, and age at diagnosis, this regulatory score was a strong and independent predictor of adverse progression (HR = 1.
10, 95% CI 1.
04-1.
15, P < 0.
005), whereas noncoding mutation count contributed no additional information.
Partitioning regulatory burden by evolutionary timing revealed distinct processes: clonal promoter-proximal disruption consistently predicted worse pan-cancer outcomes and showed stronger within-cancer associations, whereas subclonal burden was prognostic only in specific cancers.
For example, subclonal promoter-proximal disruption within Hallmark genes was significantly associated with progression in Uterine Corpus Endometrial Carcinoma (HR = 1.
36, 95% CI 1.
10-1.
70, P = 0.
01), possibly indicating late-arising regulatory programs contributing to tumor progression.
Finally, deep-learning regulatory sequence classes mapped to tissue-specific vulnerabilities: lower-grade glioma showed disproportionate impact from the E3 brain/melanocyte enhancer class, whereas sarcoma was most strongly associated with PC4 polycomb/bivalent stem-cell-linked disruption.
Together, these results show that functional promoter-proximal disruption, not mutational load, captures clinically relevant noncoding dysregulation, shaped by context-specific and evolution-linked regulatory programs.
This atlas provides a scalable foundation for integrating noncoding regulatory functions into prognostic modeling and will be made available to the research community.
Citation Format:
Ksenia Sokolova, Vessela N.
Kristensen, Olga G.
Troyanskaya.
Deep-learning profiling of regulatory somatic variants across clonal cancer evolution [abstract].
In: Proceedings of the American Association for Cancer Research Annual Meeting 2026; Part 1 (Regular Abstracts); 2026 Apr 17-22; San Diego, CA.
Philadelphia (PA): AACR; Cancer Res 2026;86(7 Suppl):Abstract nr 1295.
Related Results
Abstract LB497: Portraits of clonal landscape and mutational signature in primary tumors and matched lung metastatic model of osteosarcoma
Abstract LB497: Portraits of clonal landscape and mutational signature in primary tumors and matched lung metastatic model of osteosarcoma
Abstract
Objective:
The development of lung metastasis following primary tumor diagnosis, resection and chemotherapy rema...
Distinct mutational landscapes when comparing germline and somatic cancer variants in forty tumor suppressor genes
Distinct mutational landscapes when comparing germline and somatic cancer variants in forty tumor suppressor genes
Abstract
Germline and somatic cancer variants in tumor suppressor genes (TSGs) share loss of function mechanisms with studies of a few genes (
...
Marker selection strategies for circulating tumor DNA guided by phylogenetic inference
Marker selection strategies for circulating tumor DNA guided by phylogenetic inference
Abstract
Motivation
Blood-based profiling of tumor DNA (“liquid biopsy”) has offered great prospects for non-invasive early can...
Abstract 2491: Targeted next generation sequencing identifies somatic variants in Egyptian breast cancer patients
Abstract 2491: Targeted next generation sequencing identifies somatic variants in Egyptian breast cancer patients
Abstract
Breast cancer (BC) is the leading cause of cancer-related death in women worldwide and its incidence is progressively increasing in Egypt. Recent advances i...
Abstract OI-1: OI-1 Decoding breast cancer predisposition genes
Abstract OI-1: OI-1 Decoding breast cancer predisposition genes
Abstract
Women with one or more first-degree female relatives with a history of breast cancer have a two-fold increased risk of developing breast cancer. This risk i...
CREATING LEARNING MEDIA IN TEACHING ENGLISH AT SMP MUHAMMADIYAH 2 PAGELARAN ACADEMIC YEAR 2020/2021
CREATING LEARNING MEDIA IN TEACHING ENGLISH AT SMP MUHAMMADIYAH 2 PAGELARAN ACADEMIC YEAR 2020/2021
The pandemic Covid-19 currently demands teachers to be able to use technology in teaching and learning process. But in reality there are still many teachers who have not been able ...
Leveraging cancer mutation data to predict the pathogenicity of germline missense variants
Leveraging cancer mutation data to predict the pathogenicity of germline missense variants
ABSTRACTInnovative and easy-to-implement strategies are needed to improve the pathogenicity assessment of rare germline missense variants. Somatic cancer driver mutations identifie...
Abstract 1678: Accurate detection of expressed variation in RNA-seq
Abstract 1678: Accurate detection of expressed variation in RNA-seq
Abstract
Somatic variant detection has long been an area of great interest and critical importance in cancer research and treatment. DNA-Seq based somatic variant...

