Javascript must be enabled to continue!
Cancer Classification from Healthy DNA
View through CrossRef
ABSTRACTThe genome is traditionally viewed as atime-independentsource of information; a paradigm that drives researchers to seek correlations between the presence of certain genes and a patient’s risk of disease. This analysis neglectsgenomic temporal changes, which we believe to be a crucial signal for predicting an individual’s susceptibility to cancer. We hypothesize that each individual’s genome passes through anevolution channel(The termchannelis motivated by the notion of communication channel introduced by Shannon1 in 1948 and started the area ofInformation Theory), that is controlled by hereditary, environmental and stochastic factors. This channel differs among individuals, giving rise to varying predispositions to developing cancer. We introduce the concept ofmutation profilesthat are computed without any comparative analysis, but by analyzing the short tandem repeat regions in asingle healthy genomeand capturing information about the individual’s evolution channel. Using machine learning on data from more than 5,000 TCGA cancer patients, we demonstrate that thesemutation profilescan accurately distinguish between patients with various types of cancer. For example, the pairwise validation accuracy of the classifier between PAAD (pancreas) patients and GBM (brain) patients is 93%. Our results show that healthy unaffected cells still contain a cancer-specific signal, which opens the possibility of cancer prediction from a healthy genome.
Title: Cancer Classification from Healthy DNA
Description:
ABSTRACTThe genome is traditionally viewed as atime-independentsource of information; a paradigm that drives researchers to seek correlations between the presence of certain genes and a patient’s risk of disease.
This analysis neglectsgenomic temporal changes, which we believe to be a crucial signal for predicting an individual’s susceptibility to cancer.
We hypothesize that each individual’s genome passes through anevolution channel(The termchannelis motivated by the notion of communication channel introduced by Shannon1 in 1948 and started the area ofInformation Theory), that is controlled by hereditary, environmental and stochastic factors.
This channel differs among individuals, giving rise to varying predispositions to developing cancer.
We introduce the concept ofmutation profilesthat are computed without any comparative analysis, but by analyzing the short tandem repeat regions in asingle healthy genomeand capturing information about the individual’s evolution channel.
Using machine learning on data from more than 5,000 TCGA cancer patients, we demonstrate that thesemutation profilescan accurately distinguish between patients with various types of cancer.
For example, the pairwise validation accuracy of the classifier between PAAD (pancreas) patients and GBM (brain) patients is 93%.
Our results show that healthy unaffected cells still contain a cancer-specific signal, which opens the possibility of cancer prediction from a healthy genome.
Related Results
Genome wide hypomethylation and youth-associated DNA gap reduction promoting DNA damage and senescence-associated pathogenesis
Genome wide hypomethylation and youth-associated DNA gap reduction promoting DNA damage and senescence-associated pathogenesis
Abstract
Background: Age-associated epigenetic alteration is the underlying cause of DNA damage in aging cells. Two types of youth-associated DNA-protection epigenetic mark...
Echinococcus granulosus in Environmental Samples: A Cross-Sectional Molecular Study
Echinococcus granulosus in Environmental Samples: A Cross-Sectional Molecular Study
Abstract
Introduction
Echinococcosis, caused by tapeworms of the Echinococcus genus, remains a significant zoonotic disease globally. The disease is particularly prevalent in areas...
Are Cervical Ribs Indicators of Childhood Cancer? A Narrative Review
Are Cervical Ribs Indicators of Childhood Cancer? A Narrative Review
Abstract
A cervical rib (CR), also known as a supernumerary or extra rib, is an additional rib that forms above the first rib, resulting from the overgrowth of the transverse proce...
Diagnostic Rate of the Cancer by BDORT Utilizing the Cancer Slide
Diagnostic Rate of the Cancer by BDORT Utilizing the Cancer Slide
Purpose:
To make a diagnosis of cancer with BDORT (resonance test), we can choose two methods. One is to use a chemical agent like Integrin α5β1 or Oncogene C-f...
Edoxaban and Cancer-Associated Venous Thromboembolism: A Meta-analysis of Clinical Trials
Edoxaban and Cancer-Associated Venous Thromboembolism: A Meta-analysis of Clinical Trials
Abstract
Introduction
Cancer patients face a venous thromboembolism (VTE) risk that is up to 50 times higher compared to individuals without cancer. In 2010, direct oral anticoagul...
Abstract 1490: RAD51C-deficient cancer cells require DNA polymerase zeta to bypass cisplatin-induced lesion
Abstract 1490: RAD51C-deficient cancer cells require DNA polymerase zeta to bypass cisplatin-induced lesion
RAD51C is a RAD51 paralog protein that mediates RAD51 filament formation on single-stranded DNA (ssDNA) in a canonical homologous recombination (HR) pathway. This step is vital for...
Abstract OI-1: OI-1 Decoding breast cancer predisposition genes
Abstract OI-1: OI-1 Decoding breast cancer predisposition genes
Abstract
Women with one or more first-degree female relatives with a history of breast cancer have a two-fold increased risk of developing breast cancer. This risk i...
Abstract 1624: Antigen-independent de novo prediction of cancer-associated TCR repertoire
Abstract 1624: Antigen-independent de novo prediction of cancer-associated TCR repertoire
Abstract
Cancer-associated T cells play a critical role in mediating immune responses in the anti-tumor immunity. However, due to the complex nature of cancer antige...

