Javascript must be enabled to continue!
Abstract 1678: Accurate detection of expressed variation in RNA-seq
View through CrossRef
Abstract
Somatic variant detection has long been an area of great interest and critical importance in cancer research and treatment. DNA-Seq based somatic variant calling pipelines have been maturing over the last several years and although there is still room for improvement, several methods have been developed capable of accurately detecting somatic variants from matched tumor/normal DNA. RNA-Seq has become a widely used tool for measuring gene and isoform abundance, identifying alternative splicing, and detection of gene fusions. Detection of expressed variation is receiving growing interest, however computational methods to detect this variation have received far less attention than that of DNA-Seq. We have developed an optimized RNA-Seq pipeline based upon the ABRA2 realigner capable of accurately detecting expressed somatic variation in RNA-Seq.
We applied this pipeline to the TCGA Breast Cancer dataset. To assess the impact of identification of expressed variation as an indicator of variant significance, we compared the expression state of the top 20 cohort wide significantly mutated genes (SMG) as identified by The Cancer Genome Atlas Network (2012) with all other genes. At the median for SMGs, 23 somatic variants are expressed with over 90% of variants detected in DNA being expressed. For non-SMGs, the median number of expressed variants is 1 and the median fraction of expressed variants is 50%. Among non-SMGs, a total of six genes exceed the median total number of expressed variants and fraction of expressed variants for SMGs (AHNAK, CHD4, ERBB2, FASN, FOXA1 and MYH9). All of these variants have been previously implicated in breast cancer, however none of these variants were among the cohort wide SMGs identified in the original study and only FOXA1 was identified as being a subtype specific SMG (ER+,ER+/HER2-). Notably, variants in the massive TTN gene are found to be coding 73% of the time, but expressed in only 8% of cases. This is substantially less than the 90% found in SMGs and a likely indicator that TTN mutations are passenger variants. The median and third quartile RNA variant allele frequency (VAF) for SMGs is .41 and .69 versus .32 and .46 for DNA. By comparison, the median and third quartile VAF for non-SMGS was 0 and .27 in RNA and .20 and .30 for DNA. The increased VAF for RNA SMGs is a likely indicator of selection and can potentially further be used to identify variants of significance. We believe these results demonstrate the utility of variant expression as a potentail tool to aid in assessment of variant significance.
Citation Format: Lisle E. Mose, David Marron, Joel S. Parker. Accurate detection of expressed variation in RNA-seq [abstract]. In: Proceedings of the American Association for Cancer Research Annual Meeting 2019; 2019 Mar 29-Apr 3; Atlanta, GA. Philadelphia (PA): AACR; Cancer Res 2019;79(13 Suppl):Abstract nr 1678.
American Association for Cancer Research (AACR)
Title: Abstract 1678: Accurate detection of expressed variation in RNA-seq
Description:
Abstract
Somatic variant detection has long been an area of great interest and critical importance in cancer research and treatment.
DNA-Seq based somatic variant calling pipelines have been maturing over the last several years and although there is still room for improvement, several methods have been developed capable of accurately detecting somatic variants from matched tumor/normal DNA.
RNA-Seq has become a widely used tool for measuring gene and isoform abundance, identifying alternative splicing, and detection of gene fusions.
Detection of expressed variation is receiving growing interest, however computational methods to detect this variation have received far less attention than that of DNA-Seq.
We have developed an optimized RNA-Seq pipeline based upon the ABRA2 realigner capable of accurately detecting expressed somatic variation in RNA-Seq.
We applied this pipeline to the TCGA Breast Cancer dataset.
To assess the impact of identification of expressed variation as an indicator of variant significance, we compared the expression state of the top 20 cohort wide significantly mutated genes (SMG) as identified by The Cancer Genome Atlas Network (2012) with all other genes.
At the median for SMGs, 23 somatic variants are expressed with over 90% of variants detected in DNA being expressed.
For non-SMGs, the median number of expressed variants is 1 and the median fraction of expressed variants is 50%.
Among non-SMGs, a total of six genes exceed the median total number of expressed variants and fraction of expressed variants for SMGs (AHNAK, CHD4, ERBB2, FASN, FOXA1 and MYH9).
All of these variants have been previously implicated in breast cancer, however none of these variants were among the cohort wide SMGs identified in the original study and only FOXA1 was identified as being a subtype specific SMG (ER+,ER+/HER2-).
Notably, variants in the massive TTN gene are found to be coding 73% of the time, but expressed in only 8% of cases.
This is substantially less than the 90% found in SMGs and a likely indicator that TTN mutations are passenger variants.
The median and third quartile RNA variant allele frequency (VAF) for SMGs is .
41 and .
69 versus .
32 and .
46 for DNA.
By comparison, the median and third quartile VAF for non-SMGS was 0 and .
27 in RNA and .
20 and .
30 for DNA.
The increased VAF for RNA SMGs is a likely indicator of selection and can potentially further be used to identify variants of significance.
We believe these results demonstrate the utility of variant expression as a potentail tool to aid in assessment of variant significance.
Citation Format: Lisle E.
Mose, David Marron, Joel S.
Parker.
Accurate detection of expressed variation in RNA-seq [abstract].
In: Proceedings of the American Association for Cancer Research Annual Meeting 2019; 2019 Mar 29-Apr 3; Atlanta, GA.
Philadelphia (PA): AACR; Cancer Res 2019;79(13 Suppl):Abstract nr 1678.
Related Results
Abstract P1-05-23: Utilities and challenges of RNA-Seq based expression and variant calling in a clinical setting
Abstract P1-05-23: Utilities and challenges of RNA-Seq based expression and variant calling in a clinical setting
Abstract
Introduction
Variant calling based on DNA samples has been the gold standard of clinical testing since the advent of Sanger sequencing. The u...
Detection of Multiple Types of Cancer Driver Mutations Using Targeted RNA Sequencing in NSCLC
Detection of Multiple Types of Cancer Driver Mutations Using Targeted RNA Sequencing in NSCLC
ABSTRACTCurrently, DNA and RNA are used separately to capture different types of gene mutations. DNA is commonly used for the detection of SNVs, indels and CNVs; RNA is used for an...
Detecting RNA–RNA interactome
Detecting RNA–RNA interactome
AbstractThe last decade has seen a robust increase in various types of novel RNA molecules and their complexity in gene regulation. RNA molecules play a critical role in cellular e...
Abstract 2323: Deciphering RNA degradation: Insights from a comparative analysis of paired fresh frozen/FFPE total RNA-seq
Abstract 2323: Deciphering RNA degradation: Insights from a comparative analysis of paired fresh frozen/FFPE total RNA-seq
Abstract
Background: Fresh frozen (FF) and formalin-fixed paraffin-embedded (FFPE) samples are primary resources for archival tissues in cancer studies. Despite the ...
Diagnostic Validation of a Clinical Laboratory-Oriented Targeted RNA Sequencing System As a Comprehensive Assay for Hematologic Malignancies
Diagnostic Validation of a Clinical Laboratory-Oriented Targeted RNA Sequencing System As a Comprehensive Assay for Hematologic Malignancies
Introduction Targeted RNA sequencing (RNA-seq) is a highly accurate method for sequencing transcripts of interest and can overcome limitations regarding resolution, throughput, and...
Generating Synthetic Single Cell Data from Bulk RNA-seq Using a Pretrained Variational Autoencoder
Generating Synthetic Single Cell Data from Bulk RNA-seq Using a Pretrained Variational Autoencoder
AbstractSingle cell RNA sequencing (scRNA-seq) is a powerful approach which generates genome-wide gene expression profiles at single cell resolution. Among its many applications, i...
Abstract 4875: HIVE Proteomics: Integrated, cloud-based RNA-Seq and proteomics analysis of prostate adenocarcinoma samples
Abstract 4875: HIVE Proteomics: Integrated, cloud-based RNA-Seq and proteomics analysis of prostate adenocarcinoma samples
Abstract
Automated bottom-up proteomics workflows implemented with modern mass-spectrometry instrumentation can readily generate millions of peptide fragmentation sp...
Benchmarking Algorithms for Gene Set Scoring of Single-cell ATAC-seq Data
Benchmarking Algorithms for Gene Set Scoring of Single-cell ATAC-seq Data
AbstractGene set scoring (GSS) has been routinely conducted for gene expression analysis of bulk or single-cell RNA-seq data, which helps to decipher single-cell heterogeneity and ...

