Javascript must be enabled to continue!
Importance of Transcript Variants in Transcriptome Analyses
View through CrossRef
RNA sequencing (RNA-Seq) has become a widely adopted technique for studying gene expression. However, conventional RNA-Seq analyses rely on gene expression (GE) values that aggregate all the transcripts produced under a single gene identifier, overlooking the complexity of transcript variants arising from different transcription start sites or alternative splicing. Transcript variants may encode proteins with diverse functional domains, or noncoding RNAs. This study explored the implications of neglecting transcript variants in RNA-Seq analyses. Among the 1334 transcription factor (TF) genes expressed in mouse embryonic stem (ES) or trophoblast stem (TS) cells, 652 were differentially expressed in TS cells based on GE values (365 upregulated and 287 downregulated, ≥absolute 2-fold changes, false discovery rate (FDR) p-value ≤ 0.05). The 365 upregulated genes expressed 883 transcript variants. Further transcript expression (TE) based analyses identified only 174 (<20%) of the 883 transcripts to be upregulated. The remaining 709 transcripts were either downregulated or showed no significant changes. Meanwhile, the 287 downregulated genes expressed 856 transcript variants and only 153 (<20%) of the 856 transcripts were downregulated. The other 703 transcripts were either upregulated or showed no significant change. Additionally, the 682 insignificant TF genes (GE values < absolute 2-fold changes and/or FDR p-values > 0.05) between ES and TS cells expressed 2215 transcript variants. These included 477 (>21%) differentially expressed transcripts (276 upregulated and 201 downregulated, ≥absolute 2-fold changes, FDR p-value ≤ 0.05). Hence, GE based RNA-Seq analyses do not represent accurate expression levels due to divergent transcripts expression from the same gene. Our findings show that by including transcript variants in RNA-Seq analyses, we can generate a precise understanding of a gene’s functional and regulatory landscape; ignoring the variants may result in an erroneous interpretation.
Title: Importance of Transcript Variants in Transcriptome Analyses
Description:
RNA sequencing (RNA-Seq) has become a widely adopted technique for studying gene expression.
However, conventional RNA-Seq analyses rely on gene expression (GE) values that aggregate all the transcripts produced under a single gene identifier, overlooking the complexity of transcript variants arising from different transcription start sites or alternative splicing.
Transcript variants may encode proteins with diverse functional domains, or noncoding RNAs.
This study explored the implications of neglecting transcript variants in RNA-Seq analyses.
Among the 1334 transcription factor (TF) genes expressed in mouse embryonic stem (ES) or trophoblast stem (TS) cells, 652 were differentially expressed in TS cells based on GE values (365 upregulated and 287 downregulated, ≥absolute 2-fold changes, false discovery rate (FDR) p-value ≤ 0.
05).
The 365 upregulated genes expressed 883 transcript variants.
Further transcript expression (TE) based analyses identified only 174 (<20%) of the 883 transcripts to be upregulated.
The remaining 709 transcripts were either downregulated or showed no significant changes.
Meanwhile, the 287 downregulated genes expressed 856 transcript variants and only 153 (<20%) of the 856 transcripts were downregulated.
The other 703 transcripts were either upregulated or showed no significant change.
Additionally, the 682 insignificant TF genes (GE values < absolute 2-fold changes and/or FDR p-values > 0.
05) between ES and TS cells expressed 2215 transcript variants.
These included 477 (>21%) differentially expressed transcripts (276 upregulated and 201 downregulated, ≥absolute 2-fold changes, FDR p-value ≤ 0.
05).
Hence, GE based RNA-Seq analyses do not represent accurate expression levels due to divergent transcripts expression from the same gene.
Our findings show that by including transcript variants in RNA-Seq analyses, we can generate a precise understanding of a gene’s functional and regulatory landscape; ignoring the variants may result in an erroneous interpretation.
Related Results
Importance of transcript variants in transcriptome analyses
Importance of transcript variants in transcriptome analyses
Abstract
RNA sequencing (RNA-Seq) has become a widely adopted genome-wide technique for investigating gene expression patterns. However, conventi...
Clinical Implications of Germline Predisposition Gene Variants in Patients with Refractory or Relapsed B Acute Lymphoblastic Leukemia
Clinical Implications of Germline Predisposition Gene Variants in Patients with Refractory or Relapsed B Acute Lymphoblastic Leukemia
Objectives:Gene variants are important factors in prognosis of the patients with hematological malignancies. In current study, our team investigate the relationship between blood a...
Marfan syndrome: genetic variant determinants of cardiovascular outcomes
Marfan syndrome: genetic variant determinants of cardiovascular outcomes
Abstract
Background
Marfan syndrome is a systemic connective tissue disorder caused by genetic variants in the fibrillin-1 (FBN1...
Abstract 1490: Molecular function of the read-through transcript PRR5-ARHGAP8
Abstract 1490: Molecular function of the read-through transcript PRR5-ARHGAP8
Abstract
Background: Ovarian carcinoma is one of the most fatal malignancies in females. From the analysis of RNA-seq we have previously conducted to study the trans...
The utility of transcriptomics in the conservation of sensitive and economically important species
The utility of transcriptomics in the conservation of sensitive and economically important species
The connection between the central dogma of biology [DNA --(Transcription)---› RNA –(Translation)--› Protein] and the 'omics' resources obtained from each molecule are now being ex...
High Rate of BCR-ABL Transcript Undetectability Achieved by Treating with Imatinib Mesylate Very Late CML Patients in Stable CCR after IFN.
High Rate of BCR-ABL Transcript Undetectability Achieved by Treating with Imatinib Mesylate Very Late CML Patients in Stable CCR after IFN.
Abstract
Background. Interferon alfa (IFN a) induces complete cytogenetic response (CCR) in small proportion of CML patients, with almost all of these patients still...
Abstract 4891: Genome-first approach identifies shelterin complex gene variants and associated cancers
Abstract 4891: Genome-first approach identifies shelterin complex gene variants and associated cancers
Abstract
Telomeres, nucleotide repeats and a six-protein complex called shelterin at chromosome ends, are critical for genomic integrity. Rare heterozygous pathogeni...
Effect of
de novo
transcriptome assembly on transcript quantification
Effect of
de novo
transcriptome assembly on transcript quantification
Abstract
Background
Correct quantification of transcript expression is essential to understand the functio...

