Search engine for discovering works of Art, research articles, and books related to Art and Culture
ShareThis
Javascript must be enabled to continue!

Importance of transcript variants in transcriptome analyses

View through CrossRef
Abstract RNA sequencing (RNA-Seq) has become a widely adopted genome-wide technique for investigating gene expression patterns. However, conventional RNA-Seq analyses typically rely on gene expression (GE) values that aggregate all the transcripts produced by a gene under a single identifier, overlooking the complexity of transcript variants arising from different transcription start sites and alternative splicing events. In this study, we explored the implications of neglecting transcript variants in RNA-Seq analyses. Among the 1334 transcription factor (TF) genes expressed in mouse embryonic stem (ES) or trophoblast stem (TS) cells, 652 were reported to be differentially expressed in TS cells based on GE values (365 upregulated and 287 downregulated, ≥2-fold, FDR p -value ≤0.05). Intriguingly, differential gene expression analysis revealed that of the 365 upregulated genes, 883 transcript variants were expressed, with only 174 (<20%) variants exhibiting upregulation based on transcript expression (TE) values. The remaining 709 (>80%) variants were either down-regulated or showed no significant change in expression analysis. Similarly, the 287 genes reported to be downregulated expressed 856 transcript variants, with only 153 (<20%) downregulated variants and 703 (>82%) variants that were upregulated or showed no significant changes. Additionally, the 682 TF genes that did not show significant changes between ES and TS cells (GE values < 2-fold changes and/or FDR p-values >0.05) expressed 2215 transcript variants, which included 477 (>21%) that were differentially expressed (276 upregulated and 201 downregulated, ≥2-fold, FDR p-value ≤0.05). Notably, a particular gene does not express just one protein; rather its transcript variants encode multiple proteins with distinct functional domains, including non-coding regulatory RNAs. Our findings underscore the critical necessity of considering transcript variants in RNA-Seq analyses. Doing so may enable a more precise understanding of the intricate functional and regulatory landscape of genes; ignoring the variants may result in an erroneous interpretation. Graphic Abstract Differential expression of transcription factors (TFs) between mouse embryonic stem (ES) cells and trophoblast stem (TS) cells. This graphic presentation clearly demonstrates the importance of including transcript variants during RNA sequencing (RNA-Seq) analyses. Panel A represents the conventional differential gene expression analysis approach after RNA-Seq, where all transcript reads are taken under a single gene name. Panel B takes differential gene expression analysis one step further by examining all the transcript variants that were previously hidden under the main gene name. Our results indicate that exclusive gene expression (GE) analysis inaccurately defines over 80% of the transcript expression (TE). Without analyses of all the transcript variants’ reads, we fail to uncover the functional importance of the variants and the regulation of their expression. Both GE and TE values are expressed as transcript per million (TPM). Data analyses were performed by using CLC Genomics Workbench.
Title: Importance of transcript variants in transcriptome analyses
Description:
Abstract RNA sequencing (RNA-Seq) has become a widely adopted genome-wide technique for investigating gene expression patterns.
However, conventional RNA-Seq analyses typically rely on gene expression (GE) values that aggregate all the transcripts produced by a gene under a single identifier, overlooking the complexity of transcript variants arising from different transcription start sites and alternative splicing events.
In this study, we explored the implications of neglecting transcript variants in RNA-Seq analyses.
Among the 1334 transcription factor (TF) genes expressed in mouse embryonic stem (ES) or trophoblast stem (TS) cells, 652 were reported to be differentially expressed in TS cells based on GE values (365 upregulated and 287 downregulated, ≥2-fold, FDR p -value ≤0.
05).
Intriguingly, differential gene expression analysis revealed that of the 365 upregulated genes, 883 transcript variants were expressed, with only 174 (<20%) variants exhibiting upregulation based on transcript expression (TE) values.
The remaining 709 (>80%) variants were either down-regulated or showed no significant change in expression analysis.
Similarly, the 287 genes reported to be downregulated expressed 856 transcript variants, with only 153 (<20%) downregulated variants and 703 (>82%) variants that were upregulated or showed no significant changes.
Additionally, the 682 TF genes that did not show significant changes between ES and TS cells (GE values < 2-fold changes and/or FDR p-values >0.
05) expressed 2215 transcript variants, which included 477 (>21%) that were differentially expressed (276 upregulated and 201 downregulated, ≥2-fold, FDR p-value ≤0.
05).
Notably, a particular gene does not express just one protein; rather its transcript variants encode multiple proteins with distinct functional domains, including non-coding regulatory RNAs.
Our findings underscore the critical necessity of considering transcript variants in RNA-Seq analyses.
Doing so may enable a more precise understanding of the intricate functional and regulatory landscape of genes; ignoring the variants may result in an erroneous interpretation.
Graphic Abstract Differential expression of transcription factors (TFs) between mouse embryonic stem (ES) cells and trophoblast stem (TS) cells.
This graphic presentation clearly demonstrates the importance of including transcript variants during RNA sequencing (RNA-Seq) analyses.
Panel A represents the conventional differential gene expression analysis approach after RNA-Seq, where all transcript reads are taken under a single gene name.
Panel B takes differential gene expression analysis one step further by examining all the transcript variants that were previously hidden under the main gene name.
Our results indicate that exclusive gene expression (GE) analysis inaccurately defines over 80% of the transcript expression (TE).
Without analyses of all the transcript variants’ reads, we fail to uncover the functional importance of the variants and the regulation of their expression.
Both GE and TE values are expressed as transcript per million (TPM).
Data analyses were performed by using CLC Genomics Workbench.

Related Results

Importance of Transcript Variants in Transcriptome Analyses
Importance of Transcript Variants in Transcriptome Analyses
RNA sequencing (RNA-Seq) has become a widely adopted technique for studying gene expression. However, conventional RNA-Seq analyses rely on gene expression (GE) values that aggrega...
Clinical Implications of Germline Predisposition Gene Variants in Patients with Refractory or Relapsed B Acute Lymphoblastic Leukemia
Clinical Implications of Germline Predisposition Gene Variants in Patients with Refractory or Relapsed B Acute Lymphoblastic Leukemia
Objectives:Gene variants are important factors in prognosis of the patients with hematological malignancies. In current study, our team investigate the relationship between blood a...
Abstract 1490: Molecular function of the read-through transcript PRR5-ARHGAP8
Abstract 1490: Molecular function of the read-through transcript PRR5-ARHGAP8
Abstract Background: Ovarian carcinoma is one of the most fatal malignancies in females. From the analysis of RNA-seq we have previously conducted to study the trans...
The utility of transcriptomics in the conservation of sensitive and economically important species
The utility of transcriptomics in the conservation of sensitive and economically important species
The connection between the central dogma of biology [DNA --(Transcription)---› RNA –(Translation)--› Protein] and the 'omics' resources obtained from each molecule are now being ex...
Current Updates on Variants of SARS‐CoV‐ 2: Systematic Review
Current Updates on Variants of SARS‐CoV‐ 2: Systematic Review
ABSTRACTBackgroundCoronavirus disease 2019 is caused by the severe acute respiratory syndrome coronavirus 2, which has become a pandemic. Severe acute respiratory syndrome coronavi...
Transcriptome changes and cAMP oscillations in an archaeal cell cycle
Transcriptome changes and cAMP oscillations in an archaeal cell cycle
Abstract Background The cell cycle of all organisms includes mass increase by a factor of two, replication of the genetic material, segregation o...

Back to Top