Search engine for discovering works of Art, research articles, and books related to Art and Culture
ShareThis
Javascript must be enabled to continue!

Homeolog expression quantification methods for allopolyploids

View through CrossRef
AbstractGenome duplication with hybridization, or allopolyploidization, occurs in animals, fungi, and plants, and is especially common in crop plants. There is increasing interest in the study of allopolyploids due to advances in polyploid genome assembly, however the high level of sequence similarity in duplicated gene copies (homeologs) pose many challenges. Here we compared standard RNA-seq expression quantification approaches used currently for diploid species against subgenome-classification approaches which maps reads to each subgenome separately. We examined mapping error using our previous and new RNA-seq data in which a subgenome is experimentally added (synthetic allotetraploid Arabidopsis kamchatica) or reduced (allohexaploid wheat Triticum aestivum versus extracted allotetraploid) as ground truth. The error rates in the two species were very similar. The standard approaches showed higher error rates (> 10% using pseudo-alignment with Kallisto) while subgenome-classification approaches showed much lower error rates (< 1% using EAGLE-RC, < 2% using HomeoRoq). Although downstream analysis may partly mitigate mapping errors, the difference in methods was substantial in hexaploid wheat, where Kallisto appeared to have systematic differences relative to other methods. Only approximately half of the differentially expressed homeologs detected using Kallisto overlapped with those by any other method. In general, disagreement in low expression genes was responsible for most of the discordance between methods, which is consistent with known biases in Kallisto. We also observed that there exist uncertainties in genome sequences and annotation which can affect each method differently. Overall, subgenome-classification approaches tend to perform better than standard approaches with EAGLE-RC having the highest precision.
Title: Homeolog expression quantification methods for allopolyploids
Description:
AbstractGenome duplication with hybridization, or allopolyploidization, occurs in animals, fungi, and plants, and is especially common in crop plants.
There is increasing interest in the study of allopolyploids due to advances in polyploid genome assembly, however the high level of sequence similarity in duplicated gene copies (homeologs) pose many challenges.
Here we compared standard RNA-seq expression quantification approaches used currently for diploid species against subgenome-classification approaches which maps reads to each subgenome separately.
We examined mapping error using our previous and new RNA-seq data in which a subgenome is experimentally added (synthetic allotetraploid Arabidopsis kamchatica) or reduced (allohexaploid wheat Triticum aestivum versus extracted allotetraploid) as ground truth.
The error rates in the two species were very similar.
The standard approaches showed higher error rates (> 10% using pseudo-alignment with Kallisto) while subgenome-classification approaches showed much lower error rates (< 1% using EAGLE-RC, < 2% using HomeoRoq).
Although downstream analysis may partly mitigate mapping errors, the difference in methods was substantial in hexaploid wheat, where Kallisto appeared to have systematic differences relative to other methods.
Only approximately half of the differentially expressed homeologs detected using Kallisto overlapped with those by any other method.
In general, disagreement in low expression genes was responsible for most of the discordance between methods, which is consistent with known biases in Kallisto.
We also observed that there exist uncertainties in genome sequences and annotation which can affect each method differently.
Overall, subgenome-classification approaches tend to perform better than standard approaches with EAGLE-RC having the highest precision.

Related Results

Prognostic Implications Of PRAME Expression Levels In Myelodysplastic Syndromes
Prognostic Implications Of PRAME Expression Levels In Myelodysplastic Syndromes
Abstract The preferentially expressed antigen of the melanoma (PRAME) gene was first identified in melanoma tissue as a tumour-associated antigen recognized by autol...
Gene-tree reconciliation with MUL-trees to resolve polyploidy events
Gene-tree reconciliation with MUL-trees to resolve polyploidy events
Abstract Polyploidy can have a huge impact on the evolution of species, and it is a common occurrence, especially in plants. The two types of polyploids - autopolyp...
Absolute quantification in brain SPECT imaging
Absolute quantification in brain SPECT imaging
Certes malalties neurològiques estan associades amb problemes en els sistemes de neurotransmissió. Una aproximació a l'estudi d'aquests sistemes és la tomografia d'emissió SPECT (S...
Target site bioanalysis and pharmacokinetics of antileishmanial drugs
Target site bioanalysis and pharmacokinetics of antileishmanial drugs
This thesis focuses on bioanalytical method development and validation of antileishmanial drugs amphotericin B, miltefosine, and paromomycin in human plasma and human skin tissue f...
Abstract P1-05-23: Utilities and challenges of RNA-Seq based expression and variant calling in a clinical setting
Abstract P1-05-23: Utilities and challenges of RNA-Seq based expression and variant calling in a clinical setting
Abstract Introduction Variant calling based on DNA samples has been the gold standard of clinical testing since the advent of Sanger sequencing. The u...
Quantification des sous-algèbres de Lie coisotropes
Quantification des sous-algèbres de Lie coisotropes
L’objet de cette thèse est l’étude de l’existence d’une quantification pour les sous-algèbres de Lie coisotropes des bigèbres de Lie. Une sous-algèbre de Lie coisotrope d’une bigèb...

Back to Top