Search engine for discovering works of Art, research articles, and books related to Art and Culture
ShareThis
Javascript must be enabled to continue!

EASTR: Correcting systematic alignment errors in multi-exon genes

View through CrossRef
Abstract Accurate alignment of transcribed RNA to reference genomes is a critical step in the analysis of gene expression, which in turn has broad applications in biomedical research and in the basic sciences. We have discovered that widely used splice-aware aligners, such as STAR and HISAT2, can introduce erroneous spliced alignments between repeated sequences, leading to the inclusion of falsely spliced transcripts in RNA-seq experiments. In some cases, the “phantom” introns resulting from these errors have made their way into widely-used genome annotation databases. To address this issue, we have developed EASTR (Emending Alignments of Spliced Transcript Reads), a novel software tool that can detect and remove falsely spliced alignments or transcripts from alignment and annotation files. EASTR improves the accuracy of spliced alignments across diverse species, including human, maize, and Arabidopsis thaliana , by detecting sequence similarity between intron-flanking regions. We demonstrate that applying EASTR before transcript assembly substantially reduces false positive introns, exons, and transcripts, improving the overall accuracy of assembled transcripts. Additionally, we show that EASTR’s application to reference annotation databases can detect and correct likely cases of mis-annotated transcripts.
Title: EASTR: Correcting systematic alignment errors in multi-exon genes
Description:
Abstract Accurate alignment of transcribed RNA to reference genomes is a critical step in the analysis of gene expression, which in turn has broad applications in biomedical research and in the basic sciences.
We have discovered that widely used splice-aware aligners, such as STAR and HISAT2, can introduce erroneous spliced alignments between repeated sequences, leading to the inclusion of falsely spliced transcripts in RNA-seq experiments.
In some cases, the “phantom” introns resulting from these errors have made their way into widely-used genome annotation databases.
To address this issue, we have developed EASTR (Emending Alignments of Spliced Transcript Reads), a novel software tool that can detect and remove falsely spliced alignments or transcripts from alignment and annotation files.
EASTR improves the accuracy of spliced alignments across diverse species, including human, maize, and Arabidopsis thaliana , by detecting sequence similarity between intron-flanking regions.
We demonstrate that applying EASTR before transcript assembly substantially reduces false positive introns, exons, and transcripts, improving the overall accuracy of assembled transcripts.
Additionally, we show that EASTR’s application to reference annotation databases can detect and correct likely cases of mis-annotated transcripts.

Related Results

NICU Medication Errors: Describing the Cause and Nature of Medication Errors in a NICU in Qatar
NICU Medication Errors: Describing the Cause and Nature of Medication Errors in a NICU in Qatar
IntroductionA medication error can be defined as “any error occurring in the medication use process” and focuses on problems with the delivery of medication to a patient [1]. Medic...
Spectrum of Tp53 gene mutation in basal cell carcinoma and its morphological subtypes in people of color.
Spectrum of Tp53 gene mutation in basal cell carcinoma and its morphological subtypes in people of color.
Objectives: Basal cell carcinoma (BCC) is the most common cutaneous malignancy in white population. The pattern of exon specific p53 mutations in BCC and its subtypes remain undete...
Splicing of designer exons informs a biophysical model for exon definition
Splicing of designer exons informs a biophysical model for exon definition
Pre-mRNA molecules in humans contain mostly short internal exons flanked by longer introns. To explain the removal of such introns, exon recognition instead of intron recognition h...
Abstract 1626: Sensitive detection of MET exon 14 skipping by RT-qPCR and next generation sequencing
Abstract 1626: Sensitive detection of MET exon 14 skipping by RT-qPCR and next generation sequencing
Abstract The MET receptor tyrosine kinase (MET) gene is a proto-oncogene whose abnormal activation can trigger tumor growth, angiogenesis, and metastasis. Numerous m...
Polymorphism Study of POU1F1 Gene in Mandya and NARI-Suwarna Sheep
Polymorphism Study of POU1F1 Gene in Mandya and NARI-Suwarna Sheep
Background: The mutations in POU1F1 gene result in deficiency of GH, PRL and TSH-â expression and thus have an indirect effect on development and reproduction. The present study wa...
Novel exon 11 skipping mutation in a patient with glycogen storage disease type IIId
Novel exon 11 skipping mutation in a patient with glycogen storage disease type IIId
AbstractWe report the molecular genetic abnormalities of a patient with GSD IIId presenting with progressive myopathy and cardiopathy leading to a fatal outcome. We identified two ...
Exon Skipping as a Therapeutic for Neurofibromatosis Type I
Exon Skipping as a Therapeutic for Neurofibromatosis Type I
Abstract We investigated the feasibility of utilizing an exon skipping approach as a genotype-dependent therapeutic for neurofibromatosis type 1 (NF1) by determining which ...

Back to Top