Search engine for discovering works of Art, research articles, and books related to Art and Culture
ShareThis
Javascript must be enabled to continue!

Terminitor: Cleavage Site Prediction Using Deep Learning Models

View through CrossRef
Abstract As a widespread RNA processing machinery, alternative polyadenylation plays a crucial role in gene regulation. To help decipher its underlying mechanism and understand its impact, it is desirable to comprehensively profile 3’-untranslated region cleavage and associated polyadenylation sites. State-of-the-art polyadenylation site detection tools are known to be influenced by library preparation artefacts or manually selected features. Moreover, recently published machine learning methods have only been tested on pre-constructed datasets, thus lacking validation on experimental data. Here we present Terminitor, the first deep neural network-based profiling pipeline to make predictions from RNA-seq data. We show how Terminitor outperforms competing tools in sensitivity and precision on experimental transcriptome sequencing data, and demonstrate its use with data from short- and long-read sequencing technologies. For species without a good reference transcriptome annotation, Terminitor is still able to pass on the information learnt from a related species and make reasonable predictions. We used Terminitor to showcase how single nucleotide variations can create or destroy polyadenylated cleavage sites in human RNA-seq samples. Author Summary 3’ cleavage and polyadenylation of pre-mRNA is part of RNA maturation process. One gene can be cleaved at different positions at its 3’ end, namely alternatively polyadenylation, thus identifying the correct polyadenylated cleavage site (poly(A) CS) is essential to unveil its role in gene regulation under different physiological and pathological conditions. The current poly(A) CS prediction tools are either heavily influenced by RNA-Seq library preparation artefacts or have only been designed and tested on ad hoc datasets, lacking association with real world applications. In this study, we present a deep learning model, Terminitor, that predicts the probability of a nucleotide sequence containing a poly(A) CS, and validated its performance on human and mouse data. Along with the model, we propose a poly(A) CS profiling pipeline for RNA-seq data. We benchmarked our pipeline against competing tools and achieved higher sensitivity and precision in experimental data. The usage of Terminitor is not limited to genome and transcriptome annotation and we expect it to facilitate the identification of novel isoforms, improve the accuracy of transcript quantification and differential expression analysis, and contribute to the repertoire of reference transcriptome annotation.
Title: Terminitor: Cleavage Site Prediction Using Deep Learning Models
Description:
Abstract As a widespread RNA processing machinery, alternative polyadenylation plays a crucial role in gene regulation.
To help decipher its underlying mechanism and understand its impact, it is desirable to comprehensively profile 3’-untranslated region cleavage and associated polyadenylation sites.
State-of-the-art polyadenylation site detection tools are known to be influenced by library preparation artefacts or manually selected features.
Moreover, recently published machine learning methods have only been tested on pre-constructed datasets, thus lacking validation on experimental data.
Here we present Terminitor, the first deep neural network-based profiling pipeline to make predictions from RNA-seq data.
We show how Terminitor outperforms competing tools in sensitivity and precision on experimental transcriptome sequencing data, and demonstrate its use with data from short- and long-read sequencing technologies.
For species without a good reference transcriptome annotation, Terminitor is still able to pass on the information learnt from a related species and make reasonable predictions.
We used Terminitor to showcase how single nucleotide variations can create or destroy polyadenylated cleavage sites in human RNA-seq samples.
Author Summary 3’ cleavage and polyadenylation of pre-mRNA is part of RNA maturation process.
One gene can be cleaved at different positions at its 3’ end, namely alternatively polyadenylation, thus identifying the correct polyadenylated cleavage site (poly(A) CS) is essential to unveil its role in gene regulation under different physiological and pathological conditions.
The current poly(A) CS prediction tools are either heavily influenced by RNA-Seq library preparation artefacts or have only been designed and tested on ad hoc datasets, lacking association with real world applications.
In this study, we present a deep learning model, Terminitor, that predicts the probability of a nucleotide sequence containing a poly(A) CS, and validated its performance on human and mouse data.
Along with the model, we propose a poly(A) CS profiling pipeline for RNA-seq data.
We benchmarked our pipeline against competing tools and achieved higher sensitivity and precision in experimental data.
The usage of Terminitor is not limited to genome and transcriptome annotation and we expect it to facilitate the identification of novel isoforms, improve the accuracy of transcript quantification and differential expression analysis, and contribute to the repertoire of reference transcriptome annotation.

Related Results

Selection of Injectable Drug Product Composition using Machine Learning Models (Preprint)
Selection of Injectable Drug Product Composition using Machine Learning Models (Preprint)
BACKGROUND As of July 2020, a Web of Science search of “machine learning (ML)” nested within the search of “pharmacokinetics or pharmacodynamics” yielded over 100...
CREATING LEARNING MEDIA IN TEACHING ENGLISH AT SMP MUHAMMADIYAH 2 PAGELARAN ACADEMIC YEAR 2020/2021
CREATING LEARNING MEDIA IN TEACHING ENGLISH AT SMP MUHAMMADIYAH 2 PAGELARAN ACADEMIC YEAR 2020/2021
The pandemic Covid-19 currently demands teachers to be able to use technology in teaching and learning process. But in reality there are still many teachers who have not been able ...
Recent Advances in Photoinduced Oxidative Cleavage of Alkenes
Recent Advances in Photoinduced Oxidative Cleavage of Alkenes
AbstractOxidative cleavage of alkenes leading to valuable carbonyl derivatives is a fundamental transformation in synthetic chemistry. In particular, ozonolysis is the mainstream m...
The cell and stress-specific canonical and non-canonical tRNA cleavage
The cell and stress-specific canonical and non-canonical tRNA cleavage
Abstract Following stress, tRNA is cleaved to generate tRNA halves (tiRNAs). These stress-induced small RNAs have been shown to regulate translat...
How to find simple and accurate rules for viral protease cleavage specificities
How to find simple and accurate rules for viral protease cleavage specificities
BACKGROUND:Proteases of human pathogens are becoming increasingly important drug targets, hence it is necessary to understand their substrate specificity and to interpret this know...
Endothelial Protein C Receptor
Endothelial Protein C Receptor
IntroductionThe protein C anticoagulant pathway plays a critical role in the negative regulation of the blood clotting response. The pathway is triggered by thrombin, which allows ...
Uncoupling from transcription protects polyadenylation site cleavage from inhibition by DNA damage
Uncoupling from transcription protects polyadenylation site cleavage from inhibition by DNA damage
Abstract Pre-mRNA 3’-end processing by cleavage and polyadenylation (CPA) is a nuclear process in which RNA polymerase II (Pol II) transcripts ar...
Mechanism of cleavage and ligation by FLP recombinase: classification of mutations in FLP protein by in vitro complementation analysis.
Mechanism of cleavage and ligation by FLP recombinase: classification of mutations in FLP protein by in vitro complementation analysis.
The FLP recombinase of the 2 microns plasmid of Saccharomyces cerevisiae is a member of the integrase family of site-specific recombinases. Recombination catalyzed by members of th...

Back to Top