Javascript must be enabled to continue!
Galaxy CLIP-Explorer: a web server for CLIP-Seq data analysis
View through CrossRef
Abstract
Background
Post-transcriptional regulation via RNA-binding proteins plays a fundamental role in every organism, but the regulatory mechanisms lack important understanding. Nevertheless, they can be elucidated by cross-linking immunoprecipitation in combination with high-throughput sequencing (CLIP-Seq). CLIP-Seq answers questions about the functional role of an RNA-binding protein and its targets by determining binding sites on a nucleotide level and associated sequence and structural binding patterns. In recent years the amount of CLIP-Seq data skyrocketed, urging the need for an automatic data analysis that can deal with different experimental set-ups. However, noncanonical data, new protocols, and a huge variety of tools, especially for peak calling, made it difficult to define a standard.
Findings
CLIP-Explorer is a flexible and reproducible data analysis pipeline for iCLIP data that supports for the first time eCLIP, FLASH, and uvCLAP data. Individual steps like peak calling can be changed to adapt to different experimental settings. We validate CLIP-Explorer on eCLIP data, finding similar or nearly identical motifs for various proteins in comparison with other databases. In addition, we detect new sequence motifs for PTBP1 and U2AF2. Finally, we optimize the peak calling with 3 different peak callers on RBFOX2 data, discuss the difficulty of the peak-calling step, and give advice for different experimental set-ups.
Conclusion
CLIP-Explorer finally fills the demand for a flexible CLIP-Seq data analysis pipeline that is applicable to the up-to-date CLIP protocols. The article further shows the limitations of current peak-calling algorithms and the importance of a robust peak detection.
Oxford University Press (OUP)
Title: Galaxy CLIP-Explorer: a web server for CLIP-Seq data analysis
Description:
Abstract
Background
Post-transcriptional regulation via RNA-binding proteins plays a fundamental role in every organism, but the regulatory mechanisms lack important understanding.
Nevertheless, they can be elucidated by cross-linking immunoprecipitation in combination with high-throughput sequencing (CLIP-Seq).
CLIP-Seq answers questions about the functional role of an RNA-binding protein and its targets by determining binding sites on a nucleotide level and associated sequence and structural binding patterns.
In recent years the amount of CLIP-Seq data skyrocketed, urging the need for an automatic data analysis that can deal with different experimental set-ups.
However, noncanonical data, new protocols, and a huge variety of tools, especially for peak calling, made it difficult to define a standard.
Findings
CLIP-Explorer is a flexible and reproducible data analysis pipeline for iCLIP data that supports for the first time eCLIP, FLASH, and uvCLAP data.
Individual steps like peak calling can be changed to adapt to different experimental settings.
We validate CLIP-Explorer on eCLIP data, finding similar or nearly identical motifs for various proteins in comparison with other databases.
In addition, we detect new sequence motifs for PTBP1 and U2AF2.
Finally, we optimize the peak calling with 3 different peak callers on RBFOX2 data, discuss the difficulty of the peak-calling step, and give advice for different experimental set-ups.
Conclusion
CLIP-Explorer finally fills the demand for a flexible CLIP-Seq data analysis pipeline that is applicable to the up-to-date CLIP protocols.
The article further shows the limitations of current peak-calling algorithms and the importance of a robust peak detection.
Related Results
Laniakea: an open solution to provide Galaxy “on-demand” instances over heterogeneous cloud infrastructures
Laniakea: an open solution to provide Galaxy “on-demand” instances over heterogeneous cloud infrastructures
Abstract
Background
Galaxy is rapidly becoming the de facto standard among workflow managers for bioinformatics. A rich feature...
Pangeo for everyone with Galaxy
Pangeo for everyone with Galaxy
<p>Pangeo has been deployed on a number of diverse infrastructures and learning resources are available with for instance the Pangeo Tutorial Gallery (http://gallery....
Introducing a new notification system in Galaxy
Introducing a new notification system in Galaxy
We are excited to introduce a new notification system for Galaxy that provides users with increased awareness about things happening in Galaxy. This system includes several feature...
How Third Party Observers Respond to Overheard Service Failures: Implications for Frontline Service Employees and the Firm
How Third Party Observers Respond to Overheard Service Failures: Implications for Frontline Service Employees and the Firm
Frontline service employees and unhappy customers have generally been the main focus of the service failures research. It is established that service failures are difficult to prev...
Benchmarking Algorithms for Gene Set Scoring of Single-cell ATAC-seq Data
Benchmarking Algorithms for Gene Set Scoring of Single-cell ATAC-seq Data
AbstractGene set scoring (GSS) has been routinely conducted for gene expression analysis of bulk or single-cell RNA-seq data, which helps to decipher single-cell heterogeneity and ...
Galaxy Architecture
Galaxy Architecture
Slides for this
training session
at the
2016 Galaxy Community Conference (GCC2016)
.
...
Viral pathogen data analysis with Galaxy
Viral pathogen data analysis with Galaxy
The success of the GalaxyProject SARS-CoV-2 analysis effort during the COVID-19 pandemic has boosted
interest in Galaxy as a platform for viral pathogen data analys...
Generating Synthetic Single Cell Data from Bulk RNA-seq Using a Pretrained Variational Autoencoder
Generating Synthetic Single Cell Data from Bulk RNA-seq Using a Pretrained Variational Autoencoder
AbstractSingle cell RNA sequencing (scRNA-seq) is a powerful approach which generates genome-wide gene expression profiles at single cell resolution. Among its many applications, i...

