Javascript must be enabled to continue!
Reproducible and accessible analysis of transposon insertion data at scale
View through CrossRef
ABSTRACT
Significant progress has been made in advancing and standardizing tools for human genomic and biomedical research, yet the field of next generation sequencing (NGS) analysis for microorganisms (including multiple pathogens) remains fragmented, lacks accessible and reusable tools, is hindered by local computational resource limitations, and does not offer widely accepted standards. One of such “problem areas” is the analysis of Transposon Insertion Sequencing (TIS) data. TIS allows perturbing the entire genome of a microorganism by introducing random insertions of transposon-derived constructs. The impact of the insertions on the survival and growth provides precise information about genes affecting specific phenotypic characteristics. A wide array of tools has been developed to analyze TIS data and among the variety of options available, it is often difficult to identify which one can provide a reliable and reproducible analysis. Here we sought to understand the challenges and propose reliable practices for the analysis of TIS experiments. Using data from two recent TIS studies we have developed a series of workflows that include multiple tools for data de-multiplexing, promoter sequence identification, transposon flank alignment, and read count repartition across the genome. Particular attention was paid to quality control procedures such as the determination of the optimal tool parameters for the analysis and removal of contamination. Our work provides an assessment of the currently available tools for TIS data analysis and offers ready to use workflows that can be invoked by anyone in the world using our public Galaxy platform (
https://usegalaxy.org
). To lower the entry barriers we have also developed interactive tutorials explaining details of TIS data analysis procedures at
https://bit.ly/gxy-tis
.
Importance
A wide array of tools has been developed to analyze TIS data and among the variety of options available, it is often difficult to identify which one can provide a reliable and reproducible analysis. Here we sought to understand the challenges and propose reliable practices for the analysis of TIS experiments. Using data from two recent TIS studies we have developed a series of workflows that include multiple tools for data de-multiplexing, promoter sequence identification, transposon flank alignment, and read count repartition across the genome. Particular attention was paid to quality control procedures such as the determination of the optimal tool parameters for the analysis and removal of contamination. Our work democratizes the TIS data analysis by providing open workflows supported by public computational infrastructure.
Title: Reproducible and accessible analysis of transposon insertion data at scale
Description:
ABSTRACT
Significant progress has been made in advancing and standardizing tools for human genomic and biomedical research, yet the field of next generation sequencing (NGS) analysis for microorganisms (including multiple pathogens) remains fragmented, lacks accessible and reusable tools, is hindered by local computational resource limitations, and does not offer widely accepted standards.
One of such “problem areas” is the analysis of Transposon Insertion Sequencing (TIS) data.
TIS allows perturbing the entire genome of a microorganism by introducing random insertions of transposon-derived constructs.
The impact of the insertions on the survival and growth provides precise information about genes affecting specific phenotypic characteristics.
A wide array of tools has been developed to analyze TIS data and among the variety of options available, it is often difficult to identify which one can provide a reliable and reproducible analysis.
Here we sought to understand the challenges and propose reliable practices for the analysis of TIS experiments.
Using data from two recent TIS studies we have developed a series of workflows that include multiple tools for data de-multiplexing, promoter sequence identification, transposon flank alignment, and read count repartition across the genome.
Particular attention was paid to quality control procedures such as the determination of the optimal tool parameters for the analysis and removal of contamination.
Our work provides an assessment of the currently available tools for TIS data analysis and offers ready to use workflows that can be invoked by anyone in the world using our public Galaxy platform (
https://usegalaxy.
org
).
To lower the entry barriers we have also developed interactive tutorials explaining details of TIS data analysis procedures at
https://bit.
ly/gxy-tis
.
Importance
A wide array of tools has been developed to analyze TIS data and among the variety of options available, it is often difficult to identify which one can provide a reliable and reproducible analysis.
Here we sought to understand the challenges and propose reliable practices for the analysis of TIS experiments.
Using data from two recent TIS studies we have developed a series of workflows that include multiple tools for data de-multiplexing, promoter sequence identification, transposon flank alignment, and read count repartition across the genome.
Particular attention was paid to quality control procedures such as the determination of the optimal tool parameters for the analysis and removal of contamination.
Our work democratizes the TIS data analysis by providing open workflows supported by public computational infrastructure.
Related Results
Transposon insertion sequencing (Tn-seq) library preparation protocol - includes UMI for PCR duplicate removal v1
Transposon insertion sequencing (Tn-seq) library preparation protocol - includes UMI for PCR duplicate removal v1
Transposon insertion sequencing (Tn-Seq) is an NGS method to en masse map genomic locations of transposon insertions in a pool of mutants. Insertion mutants are usually made using ...
Safety and efficacy of laryngeal mask airway placement in term of oropharyngeal leak pressure: A comparison between conventional insertion and laryngoscope guided insertion
Safety and efficacy of laryngeal mask airway placement in term of oropharyngeal leak pressure: A comparison between conventional insertion and laryngoscope guided insertion
Background: The effectiveness of laryngeal mask airway (LMA) through conventional insertion technique versus laryngoscope-guided insertion is a debatable issue. Use of only fiberop...
Bacterial genome annotation script using BLASTN v2
Bacterial genome annotation script using BLASTN v2
This protocol uses the command line tools provided by the Python package TnAtlas to identify and annotate transposon integration events in genomes. Given a set of sequencing reads...
Factors determining frequency of plasmid cointegration mediated by insertion sequence IS
1
Factors determining frequency of plasmid cointegration mediated by insertion sequence IS
1
We demonstrate that mutants with deletions at either end of the insertion sequence IS
1
lose the ability to mediate cointegration of two plasmids, whereas m...
Outcomes after Intrauterine Device Insertion for Heavy Menstrual Bleeding in Young Women with and without Inherited Bleeding Disorders
Outcomes after Intrauterine Device Insertion for Heavy Menstrual Bleeding in Young Women with and without Inherited Bleeding Disorders
Introduction:
Data on hormonal intrauterine device (IUD) related adverse events in young women with inherited bleeding disorders (IBDs) are limited. Our primary obje...
High-throughput workflow for the genotypic characterization of transposon insertion library variants v2
High-throughput workflow for the genotypic characterization of transposon insertion library variants v2
This is a workflow for the genotypic characterization of transposon library variants. It has been developed using an open-source Opentrons OT-2 robot, BLASTN for genomic annotation...
The Drosophila OSC Genome: A Resource for Studies of Transposon and piRNA Biology
The Drosophila OSC Genome: A Resource for Studies of Transposon and piRNA Biology
ABSTRACT
Accurate genome assemblies are critical for understanding small RNA-mediated genome defense. In animals, the PIWI-interacting RNA (piRNA) pathway protects ...
LoRTIS Software Suite: Transposon mutant analysis using long-read sequencing
LoRTIS Software Suite: Transposon mutant analysis using long-read sequencing
Abstract
To date transposon insertion sequencing (TIS) methodologies have used short-read nucleotide sequencing technology. However, short-read sequences are unlike...

