
Bacterial genome annotation script using BLASTN v2

This protocol uses a Python-based script and command-line BLASTn to annotate single-read sequencing results from genome amplifications in a final table, among other output files. Its main use in our lab (https://biocomputationlab.com) is to identify the location and gene locus of transposon inserts in the genome of Pseudomonas putida KT2440. However, the script can be used for any other bacterial genome whose sequence and annotation are available. The script was developed and tested in Python 3.11.9 with blastn version 2.9.0, sickle version 1.33 and fastqc version 0.11.9.

This is a description of the LAP entry LAPu-InsertsGenAnnotation-2.0.0, available from the LAP repository and from GitHub (entry LAPu-InsertsGenAnnotation-2.0.0). The script and usage examples can be downloaded directly from either location.

The major changes from the previous version are:
File format for the -identity argument: XLSX and CSV files are now accepted.
New argument --summaryMap: the --summaryMap argument has been added.
Enhanced -quality argument: when provided, the merged quality file (FastQC file) uses the file name (without extension) as the sequence identifier, which matches the qaccver in the table_reads_gene_description.csv file.
Support for numeric identifiers: numeric identifiers follow the same rules as well names for locating them in a file or sequence identifier.
Identifier pattern for sequences: the script recognizes the last element matching the pattern (+well_, +number_, _well_, _number_) as the sequence identifier for the -identity and --summaryMap arguments.
More read extensions accepted: files with these extensions are handled the same way txt files were handled previously, for all other arguments.
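The identifier rules described above can be illustrated with a short Python sketch. It is not taken from the LAPu-InsertsGenAnnotation script: the regular expression, the function names `guess_identifier` and `quality_identifier`, and the assumption that a "well" looks like a plate coordinate (one letter followed by one or two digits, such as A3 or H12) are all hypothetical and only approximate the behaviour the abstract describes.

```python
import re
from pathlib import Path

# Assumption: a "well" is one letter plus one or two digits (A3, H12),
# and a "number" is any run of digits. The element must be preceded by
# "+" or "_" and followed by "_", mirroring +well_, +number_, _well_, _number_.
_ID_PATTERN = re.compile(r"(?<=[+_])([A-Za-z]\d{1,2}|\d+)(?=_)")


def guess_identifier(name):
    """Return the last element of `name` that fits the pattern, or None."""
    matches = _ID_PATTERN.findall(name)
    return matches[-1] if matches else None


def quality_identifier(path):
    """Return the file name without its final extension.

    The abstract states that this value is used as the sequence identifier
    in the merged quality file and matches qaccver in the output table.
    """
    return Path(path).stem


# guess_identifier("plate1+A3_run_B12_trimmed") returns "B12" (last match wins)
# guess_identifier("sample_07_fwd") returns "07"
# quality_identifier("reads/plate1_A3_.ab1") returns "plate1_A3_"
```

Taking the last match rather than the first follows the statement that the script recognizes the last element of the pattern. The real script may define wells and delimiters differently, so the usage examples in the LAP repository remain the reference for actual file naming.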
Springer Science and Business Media LLC

