Search engine for discovering works of Art, research articles, and books related to Art and Culture
ShareThis
Javascript must be enabled to continue!

GeneBank Genie: An interactive toolkit for integrated multivariate analysis and visualization of GenBank records

View through CrossRef
Importance of the work: GeneBank Genie fills a critical gap by providing integrated desktop software for accessible and comprehensive analysis of GenBank data. Objectives: To develop intuitive software facilitating simultaneous automated parsing, preliminary genomic analysis, visualization and sequence extraction from multiple GenBank records. Materials and Methods: GeneBank Genie was built in the Python programming environment, using Tkinter for the graphical user interface (GUI) design, Biopython and custom Python script for sequence processing and scikit-learn for principal component analysis (PCA) and clustering analytics. Results: The developed software called GeneBank Genie was used successfully to analyze a dataset of 333 Orthoptera mitochondrial genomes, automatically computing nucleotide compositions, the percentage of nitrogenous bases that are either guanine or cytosine (GC content) and gene annotations. The PCA revealed distinct genomic clustering patterns and Mahalanobis distances were used to identify outliers effectively. Taxonomic visualizations demonstrated robust exploratory capabilities based on interactive Sankey diagrams, dendrograms, correlation heatmaps and K-means clustering. Additionally, rapid extraction of gene sequences illustrated practical applications for molecular research workflows. Main finding: GeneBank Genie uniquely integrates automated batch processing of GenBank records, PCA-based analytics and intuitive visualizations, greatly simplifying genomic data exploration for biologists. GeneBank Genie is freely available at: https://github.com/yashmgupta/GeneBank-Genie
Kasetsart University Research and Development Institute
Title: GeneBank Genie: An interactive toolkit for integrated multivariate analysis and visualization of GenBank records
Description:
Importance of the work: GeneBank Genie fills a critical gap by providing integrated desktop software for accessible and comprehensive analysis of GenBank data.
Objectives: To develop intuitive software facilitating simultaneous automated parsing, preliminary genomic analysis, visualization and sequence extraction from multiple GenBank records.
Materials and Methods: GeneBank Genie was built in the Python programming environment, using Tkinter for the graphical user interface (GUI) design, Biopython and custom Python script for sequence processing and scikit-learn for principal component analysis (PCA) and clustering analytics.
Results: The developed software called GeneBank Genie was used successfully to analyze a dataset of 333 Orthoptera mitochondrial genomes, automatically computing nucleotide compositions, the percentage of nitrogenous bases that are either guanine or cytosine (GC content) and gene annotations.
The PCA revealed distinct genomic clustering patterns and Mahalanobis distances were used to identify outliers effectively.
Taxonomic visualizations demonstrated robust exploratory capabilities based on interactive Sankey diagrams, dendrograms, correlation heatmaps and K-means clustering.
Additionally, rapid extraction of gene sequences illustrated practical applications for molecular research workflows.
Main finding: GeneBank Genie uniquely integrates automated batch processing of GenBank records, PCA-based analytics and intuitive visualizations, greatly simplifying genomic data exploration for biologists.
GeneBank Genie is freely available at: https://github.
com/yashmgupta/GeneBank-Genie.

Related Results

The Europlanet Evaluation Toolkit
The Europlanet Evaluation Toolkit
Evaluation can provide essential information in understanding the effectiveness and accessibility of outreach activities in engaging diverse communities. In this presentation, we w...
The Europlanet Evaluation Toolkit
The Europlanet Evaluation Toolkit
<div> <p>In this presentation, we will give an overview of the Europlanet Evaluation Toolkit, a resource that aims to empower outreach providers and edu...
First Molecular Identification of a Begomovirus Isolated from Tomato in Madagascar
First Molecular Identification of a Begomovirus Isolated from Tomato in Madagascar
In April 2001, reduced leaf size, leaf curling, yellowing symptoms, and reduced yield were observed in tomato plants in the southwestern (Toliary, Morondava, Miandrivazo) and north...
Enhancing Sepsis Prevention in Long-Term Care Facilities: Development of an Infection Prevention Sepsis Toolkit
Enhancing Sepsis Prevention in Long-Term Care Facilities: Development of an Infection Prevention Sepsis Toolkit
Objectives: The purpose of this Doctor of Nursing Practice (DNP) project was to create, validate, and refine an Evidence-Based Sepsis Prevention Toolkit to enhanced early sepsis re...
Response to Toshihide Tsuda, Yumiko Miyano and Eiji Yamamoto [1]
Response to Toshihide Tsuda, Yumiko Miyano and Eiji Yamamoto [1]
Abstract Background In August 2021, we published in Environmental Health a Toolkit for detecting misused epidemiological methods with the goal of pr...
GenBank 2 Sequin - a file converter preparing custom GenBank files for database submission
GenBank 2 Sequin - a file converter preparing custom GenBank files for database submission
Abstract The typical wet lab user often annotates smaller sequences in the GenBank format, but resulting files are not accepted for database subm...
PREVALENCE OF MULTIDRUG RESISTANT SHIGA TOXIN PRODUCING E. coli IN THE MILK OF CATTLE, BUFFALOES, AND CAMEL
PREVALENCE OF MULTIDRUG RESISTANT SHIGA TOXIN PRODUCING E. coli IN THE MILK OF CATTLE, BUFFALOES, AND CAMEL
Regarding high morbidity, mortality, and production losses, fungi infections have their importance among infectious illnesses and seem to be one of the main challenges facing poult...
GENOTYPIC CHARACTERIZATION OF SOME PATHOGENIC FUNGI ISOLATED FROM POULTRY AND THEIR SURROUNDINGS
GENOTYPIC CHARACTERIZATION OF SOME PATHOGENIC FUNGI ISOLATED FROM POULTRY AND THEIR SURROUNDINGS
Regarding high morbidity, mortality, and production losses, fungi infections have their importance among infectious illnesses and seem to be one of the main challenges facing poult...

Back to Top