Tripal MegaSearch: a tool for interactive and customizable query and download of big data

Abstract: Tripal MegaSearch is a Tripal module for querying and downloading biological data stored in Chado. The module lets site users select a data type, restrict the dataset by applying various filters, and then customize the fields to view and download, all through a single interface. The available data types are set by site administrators; examples include gene, germplasm, marker, map, QTL, genotype, phenotype and expression data. When querying for genes, users can restrict the gene dataset using filters such as name, chromosome position and functional annotation. They can then customize the fields to download, such as name, organism, type, chromosome position and functional annotations including BLAST, KEGG, InterPro and GO terms. FASTA files can also be downloaded for the sequence data. Site administrators can choose between two data sources to serve data: Tripal MegaSearch materialized views or Chado tables. If neither data source is desired, administrators may also create their own materialized views and serve them through the flexible, dynamic Tripal MegaSearch query form. Tripal MegaSearch is currently implemented in several databases, including the Genome Database for Rosaceae (www.rosaceae.org) and TreeGenes (https://treegenesdb.org/).

Oxford University Press (OUP)

Sook Jung, Chun-Huai Cheng, Katheryn Buble, Taein Lee, Jodi Humann, Jing Yu, James Crabb, Heidi Hough, Dorrie Main

Database

2021
