Tripal MegaSearch: a tool for interactive and customizable query and download of big data
Abstract
Tripal MegaSearch is a Tripal module for querying and downloading biological data stored in Chado. The module lets site users select a data type, restrict the dataset by applying filters, and then customize the fields to view and download, all through a single interface. The available data types are set by site administrators; examples include gene, germplasm, marker, map, QTL, genotype, phenotype and expression data. When querying for genes, users can restrict the gene dataset with filters such as name, chromosome position and functional annotation. They can then choose the fields to download, such as name, organism, type, chromosome position and functional annotations such as BLAST, KEGG, InterPro and GO terms. FASTA files can also be downloaded for the sequence data. Site administrators can choose between two data sources to serve data: Tripal MegaSearch materialized views or Chado tables. If neither data source is desired, administrators may also create their own materialized views and serve them through the flexible, dynamic Tripal MegaSearch query form. Tripal MegaSearch is currently implemented in several databases, including the Genome Database for Rosaceae (www.rosaceae.org) and TreeGenes (https://treegenesdb.org/).
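The filter-then-customize workflow described in the abstract can be illustrated with a minimal sketch. This is not the module's own code or interface: the record layout, the field names and the functions `filter_genes` and `to_csv` are all hypothetical, and the in-memory list merely stands in for a materialized view or Chado tables.

```python
import csv
import io

# Hypothetical gene records standing in for rows of a materialized view.
# Field names and values are illustrative assumptions, not the Chado schema.
GENES = [
    {"name": "geneA", "organism": "Malus domestica", "chromosome": "Chr1",
     "start": 1200, "end": 3400, "go_term": "GO:0008150"},
    {"name": "geneB", "organism": "Malus domestica", "chromosome": "Chr1",
     "start": 8000, "end": 9500, "go_term": "GO:0003674"},
    {"name": "geneC", "organism": "Prunus persica", "chromosome": "Chr2",
     "start": 500, "end": 2100, "go_term": "GO:0005575"},
]


def filter_genes(records, name_contains=None, chromosome=None,
                 start=None, end=None):
    """Return the records that satisfy every filter that was supplied."""
    hits = []
    for rec in records:
        if (name_contains is not None
                and name_contains.lower() not in rec["name"].lower()):
            continue
        if chromosome is not None and rec["chromosome"] != chromosome:
            continue
        # Positional filters keep only genes lying fully inside [start, end].
        if start is not None and rec["start"] < start:
            continue
        if end is not None and rec["end"] > end:
            continue
        hits.append(rec)
    return hits


def to_csv(records, fields):
    """Serialize only the user-selected fields as CSV text."""
    buf = io.StringIO()
    writer = csv.DictWriter(buf, fieldnames=fields, extrasaction="ignore",
                            lineterminator="\n")
    writer.writeheader()
    writer.writerows(records)
    return buf.getvalue()


# Step 1: restrict the dataset; step 2: pick the fields to download.
hits = filter_genes(GENES, chromosome="Chr1", start=1000, end=5000)
download = to_csv(hits, ["name", "organism"])
```

Keeping filtering separate from field selection mirrors the two user-facing steps in the abstract: the same filtered set can be downloaded again with a different choice of fields without repeating the query.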
Oxford University Press (OUP)


