Blind Queries Applied to JSON Document Stores

Social media, Web portals and, more generally, information systems offer their own Application Programming Interfaces (APIs), which provide large data sets concerning every aspect of everyday life. APIs usually provide data sets as collections of JSON documents. The heterogeneous structure of the JSON documents returned by different APIs is a barrier to effectively querying and analyzing these data sets. Adopting NoSQL document stores, such as MongoDB, is useful for gathering these data sets, but does not solve the problem of querying the resulting heterogeneous repository. The aim of this paper is to provide analysts with a tool, named HammerJDB, that supports blind querying of collections of JSON documents within a NoSQL document database. The underlying idea is that users may know the application domain without being aware of the actual structure of the documents stored in the database; the blind-querying tool tries to bridge this gap by adopting a query rewriting mechanism. This paper evolves a technique for blind querying Open Data portals, and its implementation within the Hammer framework, presented in previous work. In this paper, we extend that approach to query a NoSQL document database by evolving the Hammer framework into the HammerJDB framework, which is able to work on MongoDB databases. The effectiveness of the new approach is evaluated on a data set, derived from a real-life one, containing job-vacancy ads collected from European job portals.
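To make the idea of query rewriting concrete, the following is a minimal sketch and not the HammerJDB algorithm, whose details are not given here. It assumes that a blind query names a single field that may not exist in the stored documents, and that candidate fields can be found by plain string similarity (Python's `difflib` is used as a stand-in for whatever matching the tool actually performs). The function names, the 0.6 threshold, and the sample job-ad documents are all hypothetical; the rewritten queries are shaped like simple MongoDB equality filters but are evaluated in memory.

```python
from difflib import SequenceMatcher

# Hypothetical heterogeneous job ads, as different APIs might return them.
docs = [
    {"job_title": "Data Analyst", "country": "IT"},
    {"jobTitle": "Data Analyst", "nation": "DE"},
    {"title": "Web Developer", "country": "FR"},
]


def field_names(documents):
    """Collect the distinct top-level field names found in the documents."""
    names = set()
    for doc in documents:
        names.update(doc.keys())
    return names


def similar_fields(term, names, threshold=0.6):
    """Return stored field names whose spelling is close to the user's term,
    best match first. The threshold is an arbitrary illustrative value."""
    scored = []
    for name in names:
        score = SequenceMatcher(None, term.lower(), name.lower()).ratio()
        if score >= threshold:
            scored.append((score, name))
    return [name for _, name in sorted(scored, reverse=True)]


def rewrite_query(field, value, documents, threshold=0.6):
    """Rewrite one blind equality condition into one filter per candidate
    field, each shaped like a MongoDB equality filter {field: value}."""
    candidates = similar_fields(field, field_names(documents), threshold)
    return [{name: value} for name in candidates]


def blind_find(field, value, documents, threshold=0.6):
    """Return the documents that satisfy at least one rewritten filter."""
    filters = rewrite_query(field, value, documents, threshold)
    results = []
    for doc in documents:
        for flt in filters:
            if all(doc.get(k) == v for k, v in flt.items()):
                results.append(doc)
                break  # avoid adding the same document twice
    return results


# The user asks for "jobtitle", a field name that no document actually uses.
matches = blind_find("jobtitle", "Data Analyst", docs)
print(matches)
```

In this sketch, the user's field name is never looked up directly: it is only used to select stored field names, so documents with differently spelled fields can still be retrieved by the same query. A real tool would also need to handle nested fields, multiple conditions, and ranking of the rewritten queries.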
