Javascript must be enabled to continue!
A database-driven research data framework for integrating and processing high-dimensional geoscientific data
View through CrossRef
Abstract. This paper introduces a modular research data framework designed for geoscientific research across disciplinary boundaries. It is specifically designed to support small research projects, providing a bottom-up solution that empowers individual teams that need to adhere to strict data management requirements from funding bodies, but often lack the financial and human resources to do so. The framework supports the transformation of raw research data into scientific knowledge. It addresses critical challenges, such as the rapid increase in the volume, variety and complexity of geoscientific datasets, data heterogeneity, spatial complexity, and the need to comply with the FAIR (Findable, Accessible, Interoperable, and Reusable) principles. The framework uses a dual-component architecture. First, an Online Transaction Processing (OLTP) system features a user interface and a persistent relational database, ensuring accurate and consistent data storage when capturing and managing diverse geoscientific research data. Complementing this, an orchestration layer manages automated data pipelines to process the stored data and generate dynamic in-memory Online Analytical Processing (OLAP) databases that allow flexible, high-performance analysis. It is adaptable to evolving research requirements and supports various data types and methodological approaches, such as machine learning and deep learning, that place high demands on the data and their formats. A case study in Western Romania demonstrates the application of the data framework in an interdisciplinary geoarchaeological research project by processing and storing heterogeneous datasets, thereby reducing data management efforts, improving findability, replicability, and reproducibility, and streamlining the integration of high-dimensional data for small, interdisciplinary teams.
Title: A database-driven research data framework for integrating and processing high-dimensional geoscientific data
Description:
Abstract.
This paper introduces a modular research data framework designed for geoscientific research across disciplinary boundaries.
It is specifically designed to support small research projects, providing a bottom-up solution that empowers individual teams that need to adhere to strict data management requirements from funding bodies, but often lack the financial and human resources to do so.
The framework supports the transformation of raw research data into scientific knowledge.
It addresses critical challenges, such as the rapid increase in the volume, variety and complexity of geoscientific datasets, data heterogeneity, spatial complexity, and the need to comply with the FAIR (Findable, Accessible, Interoperable, and Reusable) principles.
The framework uses a dual-component architecture.
First, an Online Transaction Processing (OLTP) system features a user interface and a persistent relational database, ensuring accurate and consistent data storage when capturing and managing diverse geoscientific research data.
Complementing this, an orchestration layer manages automated data pipelines to process the stored data and generate dynamic in-memory Online Analytical Processing (OLAP) databases that allow flexible, high-performance analysis.
It is adaptable to evolving research requirements and supports various data types and methodological approaches, such as machine learning and deep learning, that place high demands on the data and their formats.
A case study in Western Romania demonstrates the application of the data framework in an interdisciplinary geoarchaeological research project by processing and storing heterogeneous datasets, thereby reducing data management efforts, improving findability, replicability, and reproducibility, and streamlining the integration of high-dimensional data for small, interdisciplinary teams.
Related Results
Making geoscientific lab data FAIR: a conceptual model for a geophysical laboratory database
Making geoscientific lab data FAIR: a conceptual model for a geophysical laboratory database
Abstract. The term of geoscientific laboratory measurements involves a variety of methods in geosciences. Accordingly, the resulting data comprise many different data types, format...
Non-Recommended Publishing Lists: Strategies for Detecting Deceitful Journals
Non-Recommended Publishing Lists: Strategies for Detecting Deceitful Journals
Abstract
The rapid growth of open access publishing (OAP) has significantly improved the accessibility and dissemination of scientific knowledge. However, this expansion has also c...
Geoscientific research's influence on renewable energy policies and ecological balancing
Geoscientific research's influence on renewable energy policies and ecological balancing
Geoscientific research plays a crucial role in shaping renewable energy policies and promoting ecological balance. This review explores the impact of geoscientific research on rene...
The Intelligent Prospector v1.0: geoscientific model development and prediction by sequential data acquisition planning with application to mineral exploration
The Intelligent Prospector v1.0: geoscientific model development and prediction by sequential data acquisition planning with application to mineral exploration
Abstract. Geoscientific models are based on geoscientific data; hence,
building better models, in the sense of attaining better predictions, often
means acquiring additional data. ...
A database-driven research data framework for integrating and processing high-dimensional geoscientific data
A database-driven research data framework for integrating and processing high-dimensional geoscientific data
Abstract. This paper introduces a modular research data framework designed for geoscientific research across disciplinary boundaries. It is specifically designed to support small r...
Firedrake - automated, differentiable building blocks for geoscientific simulation
Firedrake - automated, differentiable building blocks for geoscientific simulation
Computer simulations of continuous processes described by partial differential equations are a bedrock of geoscientific simulation. Each simulation is a complex composition of equa...
High Dimensional Computing on Arabic Language Classification
High Dimensional Computing on Arabic Language Classification
Abstract
The brain circuit is enormous regarding quantities of neurons and neuro-transmitters, proposing that huge circuits are the main entity to the brain-core processing...
Editorial: The publication of geoscientific model developments v1.0
Editorial: The publication of geoscientific model developments v1.0
Abstract. In 2008, the first volume of the European Geosciences Union (EGU) journal Geoscientific Model Development (GMD) was published. GMD was founded because we perceived there ...

