Javascript must be enabled to continue!

Testing the FAIR metrics on data catalogs

The introduction of the FAIR –Findable, Accessible, Interoperable, Reusable– principles has caused quite an uproar within the scientific community. Principles which, if everyone adheres to them, could result in new, revolutionary ways of performing research and fulfill the promise of open science. Furthermore, it allows for concepts such as personalized medicine and personal health monitoring to -finally- become implemented in daily practice. However, to bring about these changes, data users need to rethink the way they treat scientific data. Just passing a dataset along, without extensive metadata will not suffice anymore. Such new ways of executing research require a significantly different approach from the entire scientific community or, for that matter, anyone who wants to reap the benefits from going FAIR. Yet, how do you initiate behavioral change? One important solution is by changing the software scientists use and requiring data owners, or data stewards, to FAIRify their dataset. Data catalogs are a great starting point for FAIRifying data as the software already intends to make data Findable and Accessible, while the metadata is Interoperable and relying on users to provide sufficient metadata to ensure Reusability. In this paper we analyse how well the FAIR principles are implemented in several data catalogs. To determine how FAIR a catalog is, the FAIR metrics were created by the GO-FAIR initiative. These metrics help determine to what extend data can be considered FAIR. However, the metrics were only recently developed, being first released at the end of 2017. At the moment software does not come standard with a FAIR metrics review. Still, this insight is highly desired by the scientific community. How else can they be sure that (public) money is spend in a FAIR way? The Hyve has tested/evaluated three popular open source data catalogs based on the FAIR metrics: CKAN, Dataverse, and Invenio. Most data stewards will be familiar with at least one of these. Within this white paper we provide answers to the following questions: Which of the three data catalogs performs best in making data FAIR? Which data catalog utilizes FAIR datasets the most? Which one creates the most FAIR metadata? Which catalog has the highest potential to increase its FAIRness, and how? Which data catalog facilitates the FAIRifying process the best?

PeerJ

Jarno A A van Erp Carolyn D Langen Anca Boon Kees van Bochove

2018

Title: Testing the FAIR metrics on data catalogs

Description:

The introduction of the FAIR –Findable, Accessible, Interoperable, Reusable– principles has caused quite an uproar within the scientific community.

Principles which, if everyone adheres to them, could result in new, revolutionary ways of performing research and fulfill the promise of open science.

Furthermore, it allows for concepts such as personalized medicine and personal health monitoring to -finally- become implemented in daily practice.

However, to bring about these changes, data users need to rethink the way they treat scientific data.

Just passing a dataset along, without extensive metadata will not suffice anymore.

Such new ways of executing research require a significantly different approach from the entire scientific community or, for that matter, anyone who wants to reap the benefits from going FAIR.

Yet, how do you initiate behavioral change? One important solution is by changing the software scientists use and requiring data owners, or data stewards, to FAIRify their dataset.

Data catalogs are a great starting point for FAIRifying data as the software already intends to make data Findable and Accessible, while the metadata is Interoperable and relying on users to provide sufficient metadata to ensure Reusability.

In this paper we analyse how well the FAIR principles are implemented in several data catalogs.

To determine how FAIR a catalog is, the FAIR metrics were created by the GO-FAIR initiative.

These metrics help determine to what extend data can be considered FAIR.

However, the metrics were only recently developed, being first released at the end of 2017.

At the moment software does not come standard with a FAIR metrics review.

Still, this insight is highly desired by the scientific community.

How else can they be sure that (public) money is spend in a FAIR way? The Hyve has tested/evaluated three popular open source data catalogs based on the FAIR metrics: CKAN, Dataverse, and Invenio.

Most data stewards will be familiar with at least one of these.

Within this white paper we provide answers to the following questions: Which of the three data catalogs performs best in making data FAIR? Which data catalog utilizes FAIR datasets the most? Which one creates the most FAIR metadata? Which catalog has the highest potential to increase its FAIRness, and how? Which data catalog facilitates the FAIRifying process the best?.

Back

The introduction of the FAIR –Findable, Accessible, Interoperable, Reusable– principles has caused quite an uproar within the scientific community. Principles which, if everyone ad...

SOFTWARE TESTING TECHNIQUES AND PRINCIPLES

This paper describes Software testing, need for software testing, Software testing goals and principles. Further it describe about different Software testing techniques and differe...

Curriculum Development for FAIR Data Stewardship

Abstract The FAIR Guidelines attempts to make digital data Findable, Accessible, Interoperable, and Reusable (FAIR). To prepare FAIR data, a new data science discipl...

THE SECURITY AND PRIVACY MEASURING SYSTEM FOR THE INTERNET OF THINGS DEVICES

The purpose of the article: elimination of the gap in existing need in the set of clear and objective security and privacy metrics for the IoT devices users and manufacturers and a...

An integrated approach to above- and below-ground ecological monitoring for nature-based solutions

1. As the development of nature-based solutions (NbS) increases globally, it is important to ensure that projects meet the objective of benefiting biodiversity, alongside tackling ...

Effect of Combining Catalogs with Different Completeness

<p>In most seismic studies, we prefer the earthquake catalog that covers a larger region and/or a longer period. We usually combine two or more catalogs to achieve th...

Abstract The rapid growth of open access publishing (OAP) has significantly improved the accessibility and dissemination of scientific knowledge. However, this expansion has also c...

Vovel metrics—novel coupling metrics for improved software fault prediction

Software is a complex entity, and its development needs careful planning and a high amount of time and cost. To assess quality of program, software measures are very helpful. Among...

Email:
Password:

Email:

Testing the FAIR metrics on data catalogs

Related Results