Javascript must be enabled to continue!
Data Anonymization for Open Science: A Case Study
View through CrossRef
ABSTRACTOne of many challenges to open science is anonymization of personal data so that it may be shared. This paper presents a case study of the anonymization of a dataset containing cardio-respiratory fitness and commuting patterns for Slovenian school children. It evaluates three different anonymization tools, ARX, SDV, and SynDiffix. The fitness study was selected because its small size (N=713) and generally low statistical significance make it particularly challenging for data anonymization. Unlike most prior anonymization tool evaluations, this paper examines whether the scientific conclusions of the original study would have been supported by the anonymized datasets. It also considers the burden imposed on researchers using the tools both for data generation and data analysis.
Cold Spring Harbor Laboratory
Title: Data Anonymization for Open Science: A Case Study
Description:
ABSTRACTOne of many challenges to open science is anonymization of personal data so that it may be shared.
This paper presents a case study of the anonymization of a dataset containing cardio-respiratory fitness and commuting patterns for Slovenian school children.
It evaluates three different anonymization tools, ARX, SDV, and SynDiffix.
The fitness study was selected because its small size (N=713) and generally low statistical significance make it particularly challenging for data anonymization.
Unlike most prior anonymization tool evaluations, this paper examines whether the scientific conclusions of the original study would have been supported by the anonymized datasets.
It also considers the burden imposed on researchers using the tools both for data generation and data analysis.
Related Results
Hydatid Disease of The Brain Parenchyma: A Systematic Review
Hydatid Disease of The Brain Parenchyma: A Systematic Review
Abstarct
Introduction
Isolated brain hydatid disease (BHD) is an extremely rare form of echinococcosis. A prompt and timely diagnosis is a crucial step in disease management. This ...
Breast Carcinoma within Fibroadenoma: A Systematic Review
Breast Carcinoma within Fibroadenoma: A Systematic Review
Abstract
Introduction
Fibroadenoma is the most common benign breast lesion; however, it carries a potential risk of malignant transformation. This systematic review provides an ove...
The Costs of Anonymization: Case Study Using Clinical Data (Preprint)
The Costs of Anonymization: Case Study Using Clinical Data (Preprint)
BACKGROUND
Sharing data from clinical studies can accelerate scientific progress, improve transparency, and increase the potential for innovation and collab...
Chest Wall Hydatid Cysts: A Systematic Review
Chest Wall Hydatid Cysts: A Systematic Review
Abstract
Introduction
Given the rarity of chest wall hydatid disease, information on this condition is primarily drawn from case reports. Hence, this study systematically reviews t...
Anonymize or synthesize? Privacy-preserving methods for heart failure score analytics
Anonymize or synthesize? Privacy-preserving methods for heart failure score analytics
Abstract
Aims
Data availability remains a critical challenge in modern, data-driven medical research. Due to the sensitive natur...
Hydatid Cyst of The Orbit: A Systematic Review with Meta-Data
Hydatid Cyst of The Orbit: A Systematic Review with Meta-Data
Abstarct
Introduction
Orbital hydatid cysts (HCs) constitute less than 1% of all cases of hydatidosis, yet their occurrence is often linked to severe visual complications. This stu...
Primary Thyroid Non-Hodgkin B-Cell Lymphoma: A Case Series
Primary Thyroid Non-Hodgkin B-Cell Lymphoma: A Case Series
Abstract
Introduction
Non-Hodgkin lymphoma (NHL) of the thyroid, a rare malignancy linked to autoimmune disorders, is poorly understood in terms of its pathogenesis and treatment o...
Strengthening GIS Security: Anonymization and Differential Privacy for Safeguarding Sensitive Geospatial Data
Strengthening GIS Security: Anonymization and Differential Privacy for Safeguarding Sensitive Geospatial Data
The protection of Geographic Information Systems (GIS) is now more relevant since these systems gather, process, and store geospatial data to various ends, receiving and processing...

