Javascript must be enabled to continue!
Anonymize or synthesize? Privacy-preserving methods for heart failure score analytics
View through CrossRef
Abstract
Aims
Data availability remains a critical challenge in modern, data-driven medical research. Due to the sensitive nature of patient health records, they are rightfully subject to stringent privacy protection measures. One way to overcome these restrictions is to preserve patient privacy by using anonymization and synthetization strategies. In this work, we investigate the effectiveness of these methods for protecting patient privacy using real-world cardiology health records.
Methods and results
We implemented anonymization and synthetization techniques for a structure data set, which was collected during the HiGHmed Use Case Cardiology study. We employed the data anonymization tool ARX and the data synthetization framework ASyH individually and in combination. We evaluated the utility and shortcomings of the different approaches by statistical analyses and privacy risk assessments. Data utility was assessed by computing two heart failure risk scores on the protected data sets. We observed only minimal deviations to scores from the original data set. Additionally, we performed a re-identification risk analysis and found only minor residual risks for common types of privacy threats.
Conclusion
We could demonstrate that anonymization and synthetization methods protect privacy while retaining data utility for heart failure risk assessment. Both approaches and a combination thereof introduce only minimal deviations from the original data set over all features. While data synthesis techniques produce any number of new records, data anonymization techniques offer more formal privacy guarantees. Consequently, data synthesis on anonymized data further enhances privacy protection with little impacting data utility. We share all generated data sets with the scientific community through a use and access agreement.
Oxford University Press (OUP)
Title: Anonymize or synthesize? Privacy-preserving methods for heart failure score analytics
Description:
Abstract
Aims
Data availability remains a critical challenge in modern, data-driven medical research.
Due to the sensitive nature of patient health records, they are rightfully subject to stringent privacy protection measures.
One way to overcome these restrictions is to preserve patient privacy by using anonymization and synthetization strategies.
In this work, we investigate the effectiveness of these methods for protecting patient privacy using real-world cardiology health records.
Methods and results
We implemented anonymization and synthetization techniques for a structure data set, which was collected during the HiGHmed Use Case Cardiology study.
We employed the data anonymization tool ARX and the data synthetization framework ASyH individually and in combination.
We evaluated the utility and shortcomings of the different approaches by statistical analyses and privacy risk assessments.
Data utility was assessed by computing two heart failure risk scores on the protected data sets.
We observed only minimal deviations to scores from the original data set.
Additionally, we performed a re-identification risk analysis and found only minor residual risks for common types of privacy threats.
Conclusion
We could demonstrate that anonymization and synthetization methods protect privacy while retaining data utility for heart failure risk assessment.
Both approaches and a combination thereof introduce only minimal deviations from the original data set over all features.
While data synthesis techniques produce any number of new records, data anonymization techniques offer more formal privacy guarantees.
Consequently, data synthesis on anonymized data further enhances privacy protection with little impacting data utility.
We share all generated data sets with the scientific community through a use and access agreement.
Related Results
ecision Farming and Predictive Analytics in Precision Farming and Predictive Analytics in Precision Farming and Predictive Analytics in Precision Farming and Predictive Analytics in Precision Farming and Predictive Analytics in Precision Farming and Predi
ecision Farming and Predictive Analytics in Precision Farming and Predictive Analytics in Precision Farming and Predictive Analytics in Precision Farming and Predictive Analytics in Precision Farming and Predictive Analytics in Precision Farming and Predi
The scope of sensor networks and the Internet of Things spanning rapidly to diversified domains but not limited to sports, health, and business trading. In recent past, the sensors...
Frequency of Common Chromosomal Abnormalities in Patients with Idiopathic Acquired Aplastic Anemia
Frequency of Common Chromosomal Abnormalities in Patients with Idiopathic Acquired Aplastic Anemia
Objective: To determine the frequency of common chromosomal aberrations in local population idiopathic determine the frequency of common chromosomal aberrations in local population...
Etiology and predictors of heart failure in pregnancy. Newer Insights from the M-PAC registry
Etiology and predictors of heart failure in pregnancy. Newer Insights from the M-PAC registry
Abstract
Background
Women with heart disease undergoing pregnancy is on the increase, along with an increasing cardiac contribut...
Privacy and Security for Digital Health: Assessing Risks and Harms to Users
Privacy and Security for Digital Health: Assessing Risks and Harms to Users
Electronic Health (e-Health), such as mobile health (mHealth) and Health Information Systems (HIS), benefits healthcare consumers and professionals. However, it also poses potentia...
Augmented Differential Privacy Framework for Data Analytics
Augmented Differential Privacy Framework for Data Analytics
Abstract
Differential privacy has emerged as a popular privacy framework for providing privacy preserving noisy query answers based on statistical properties of databases. ...
THE SECURITY AND PRIVACY MEASURING SYSTEM FOR THE INTERNET OF THINGS DEVICES
THE SECURITY AND PRIVACY MEASURING SYSTEM FOR THE INTERNET OF THINGS DEVICES
The purpose of the article: elimination of the gap in existing need in the set of clear and objective security and privacy metrics for the IoT devices users and manufacturers and a...
Privacy Risk in Recommender Systems
Privacy Risk in Recommender Systems
Nowadays, recommender systems are mostly used in many online applications to filter information and help users in selecting their relevant requirements. It avoids users to become o...
A Privacy Protection Method for Power User Profiles That Integrates Improved Differential Privacy and Secret Sharing
A Privacy Protection Method for Power User Profiles That Integrates Improved Differential Privacy and Secret Sharing
ABSTRACT
In response to the privacy leakage risks inherent in the big data processing of power user personas, propose a collaborative optimiz...

