Anonymize or synthesize? Privacy-preserving methods for heart failure score analytics
Abstract
Aims
Data availability remains a critical challenge in modern, data-driven medical research. Because patient health records are sensitive, they are rightfully subject to stringent privacy protection measures. One way to make such data available despite these restrictions is to preserve patient privacy through anonymization and synthetization strategies. In this work, we investigate the effectiveness of these methods for protecting patient privacy using real-world cardiology health records.
Methods and results
We implemented anonymization and synthetization techniques for a structured data set, which was collected during the HiGHmed Use Case Cardiology study. We employed the data anonymization tool ARX and the data synthetization framework ASyH individually and in combination. We evaluated the utility and shortcomings of the different approaches by statistical analyses and privacy risk assessments. Data utility was assessed by computing two heart failure risk scores on the protected data sets. We observed only minimal deviations from the scores computed on the original data set. Additionally, we performed a re-identification risk analysis and found only minor residual risks for common types of privacy threats.
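To illustrate what a re-identification risk analysis of this kind can measure, the following is a minimal sketch and not the procedure used in the study. It assumes the common equivalence-class model, in which records sharing the same quasi-identifier values are indistinguishable and each record's risk is one divided by the size of its class. The function names, the column names (`age_group`, `sex`), and the toy records are hypothetical and do not come from the HiGHmed data set or from the ARX API.

```python
from collections import Counter


def equivalence_class_sizes(records, quasi_identifiers):
    """Count the records that share each combination of quasi-identifier values."""
    return Counter(tuple(r[q] for q in quasi_identifiers) for r in records)


def reidentification_risks(records, quasi_identifiers):
    """Summarize re-identification risk under the equivalence-class model.

    Each record's risk is 1 / (size of its equivalence class).
    """
    sizes = equivalence_class_sizes(records, quasi_identifiers)
    per_record = [
        1.0 / sizes[tuple(r[q] for q in quasi_identifiers)] for r in records
    ]
    return {
        "k": min(sizes.values()),            # smallest class size (k-anonymity level)
        "highest_risk": max(per_record),     # risk of the most exposed record
        "average_risk": sum(per_record) / len(per_record),
    }


# Hypothetical toy records; the attribute names are illustrative assumptions.
toy_records = [
    {"age_group": "60-69", "sex": "F"},
    {"age_group": "60-69", "sex": "F"},
    {"age_group": "70-79", "sex": "M"},
    {"age_group": "70-79", "sex": "M"},
    {"age_group": "70-79", "sex": "M"},
    {"age_group": "80-89", "sex": "F"},
]

risks = reidentification_risks(toy_records, ["age_group", "sex"])
print(risks)
```

In this toy example one record is unique in its quasi-identifiers, so the smallest class has size 1 and the highest per-record risk is 1.0. Generalizing or suppressing values until every class contains at least k records is the kind of transformation an anonymization tool applies to lower these figures.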
Conclusion
We demonstrated that anonymization and synthetization methods protect privacy while retaining data utility for heart failure risk assessment. Both approaches, and a combination thereof, introduce only minimal deviations from the original data set across all features. While data synthesis techniques can produce any number of new records, data anonymization techniques offer more formal privacy guarantees. Consequently, data synthesis on anonymized data further enhances privacy protection with little impact on data utility. We share all generated data sets with the scientific community through a use and access agreement.
Oxford University Press (OUP)

