Search engine for discovering works of Art, research articles, and books related to Art and Culture
ShareThis
Javascript must be enabled to continue!

Federated Data Linkage in Practice

View through CrossRef
In recent years, great strides have been made towards the deployment of federated systems for data research, including exploring federated trusted research environments (TREs). These federated TREs allow data analysts to securely use data within multiple institutions with minimal overheads while preserving information governance. However, due to the sensitive nature of data utilised in data linkage pipelines, especially within privacy-preserving applications, the real-world deployment of federated data linkage is in its early stages.   There are several challenges to overcome with privacy-preserving federated linkage, including securely and efficiently sharing information that enables the identification of record similarities. Another key consideration is the resilience of the federated linkage ecosystem to information being added or removed as data providers move in and out of the initiative, as changes in the record set can cause intermediary links to be destroyed or created. Within non-federated linkage, a spine dataset such as a national census can provide an anchor on which incoming datasets can be linked. In federated linkage this may not explicitly be the case, especially if providers are globally dispersed or such a broad spine does not exist for the use case, and so there may be critical datasets within the overall system. We present a discussion over the real-world challenges to federated linkage on population-scale applications, including approaches which allow privacy-preserving linkage across multiple TREs and handling information governance requirements. We also explore the impact of the collaborative nature of federated linkage, looking at how quality and record links are affected in practice.
Title: Federated Data Linkage in Practice
Description:
In recent years, great strides have been made towards the deployment of federated systems for data research, including exploring federated trusted research environments (TREs).
These federated TREs allow data analysts to securely use data within multiple institutions with minimal overheads while preserving information governance.
However, due to the sensitive nature of data utilised in data linkage pipelines, especially within privacy-preserving applications, the real-world deployment of federated data linkage is in its early stages.
  There are several challenges to overcome with privacy-preserving federated linkage, including securely and efficiently sharing information that enables the identification of record similarities.
Another key consideration is the resilience of the federated linkage ecosystem to information being added or removed as data providers move in and out of the initiative, as changes in the record set can cause intermediary links to be destroyed or created.
Within non-federated linkage, a spine dataset such as a national census can provide an anchor on which incoming datasets can be linked.
In federated linkage this may not explicitly be the case, especially if providers are globally dispersed or such a broad spine does not exist for the use case, and so there may be critical datasets within the overall system.
We present a discussion over the real-world challenges to federated linkage on population-scale applications, including approaches which allow privacy-preserving linkage across multiple TREs and handling information governance requirements.
We also explore the impact of the collaborative nature of federated linkage, looking at how quality and record links are affected in practice.

Related Results

On a Framework for Federated Cluster Analysis
On a Framework for Federated Cluster Analysis
Federated learning is becoming increasingly popular to enable automated learning in distributed networks of autonomous partners without sharing raw data. Many works focus on superv...
Distributed Learning for Heart Disease Risk Prediction Based on Key Clinical Parameters with Evaluation Metrics Analysis
Distributed Learning for Heart Disease Risk Prediction Based on Key Clinical Parameters with Evaluation Metrics Analysis
Abstract The purpose of this study design and test a Decentralized Federated learning framework that integrates a Mutual Learning approach with a Hierarchical Dirichlet Pro...
Image-based crop disease detection with federated learning
Image-based crop disease detection with federated learning
Abstract Crop disease detection and management is critical to improving productivity, reducing costs, and promoting environmentally friendly crop treatment methods. Modern ...
Evaluation measure for group-based record linkage
Evaluation measure for group-based record linkage
Introduction The robustness of record linkage evaluation measures is of high importance since linkage techniques are assessed based on these. However, minimal research has been con...
Perspectives on linkage to care for patients diagnosed with HIV: A qualitative study at a rural health center in South Western Uganda
Perspectives on linkage to care for patients diagnosed with HIV: A qualitative study at a rural health center in South Western Uganda
Linkage to care for newly diagnosed human immunodeficiency virus (HIV) patients is important to ensure that patients have good access to care. However, there is little information ...
Security and Privacy in Healthcare Applications Using Federated Learning - Review
Security and Privacy in Healthcare Applications Using Federated Learning - Review
This review explored data security and privacy in Internet of Health Things (IoHT) networks, focusing on local training for AI models and Federated Learning (FL) to ensure privacy ...
DLforum – A multidisciplinary online discussion forum for data linkage researchers and practitioners
DLforum – A multidisciplinary online discussion forum for data linkage researchers and practitioners
Data linkage, the process of identifying records that refer to the same entities across databases, is a crucial component of Population Data Science. Data linkage has a history goi...
Towards a Holistic Approach to Chronic Disease Management: Integrating Federated Learning and IoT for Personalized health Care
Towards a Holistic Approach to Chronic Disease Management: Integrating Federated Learning and IoT for Personalized health Care
Chronic diseases, specifically Cardiovascular Disease (CVD), pose a substantial worldwide health obstacle, requiring innovative and comprehensive approaches to management. This stu...

Back to Top