Is dataset condensation a silver bullet for healthcare data sharing?

05/05/2023
by   Yujiang Wang, et al.
13

Safeguarding personal information is paramount for healthcare data sharing, a challenging issue without any silver bullet thus far. We study the prospect of a recent deep-learning advent, dataset condensation (DC), in sharing healthcare data for AI research, and the results are promising. The condensed data abstracts original records and irreversibly conceals individual-level knowledge to achieve a bona fide de-identification, which permits free sharing. Moreover, the original deep-learning utilities are well preserved in the condensed data with compressed volume and accelerated model convergences. In PhysioNet-2012, a condensed dataset of 20 samples can orient deep models attaining 80.3 of mortality prediction (versus 85.8 discovery generalised to MIMIC-III and Coswara datasets. We also interpret the inhere privacy protections of DC through theoretical analysis and empirical evidence. Dataset condensation opens a new gate to sharing healthcare data for AI research with multiple desirable traits.

READ FULL TEXT

page 11

page 13

page 36

page 37

research
04/05/2018

Processing of Electronic Health Records using Deep Learning: A review

Availability of large amount of clinical data is opening up new research...
research
08/31/2022

Non-readily identifiable data collaboration analysis for multiple datasets including personal information

Multi-source data fusion, in which multiple data sources are jointly ana...
research
09/17/2022

Non-Imaging Medical Data Synthesis for Trustworthy AI: A Comprehensive Survey

Data quality is the key factor for the development of trustworthy AI in ...
research
02/12/2021

MIMIC-IF: Interpretability and Fairness Evaluation of Deep Learning Models on MIMIC-IV Dataset

The recent release of large-scale healthcare datasets has greatly propel...
research
03/25/2023

Privacy-Enhancing Technologies in Federated Learning for the Internet of Healthcare Things: A Survey

Advancements in wearable medical devices in IoT technology are shaping t...
research
07/22/2023

Global Differential Privacy for Distributed Metaverse Healthcare Systems

Metaverse-enabled digital healthcare systems are expected to exploit an ...
research
08/26/2022

Another Use of SMOTE for Interpretable Data Collaboration Analysis

Recently, data collaboration (DC) analysis has been developed for privac...

Please sign up or login with your details

Forgot password? Click here to reset