MedDiff: Generating Electronic Health Records using Accelerated Denoising Diffusion Model

02/08/2023
by   Huan He, et al.
0

Due to patient privacy protection concerns, machine learning research in healthcare has been undeniably slower and limited than in other application domains. High-quality, realistic, synthetic electronic health records (EHRs) can be leveraged to accelerate methodological developments for research purposes while mitigating privacy concerns associated with data sharing. The current state-of-the-art model for synthetic EHR generation is generative adversarial networks, which are notoriously difficult to train and can suffer from mode collapse. Denoising Diffusion Probabilistic Models, a class of generative models inspired by statistical thermodynamics, have recently been shown to generate high-quality synthetic samples in certain domains. It is unknown whether these can generalize to generation of large-scale, high-dimensional EHRs. In this paper, we present a novel generative model based on diffusion models that is the first successful application on electronic health records. Our model proposes a mechanism to perform class-conditional sampling to preserve label information. We also introduce a new sampling strategy to accelerate the inference speed. We empirically show that our model outperforms existing state-of-the-art synthetic EHR generation methods.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
02/28/2023

Synthesizing Mixed-type Electronic Health Records using Diffusion Models

Electronic Health Records (EHRs) contain sensitive patient information, ...
research
03/19/2017

Generating Multi-label Discrete Patient Records using Generative Adversarial Networks

Access to electronic health record (EHR) data has motivated computationa...
research
03/10/2023

EHRDiff: Exploring Realistic EHR Synthesis with Diffusion Models

Electronic health records (EHR) contain vast biomedical knowledge and ar...
research
04/04/2023

Synthesize Extremely High-dimensional Longitudinal Electronic Health Records via Hierarchical Autoregressive Language Model

Synthetic electronic health records (EHRs) that are both realistic and p...
research
06/01/2018

Natural Language Generation for Electronic Health Records

A variety of methods existing for generating synthetic electronic health...
research
04/21/2018

Learning from the experts: From expert systems to machine learned diagnosis models

Expert diagnostic support systems have been extensively studied. The pra...
research
11/15/2019

Explicit-Blurred Memory Network for Analyzing Patient Electronic Health Records

In recent years, we have witnessed an increased interest in temporal mod...

Please sign up or login with your details

Forgot password? Click here to reset