A Synthetic Electrocardiogram (ECG) Image Generation Toolbox to Facilitate Deep Learning-Based Scanned ECG Digitization

Access to medical data is often limited because it contains protected health information (PHI), and using records that contain personally identifiable information raises privacy concerns. Recent advances have applied deep learning-based algorithms to clinical diagnosis and decision-making. However, deep learning models are data-greedy, whereas the availability of medical datasets for training and evaluating these models is relatively limited. Data augmentation with so-called digital twins is an emerging technique to address this need. This paper presents a novel approach for generating synthetic electrocardiogram (ECG) images with realistic artifacts from time-series data, for use in developing algorithms for the digitization of ECG images. Synthetic data are generated in a privacy-preserving manner: distortionless ECG images are first rendered on a standard ECG paper background. Next, various distortions, including handwritten text artifacts, wrinkles, creases, and perspective transforms, are applied to the ECG images. The artifacts are generated synthetically, without personally identifiable information. As a use case, we generated a large ECG image dataset of 21,801 records from the PhysioNet PTB-XL dataset, which contains 12-lead ECG time-series data from 18,869 patients. A deep ECG image digitization model was developed and trained on the synthetic dataset, and was then used to convert the synthetic images back to time-series data for evaluation. The signal-to-noise ratio (SNR) was calculated to assess the quality of the digitized signals against the ground-truth ECG time-series. The results show an average signal recovery SNR of 27±2.8 dB, indicating the value of the proposed synthetic ECG image dataset for training deep learning models.
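The abstract does not state the exact SNR formula used for evaluation. As a minimal sketch, assuming the common definition (ratio of the ground-truth signal power to the power of the reconstruction error, in decibels), the signal recovery SNR for one lead could be computed as follows. The function name `signal_recovery_snr_db` and the test signal are hypothetical and are not taken from the toolbox.

```python
import numpy as np


def signal_recovery_snr_db(reference, recovered):
    """SNR in dB between a ground-truth signal and its digitized estimate.

    Assumed definition (not confirmed by the source):
        SNR = 10 * log10( sum(x^2) / sum((x - x_hat)^2) )
    """
    reference = np.asarray(reference, dtype=float)
    recovered = np.asarray(recovered, dtype=float)
    if reference.shape != recovered.shape:
        raise ValueError("reference and recovered must have the same shape")

    signal_power = np.sum(reference ** 2)
    noise_power = np.sum((reference - recovered) ** 2)
    if noise_power == 0.0:
        # Perfect reconstruction: the ratio is unbounded.
        return float("inf")
    return float(10.0 * np.log10(signal_power / noise_power))


# Hypothetical example: a sinusoid stands in for one ECG lead, and the
# "digitized" version carries a 10% amplitude error.
t = np.linspace(0.0, 1.0, 500)
reference = np.sin(2.0 * np.pi * 5.0 * t)
recovered = 1.1 * reference
snr_db = signal_recovery_snr_db(reference, recovered)
```

With a 10% amplitude error the error power is 1% of the signal power, so this definition gives about 20 dB. Under the same assumed definition, the reported average of 27 dB would correspond to an error power of roughly 0.2% of the signal power.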


