Textual Data Augmentation for Patient Outcomes Prediction

11/13/2022
by Qiuhao Lu, et al.

Deep learning models have demonstrated superior performance in various healthcare applications. However, a major limitation of these models is often the lack of high-quality training data, owing to the private and sensitive nature of this field. In this study, we propose a novel textual data augmentation method that generates artificial clinical notes for patients' Electronic Health Records (EHRs), which can be used as additional training data for patient outcomes prediction. Essentially, we fine-tune the generative language model GPT-2 on the original training data to synthesize labeled text. More specifically, we propose a teacher-student framework in which we first pre-train a teacher model on the original data, and then train a student model on the GPT-augmented data under the guidance of the teacher. We evaluate our method on the most common patient outcome, i.e., the 30-day readmission rate. The experimental results show that deep models improve their predictive performance when trained with the augmented data, indicating the effectiveness of the proposed architecture.
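The abstract does not say how the teacher guides the student, so the following is only a minimal sketch under an assumption: that guidance takes the form of a standard knowledge-distillation objective, blending cross-entropy on the synthetic label with a KL term that pulls the student toward the teacher's softened predictions. The function names, `alpha`, and `temperature` are hypothetical and are not taken from the paper.

```python
import math


def softmax(logits, temperature=1.0):
    """Numerically stable softmax over a list of logits."""
    scaled = [z / temperature for z in logits]
    peak = max(scaled)
    exps = [math.exp(z - peak) for z in scaled]
    total = sum(exps)
    return [e / total for e in exps]


def distillation_loss(student_logits, teacher_logits, label,
                      alpha=0.5, temperature=2.0):
    """Assumed student objective for one (possibly synthetic) clinical note.

    alpha and temperature are illustrative hyperparameters, not paper values.
    """
    # Hard-label term: cross-entropy against the label attached to the note.
    student_probs = softmax(student_logits)
    hard = -math.log(student_probs[label])

    # Soft-label term: KL(teacher || student) on temperature-softened outputs.
    t_soft = softmax(teacher_logits, temperature)
    s_soft = softmax(student_logits, temperature)
    soft = sum(t * math.log(t / s) for t, s in zip(t_soft, s_soft) if t > 0)

    # The temperature**2 factor is the usual rescaling of the soft term.
    return alpha * hard + (1.0 - alpha) * (temperature ** 2) * soft


# Binary readmission example: index 1 = readmitted within 30 days.
loss = distillation_loss(student_logits=[0.2, 0.9],
                         teacher_logits=[-0.5, 1.5],
                         label=1)
```

Under this assumption, a synthetic note whose generated label disagrees with the teacher's prediction is still penalized through the soft term, which is one plausible way a teacher could dampen label noise in GPT-generated text.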


