Knowledge-Guided Data-Centric AI in Healthcare: Progress, Shortcomings, and Future Directions

12/27/2022
by   Edward Y. Chang, et al.
0

The success of deep learning is largely due to the availability of large amounts of training data that cover a wide range of examples of a particular concept or meaning. In the field of medicine, having a diverse set of training data on a particular disease can lead to the development of a model that is able to accurately predict the disease. However, despite the potential benefits, there have not been significant advances in image-based diagnosis due to a lack of high-quality annotated data. This article highlights the importance of using a data-centric approach to improve the quality of data representations, particularly in cases where the available data is limited. To address this "small-data" issue, we discuss four methods for generating and aggregating training data: data augmentation, transfer learning, federated learning, and GANs (generative adversarial networks). We also propose the use of knowledge-guided GANs to incorporate domain knowledge in the training data generation process. With the recent progress in large pre-trained language models, we believe it is possible to acquire high-quality knowledge that can be used to improve the effectiveness of knowledge-guided generative methods.

READ FULL TEXT

page 4

page 8

page 12

page 13

page 14

page 15

page 17

research
12/08/2020

Data Instance Prior for Transfer Learning in GANs

Recent advances in generative adversarial networks (GANs) have shown rem...
research
07/14/2017

Knowledge will Propel Machine Understanding of Content: Extrapolating from Current Examples

Machine Learning has been a big success story during the AI resurgence. ...
research
03/30/2023

KD-DLGAN: Data Limited Image Generation via Knowledge Distillation

Generative Adversarial Networks (GANs) rely heavily on large-scale train...
research
07/04/2022

GAN-based generation of realistic 3D data: A systematic review and taxonomy

Data has become the most valuable resource in today's world. With the ma...
research
12/04/2021

Hyper-GAN: Transferring Unconditional to Conditional GANs with HyperNetworks

Conditional GANs have matured in recent years and are able to generate h...
research
10/04/2021

GenCo: Generative Co-training on Data-Limited Image Generation

Training effective Generative Adversarial Networks (GANs) requires large...
research
05/20/2018

Generating High-Quality Surface Realizations Using Data Augmentation and Factored Sequence Models

This work presents a new state of the art in reconstruction of surface r...

Please sign up or login with your details

Forgot password? Click here to reset