Synthetic Sample Selection via Reinforcement Learning

08/26/2020
by Jiarong Ye, et al.

Synthesizing realistic medical images provides a feasible solution to the shortage of training data in deep-learning-based medical image recognition systems. However, quality control of synthetic images used for data augmentation is under-investigated: some generated images are not realistic and may contain misleading features that distort the data distribution when mixed with real images. The effectiveness of synthetic images in medical image recognition systems therefore cannot be guaranteed when they are added randomly without quality assurance. In this work, we propose a reinforcement learning (RL) based synthetic sample selection method that learns to choose synthetic images containing reliable and informative features. A transformer-based controller is trained via proximal policy optimization (PPO), using the validation classification accuracy as the reward. The selected images are mixed with the original training data to improve the training of image recognition systems. To validate our method, we take pathology image recognition as an example and conduct extensive experiments on two histopathology image datasets. In experiments on a cervical dataset and a lymph node dataset, image classification performance improves by 8.1% and 2.3%, respectively, with our RL framework. Our proposed synthetic sample selection method is general and has great potential to boost the performance of various medical image recognition systems given limited annotation.
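
The selection-and-reward loop described in the abstract can be illustrated with a minimal Python sketch. This is not the authors' implementation: the transformer controller is replaced here by a plain list of per-image keep probabilities, the names `sample_selection`, `action_prob`, `clipped_surrogate` and `mix_training_set` are hypothetical, the reward baseline and all numeric values are assumptions made for illustration, and `eps=0.2` is simply a commonly used PPO clipping value. Only the overall idea (sample keep/discard decisions, mix the kept images with real data, use validation accuracy as the reward, and score the policy update with PPO's clipped surrogate objective) follows the abstract.

```python
import random

def sample_selection(keep_probs, rng):
    """Sample one keep (1) / discard (0) action per synthetic image."""
    return [1 if rng.random() < p else 0 for p in keep_probs]

def action_prob(keep_prob, action):
    """Probability the policy assigns to the sampled action."""
    return keep_prob if action == 1 else 1.0 - keep_prob

def clipped_surrogate(new_prob, old_prob, advantage, eps=0.2):
    """PPO clipped objective for one action: min(r*A, clip(r, 1-eps, 1+eps)*A)."""
    ratio = new_prob / old_prob
    clipped_ratio = max(1.0 - eps, min(ratio, 1.0 + eps))
    return min(ratio * advantage, clipped_ratio * advantage)

def mix_training_set(real, synthetic, actions):
    """Real images plus only the synthetic images the controller kept."""
    return real + [s for s, a in zip(synthetic, actions) if a == 1]

# Toy usage; every value below is hypothetical.
rng = random.Random(0)
old_probs = [0.5, 0.5, 0.5]   # controller output before the update
actions = sample_selection(old_probs, rng)
train_set = mix_training_set(
    ["real_0", "real_1"], ["syn_0", "syn_1", "syn_2"], actions
)

reward = 0.82     # assumed validation accuracy after training on train_set
baseline = 0.78   # assumed running average of earlier rewards
advantage = reward - baseline

new_probs = [0.6, 0.4, 0.5]   # assumed controller output after one update
objective = sum(
    clipped_surrogate(action_prob(n, a), action_prob(o, a), advantage)
    for n, o, a in zip(new_probs, old_probs, actions)
) / len(actions)
```

In a full system the controller would be updated by gradient ascent on `objective`, and the classifier would be retrained on each newly selected mix to obtain the next reward. The clipping keeps any single update from moving the selection policy too far from the one that produced the sampled actions.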
