Two-Stage Resampling for Convolutional Neural Network Training in the Imbalanced Colorectal Cancer Image Classification

04/07/2020
by   Michał Koziarski, et al.
0

Data imbalance remains one of the open challenges in the contemporary machine learning. It is especially prevalent in case of medical data, such as histopathological images. Traditional data-level approaches for dealing with data imbalance are ill-suited for image data: oversampling methods such as SMOTE and its derivatives lead to creation of unrealistic synthetic observations, whereas undersampling reduces the amount of available data, critical for successful training of convolutional neural networks. To alleviate the problems associated with over- and undersampling we propose a novel two-stage resampling methodology, in which we initially use the oversampling techniques in the image space to leverage a large amount of data for training of a convolutional neural network, and afterwards apply undersampling in the feature space to fine-tune the last layers of the network. Experiments conducted on a colorectal cancer image dataset indicate the usefulness of the proposed approach.

READ FULL TEXT

page 4

page 6

research
04/23/2021

Research on the Detection Method of Breast Cancer Deep Convolutional Neural Network Based on Computer Aid

Traditional breast cancer image classification methods require manual ex...
research
11/28/2021

Imbalanced data preprocessing techniques utilizing local data characteristics

Data imbalance, that is the disproportion between the number of training...
research
07/14/2022

Learning Discriminative Representation via Metric Learning for Imbalanced Medical Image Classification

Data imbalance between common and rare diseases during model training of...
research
09/12/2019

Effective transfer learning for hyperspectral image classification with deep convolutional neural networks

Hyperspectral imaging is a rich source of data, allowing for multitude o...
research
06/02/2019

Radial-Based Undersampling for Imbalanced Data Classification

Data imbalance remains one of the most widespread problems affecting con...
research
04/17/2021

Potential Anchoring for imbalanced data classification

Data imbalance remains one of the factors negatively affecting the perfo...
research
06/26/2023

Topology Estimation of Simulated 4D Image Data by Combining Downscaling and Convolutional Neural Networks

Four-dimensional image-type data can quickly become prohibitively large,...

Please sign up or login with your details

Forgot password? Click here to reset