GradMix for nuclei segmentation and classification in imbalanced pathology image datasets

10/24/2022
by   Tan Nhu Nhat Doan, et al.
0

An automated segmentation and classification of nuclei is an essential task in digital pathology. The current deep learning-based approaches require a vast amount of annotated datasets by pathologists. However, the existing datasets are imbalanced among different types of nuclei in general, leading to a substantial performance degradation. In this paper, we propose a simple but effective data augmentation technique, termed GradMix, that is specifically designed for nuclei segmentation and classification. GradMix takes a pair of a major-class nucleus and a rare-class nucleus, creates a customized mixing mask, and combines them using the mask to generate a new rare-class nucleus. As it combines two nuclei, GradMix considers both nuclei and the neighboring environment by using the customized mixing mask. This allows us to generate realistic rare-class nuclei with varying environments. We employed two datasets to evaluate the effectiveness of GradMix. The experimental results suggest that GradMix is able to improve the performance of nuclei segmentation and classification in imbalanced pathology image datasets.

READ FULL TEXT

page 3

page 6

page 8

research
06/25/2023

DiffMix: Diffusion Model-based Data Synthesis for Nuclei Segmentation and Classification in Imbalanced Pathology Image Datasets

Nuclei segmentation and classification is a significant process in patho...
research
04/06/2023

A review of ensemble learning and data augmentation models for class imbalanced problems: combination, implementation and evaluation

Class imbalance (CI) in classification problems arises when the number o...
research
04/20/2023

Is augmentation effective to improve prediction in imbalanced text datasets?

Imbalanced datasets present a significant challenge for machine learning...
research
06/18/2021

RSG: A Simple but Effective Module for Learning Imbalanced Datasets

Imbalanced datasets widely exist in practice and area great challenge fo...
research
02/23/2022

Image Classification on Small Datasets via Masked Feature Mixing

Deep convolutional neural networks require large amounts of labeled data...
research
01/16/2021

Improve Global Glomerulosclerosis Classification with Imbalanced Data using CircleMix Augmentation

The classification of glomerular lesions is a routine and essential task...
research
12/13/2020

Improving the Classification of Rare Chords with Unlabeled Data

In this work, we explore techniques to improve performance for rare clas...

Please sign up or login with your details

Forgot password? Click here to reset