Data Augmentation for Histopathological Images Based on Gaussian-Laplacian Pyramid Blending

01/31/2020
by Steve Tsham Mpinda Ataky, et al.

Data imbalance is a major problem that affects many machine learning algorithms. It is troublesome because most learning algorithms attempt to optimize a loss function based on error measures that do not take the imbalance into account. As a result, the learning algorithm simply generates a trivial model that is biased toward predicting the most frequent class in the training data. Data augmentation techniques have been used to mitigate the data imbalance problem. However, in the case of histopathological images (HIs), both low-level and high-level data augmentation techniques still present performance issues when applied in the presence of inter-patient variability, because the model tends to learn color representations that are in fact related to the staining process. In this paper, we propose an approach capable of not only augmenting an HI database but also distributing the inter-patient variability by means of image blending using a Gaussian-Laplacian pyramid. The proposed approach consists of computing the Gaussian pyramids of two images from different patients and deriving their Laplacian pyramids. Afterwards, the left half of one image and the right half of the other are joined at each level of the Laplacian pyramid, and a blended image is reconstructed from the joined pyramid. This composition, resulting from the blending process, combines the stain variation of two patients, preventing color from misleading the learning process. Experimental results on the BreakHis dataset have shown promising gains vis-à-vis the majority of traditional techniques presented in the literature.
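
The blending steps described in the abstract can be illustrated with a minimal NumPy sketch. It is not the authors' implementation: the 5-tap binomial blur, the nearest-neighbor upsampling, the default of 4 pyramid levels, and all function names (`blend_halves`, `laplacian_pyramid`, and so on) are assumptions chosen for illustration. Both inputs are assumed to be arrays of the same shape, either H x W or H x W x C.

```python
import numpy as np

def _blur(img):
    # Separable 5-tap binomial filter [1, 4, 6, 4, 1] / 16 (an assumed kernel),
    # applied along rows and columns with edge replication.
    k = np.array([1.0, 4.0, 6.0, 4.0, 1.0]) / 16.0
    out = img
    for axis in (0, 1):
        pad = [(0, 0)] * out.ndim
        pad[axis] = (2, 2)
        padded = np.pad(out, pad, mode="edge")
        n = out.shape[axis]
        out = sum(k[i] * np.take(padded, np.arange(i, i + n), axis=axis)
                  for i in range(5))
    return out

def downsample(img):
    # Blur, then keep every second row and column.
    return _blur(img)[::2, ::2]

def upsample(img, shape):
    # Nearest-neighbor expansion, cropped to the target size, then blurred.
    up = np.repeat(np.repeat(img, 2, axis=0), 2, axis=1)
    return _blur(up[:shape[0], :shape[1]])

def gaussian_pyramid(img, levels):
    pyr = [np.asarray(img, dtype=np.float64)]
    for _ in range(levels):
        pyr.append(downsample(pyr[-1]))
    return pyr

def laplacian_pyramid(img, levels):
    g = gaussian_pyramid(img, levels)
    # Each Laplacian level is the detail lost between two Gaussian levels.
    lap = [g[i] - upsample(g[i + 1], g[i].shape) for i in range(levels)]
    lap.append(g[-1])  # coarsest Gaussian level closes the pyramid
    return lap

def reconstruct(lap):
    img = lap[-1]
    for layer in reversed(lap[:-1]):
        img = upsample(img, layer.shape) + layer
    return img

def blend_halves(img_a, img_b, levels=4):
    # Join the left half of A with the right half of B at every level,
    # then collapse the joined pyramid into one blended image.
    la = laplacian_pyramid(img_a, levels)
    lb = laplacian_pyramid(img_b, levels)
    joined = []
    for a, b in zip(la, lb):
        half = a.shape[1] // 2
        joined.append(np.concatenate([a[:, :half], b[:, half:]], axis=1))
    return reconstruct(joined)
```

Calling `blend_halves(patch_a, patch_b)` on two same-sized patches from different patients returns a floating-point image of the same shape. Because the halves are joined at every pyramid level rather than directly in pixel space, the seam is smoothed across scales instead of appearing as a hard vertical edge.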

research
06/01/2023

CAISA at SemEval-2023 Task 8: Counterfactual Data Augmentation for Mitigating Class Imbalance in Causal Claim Identification

The class imbalance problem can cause machine learning models to produce...
research
05/23/2023

A Laplacian Pyramid Based Generative H&E Stain Augmentation Network

Hematoxylin and Eosin (H&E) staining is a widely used sample preparati...
research
03/20/2022

Transparency strategy-based data augmentation for BI-RADS classification of mammograms

Image augmentation techniques have been widely investigated to improve t...
research
07/02/2020

Can We Achieve More with Less? Exploring Data Augmentation for Toxic Comment Classification

This paper tackles one of the greatest limitations in Machine Learning: ...
research
05/16/2018

Lightweight Pyramid Networks for Image Deraining

Existing deep convolutional neural networks have found major success in ...
research
02/07/2023

Data augmentation for machine learning of chemical process flowsheets

Artificial intelligence has great potential for accelerating the design ...
research
01/06/2022

An unambiguous cloudiness index for nonwovens

Cloudiness or formation is a concept routinely used in industry to addre...
