Invariance encoding in sliced-Wasserstein space for image classification with limited training data

01/09/2022
by   Mohammad Shifat-E-Rabbi, et al.
9

Deep convolutional neural networks (CNNs) are broadly considered to be state-of-the-art generic end-to-end image classification systems. However, they are known to underperform when training data are limited and thus require data augmentation strategies that render the method computationally expensive and not always effective. Rather than using a data augmentation strategy to encode invariances as typically done in machine learning, here we propose to mathematically augment a nearest subspace classification model in sliced-Wasserstein space by exploiting certain mathematical properties of the Radon Cumulative Distribution Transform (R-CDT), a recently introduced image transform. We demonstrate that for a particular type of learning problem, our mathematical solution has advantages over data augmentation with deep CNNs in terms of classification accuracy and computational complexity, and is particularly effective under a limited training data setting. The method is simple, effective, computationally efficient, non-iterative, and requires no parameters to be tuned. Python code implementing our method is available at https://github.com/rohdelab/mathematical_augmentation. Our method is integrated as a part of the software package PyTransKit, which is available at https://github.com/rohdelab/PyTransKit.

READ FULL TEXT

page 2

page 6

research
04/07/2020

Radon cumulative distribution transform subspace modeling for image classification

We present a new supervised image classification method for problems whe...
research
11/24/2020

Dissecting Image Crops

The elementary operation of cropping underpins nearly every computer vis...
research
04/30/2022

End-to-End Signal Classification in Signed Cumulative Distribution Transform Space

This paper presents a new end-to-end signal classification method using ...
research
02/08/2022

Equivariance versus Augmentation for Spherical Images

We analyze the role of rotational equivariance in convolutional neural n...
research
11/09/2020

MAGNeto: An Efficient Deep Learning Method for the Extractive Tags Summarization Problem

In this work, we study a new image annotation task named Extractive Tags...
research
06/13/2019

CoopSubNet: Cooperating Subnetwork for Data-Driven Regularization of Deep Networks under Limited Training Budgets

Deep networks are an integral part of the current machine learning parad...
research
01/12/2021

Mixup Without Hesitation

Mixup linearly interpolates pairs of examples to form new samples, which...

Please sign up or login with your details

Forgot password? Click here to reset