When and Why Test-Time Augmentation Works

11/23/2020
by Divya Shanmugam, et al.

Test-time augmentation (TTA), the aggregation of predictions across transformed versions of a test input, is a common practice in image classification. In this paper, we present theoretical and experimental analyses that shed light on (1) when test-time augmentation is likely to be helpful and (2) when to use various test-time augmentation policies. A key finding is that even when TTA produces a net improvement in accuracy, it can change many correct predictions into incorrect ones. We examine when and why test-time augmentation changes a prediction from correct to incorrect and vice versa. Our analysis suggests that the nature and amount of training data, the model architecture, and the augmentation policy all matter. Building on these insights, we present a learning-based method for aggregating test-time augmentations. Experiments across a diverse set of models, datasets, and augmentations show that our method delivers consistent improvements over existing approaches.
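
To make the abstract's terms concrete, the following is a minimal sketch of TTA by averaging, plus a count of the two kinds of prediction change described above: incorrect to correct ("corrections") and correct to incorrect ("corruptions"). Everything in it is an illustrative assumption rather than the paper's code: the names `tta_predict`, `count_flips`, and `toy_model`, the two-pixel "images", and the optional `weights` argument, which only stands in for the idea of a non-uniform aggregation and is not the learned method the paper proposes.

```python
from typing import Callable, List, Optional, Sequence, Tuple

Image = List[List[float]]          # toy image: rows of pixel intensities
Probs = List[float]                # class probabilities
Model = Callable[[Image], Probs]
Augmentation = Callable[[Image], Image]


def identity(image: Image) -> Image:
    """Return an unchanged copy of the image."""
    return [row[:] for row in image]


def hflip(image: Image) -> Image:
    """Flip the image horizontally."""
    return [row[::-1] for row in image]


def argmax(probs: Sequence[float]) -> int:
    """Index of the largest probability."""
    return max(range(len(probs)), key=lambda c: probs[c])


def tta_predict(model: Model, image: Image,
                augmentations: Sequence[Augmentation],
                weights: Optional[Sequence[float]] = None) -> Probs:
    """Aggregate predictions over augmented copies of one input.

    With weights=None this is a plain average. Passing weights is a
    hypothetical hook for a non-uniform aggregation.
    """
    if weights is None:
        weights = [1.0 / len(augmentations)] * len(augmentations)
    preds = [model(aug(image)) for aug in augmentations]
    n_classes = len(preds[0])
    return [sum(w * p[c] for w, p in zip(weights, preds))
            for c in range(n_classes)]


def count_flips(model: Model, images: Sequence[Image],
                labels: Sequence[int],
                augmentations: Sequence[Augmentation]) -> Tuple[int, int]:
    """Count (corrections, corruptions) caused by TTA relative to no TTA."""
    corrections = corruptions = 0
    for image, label in zip(images, labels):
        base = argmax(model(image))
        tta = argmax(tta_predict(model, image, augmentations))
        if base != label and tta == label:
            corrections += 1
        elif base == label and tta != label:
            corruptions += 1
    return corrections, corruptions


def toy_model(image: Image) -> Probs:
    """Hypothetical position-sensitive classifier: P(class 1) is the
    top-left pixel, so a horizontal flip changes its output."""
    p1 = image[0][0]
    return [1.0 - p1, p1]


# Three toy inputs, all with true label 1.
images = [[[0.9, 0.2]], [[0.4, 0.9]], [[0.6, 0.1]]]
labels = [1, 1, 1]
print(count_flips(toy_model, images, labels, [identity, hflip]))  # (1, 1)
```

In this toy setup the second input is corrected by TTA and the third is corrupted, so accuracy is unchanged even though two of the three predictions moved. That is the kind of per-example bookkeeping the abstract refers to.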


