TextAdaIN: Fine-Grained AdaIN for Robust Text Recognition

05/09/2021
by Oren Nuriel, et al.

Leveraging the characteristics of convolutional layers, image classifiers are extremely effective. However, recent work has shown that in many cases they rely excessively on global image statistics that are easy to manipulate while preserving image semantics. In text recognition, we reveal that it is instead the local image statistics on which the networks depend too heavily. Motivated by this, we suggest an approach to regulate the reliance on local statistics that improves overall text recognition performance. Our method, termed TextAdaIN, creates local distortions in the feature map which prevent the network from overfitting to the local statistics. It does so by deliberately mismatching fine-grained feature statistics between samples in a mini-batch. Despite TextAdaIN's simplicity, extensive experiments show its effectiveness compared to other, more complicated methods. TextAdaIN achieves state-of-the-art results on standard handwritten text recognition benchmarks. Additionally, it generalizes to multiple architectures and to the domain of scene text recognition. Furthermore, we demonstrate that integrating TextAdaIN improves robustness towards image corruptions.
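
The idea of mismatching fine-grained feature statistics between samples in a mini-batch can be illustrated with a rough NumPy sketch. This is not the authors' implementation: the abstract does not specify how the statistics are computed, so the sketch assumes that the feature map is split into equal windows along the width axis, that a per-channel mean and standard deviation is taken over each window, and that each sample is re-styled with the window statistics of another sample chosen by a batch permutation. The function name `local_stat_swap` and the parameters `num_windows`, `perm`, and `eps` are hypothetical.

```python
import numpy as np


def local_stat_swap(x, num_windows, perm, eps=1e-5):
    """Swap per-window feature statistics between samples (illustrative only).

    x           : feature map of shape (B, C, H, W)
    num_windows : number of equal windows along the width axis (assumed layout)
    perm        : permutation of range(B) choosing whose statistics each
                  sample receives
    """
    b, c, h, w = x.shape
    if w % num_windows != 0:
        raise ValueError("width must be divisible by num_windows")

    # Split the width axis into windows: (B, C, H, K, W // K).
    xw = x.reshape(b, c, h, num_windows, w // num_windows)

    # Per-sample, per-channel, per-window statistics: (B, C, 1, K, 1).
    mean = xw.mean(axis=(2, 4), keepdims=True)
    std = np.sqrt(xw.var(axis=(2, 4), keepdims=True) + eps)

    # AdaIN-style step: normalize with own statistics, then re-scale and
    # re-shift with the statistics of the sample selected by perm.
    normalized = (xw - mean) / std
    out = normalized * std[perm] + mean[perm]
    return out.reshape(b, c, h, w)


# Small usage example with random features and a random batch permutation.
rng = np.random.default_rng(0)
features = rng.normal(size=(4, 3, 2, 8))
batch_perm = rng.permutation(features.shape[0])
distorted = local_stat_swap(features, num_windows=4, perm=batch_perm)
```

With the identity permutation the function returns its input up to floating-point error, so in a training loop such a distortion could be applied only with some probability and disabled at inference; that scheduling is likewise an assumption here, not something stated in the abstract.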


research
03/27/2022

Knowledge Mining with Scene Text for Fine-Grained Recognition

Recently, the semantics of scene text has been proven to be essential in...
research
11/27/2018

Generating Attention from Classifier Activations for Fine-grained Recognition

Recent advances in fine-grained recognition utilize attention maps to lo...
research
07/26/2016

Semantic Clustering for Robust Fine-Grained Scene Recognition

In domain generalization, the knowledge learnt from one or multiple sour...
research
02/24/2016

A fine-grained approach to scene text script identification

This paper focuses on the problem of script identification in unconstrai...
research
08/05/2022

GLASS: Global to Local Attention for Scene-Text Spotting

In recent years, the dominant paradigm for text spotting is to combine t...
research
10/09/2020

Permuted AdaIN: Enhancing the Representation of Local Cues in Image Classifiers

Recent work has shown that convolutional neural network classifiers over...
