Learning to Read by Spelling: Towards Unsupervised Text Recognition

09/23/2018
by   Ankush Gupta, et al.
6

This work presents a method for visual text recognition without using any paired supervisory data. We formulate the text recognition task as one of aligning the conditional distribution of strings predicted from given text images, with lexically valid strings sampled from target corpora. This enables fully automated, and unsupervised learning from just line-level text-images, and unpaired text-string samples, obviating the need for large aligned datasets. We present detailed analysis for various aspects of the proposed method, namely - (1) the impact of the length of training sequences on convergence, (2) relation between character frequencies and the order in which they are learnt, and (3) demonstrate the generalisation ability of our recognition network to inputs of arbitrary lengths. Finally, we demonstrate excellent text recognition accuracy on both synthetically generated text images, and scanned images of real printed books, using no labelled training examples.

READ FULL TEXT

page 3

page 4

page 5

page 6

page 8

page 12

page 13

page 14

research
12/31/2018

Accurate, Data-Efficient, Unconstrained Text Recognition with Convolutional Neural Networks

Unconstrained text recognition is an important computer vision task, fea...
research
02/03/2023

The Learnable Typewriter: A Generative Approach to Text Line Analysis

We present a generative document-specific approach to character analysis...
research
08/13/2023

Free-ATM: Exploring Unsupervised Learning on Diffusion-Generated Images with Free Attention Masks

Despite the rapid advancement of unsupervised learning in visual represe...
research
01/13/2020

Separating Content from Style Using Adversarial Learning for Recognizing Text in the Wild

In this work we propose to improve text recognition from a new perspecti...
research
08/24/2023

LISTER: Neighbor Decoding for Length-Insensitive Scene Text Recognition

The diversity in length constitutes a significant characteristic of text...
research
06/26/2019

Leveraging Text Repetitions and Denoising Autoencoders in OCR Post-correction

A common approach for improving OCR quality is a post-processing step ba...
research
06/30/2020

Using Human Psychophysics to Evaluate Generalization in Scene Text Recognition Models

Scene text recognition models have advanced greatly in recent years. Ins...

Please sign up or login with your details

Forgot password? Click here to reset