SMILE: Sequence-to-Sequence Domain Adaption with Minimizing Latent Entropy for Text Image Recognition

02/24/2022
by   Yen-Cheng Chang, et al.
0

Training recognition models with synthetic images have achieved remarkable results in text recognition. However, recognizing text from real-world images still faces challenges due to the domain shift between synthetic and real-world text images. One of the strategies to eliminate the domain difference without manual annotation is unsupervised domain adaptation (UDA). Due to the characteristic of sequential labeling tasks, most popular UDA methods cannot be directly applied to text recognition. To tackle this problem, we proposed a UDA method with minimizing latent entropy on sequence-to-sequence attention-based models with classbalanced self-paced learning. Our experiments show that our proposed framework achieves better recognition results than the existing methods on most UDA text recognition benchmarks. All codes are publicly available.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
12/18/2014

Decomposition-Based Domain Adaptation for Real-World Font Recognition

We present a domain adaption framework to address a domain mismatch betw...
research
11/01/2022

Self-supervised Character-to-Character Distillation

Handling complicated text images (e.g., irregular structures, low resolu...
research
06/22/2020

Text Recognition in Real Scenarios with a Few Labeled Samples

Scene text recognition (STR) is still a hot research topic in computer v...
research
03/31/2015

Real-World Font Recognition Using Deep Network and Domain Adaptation

We address a challenging fine-grain classification problem: recognizing ...
research
06/07/2021

Unsupervised Representation Disentanglement of Text: An Evaluation on Synthetic Datasets

To highlight the challenges of achieving representation disentanglement ...
research
05/09/2023

Fashion CUT: Unsupervised domain adaptation for visual pattern classification in clothes using synthetic data and pseudo-labels

Accurate product information is critical for e-commerce stores to allow ...
research
01/21/2019

Generating Text Sequence Images for Recognition

Recently, methods based on deep learning have dominated the field of tex...

Please sign up or login with your details

Forgot password? Click here to reset