Towards Open-Set Text Recognition via Label-to-Prototype Learning

03/10/2022
by Chang Liu, et al.

Scene text recognition is a popular topic and can benefit various tasks. Although many methods have been proposed for close-set text recognition challenges, they cannot be directly applied to open-set scenarios, where the evaluation set contains novel characters that do not appear in the training set. Conventional methods require collecting new data and retraining the model to handle these novel characters, which is an expensive and tedious process. In this paper, we propose a label-to-prototype learning framework that handles novel characters without retraining the model. In the proposed framework, novel characters are mapped to their corresponding prototypes by a label-to-prototype learning module. This module is trained on characters with seen labels and generalizes easily to novel characters. Additionally, feature-level rectification is conducted via a topology-preserving transformation, resulting in better alignment between visual features and the constructed prototypes while having a reasonably small impact on model speed. Extensive experiments show that our method achieves promising performance on a variety of zero-shot, close-set, and open-set text recognition datasets.
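
The idea of recognizing characters by matching visual features against per-label prototypes can be illustrated with a minimal sketch. This is not the authors' implementation: the function names (`build_prototypes`, `classify`), the toy three-dimensional vectors, the cosine-similarity matching rule, and the `reject_threshold` value are all assumptions made for illustration. In the paper the prototypes come from a learned label-to-prototype module, whereas here they are simply normalized vectors supplied by hand.

```python
import math


def normalize(vec):
    """Scale a vector to unit length so dot products become cosine similarities."""
    norm = math.sqrt(sum(x * x for x in vec))
    if norm == 0.0:
        raise ValueError("cannot normalize a zero vector")
    return [x / norm for x in vec]


def build_prototypes(label_features):
    """Hypothetical stand-in for a label-to-prototype module.

    Maps each label to a unit-length prototype vector. A real system would
    produce these with a trained encoder rather than raw input vectors.
    """
    return {label: normalize(feat) for label, feat in label_features.items()}


def classify(feature, prototypes, reject_threshold=0.5):
    """Return (label, score) for the most similar prototype.

    The label is None when the best cosine similarity falls below
    reject_threshold (an assumed value), i.e. the input matches no known label.
    """
    query = normalize(feature)
    best_label, best_score = None, -1.0
    for label, proto in prototypes.items():
        score = sum(q * p for q, p in zip(query, proto))
        if score > best_score:
            best_label, best_score = label, score
    if best_score < reject_threshold:
        return None, best_score
    return best_label, best_score


# Prototypes for labels seen during training (toy values).
seen = {"a": [1.0, 0.1, 0.0], "b": [0.0, 1.0, 0.1]}
prototypes = build_prototypes(seen)

# A novel label is supported by adding its prototype; nothing is retrained.
prototypes.update(build_prototypes({"novel": [0.1, 0.0, 1.0]}))

label, score = classify([0.2, 0.0, 0.9], prototypes)
```

In this sketch, extending the label set only touches the prototype dictionary, which mirrors the abstract's claim that novel characters can be handled without retraining. The same query classified against the seen-only prototypes falls below the threshold and is rejected.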


