Improving patch-based scene text script identification with ensembles of conjoined networks

02/24/2016
by   Lluis Gómez, et al.
0

This paper focuses on the problem of script identification in scene text images. Facing this problem with state of the art CNN classifiers is not straightforward, as they fail to address a key characteristic of scene text instances: their extremely variable aspect ratio. Instead of resizing input images to a fixed aspect ratio as in the typical use of holistic CNN classifiers, we propose here a patch-based classification framework in order to preserve discriminative parts of the image that are characteristic of its class. We describe a novel method based on the use of ensembles of conjoined networks to jointly learn discriminative stroke-parts representations and their relative importance in a patch-based classification scheme. Our experiments with this learning procedure demonstrate state-of-the-art results in two public script identification datasets. In addition, we propose a new public benchmark dataset for the evaluation of multi-lingual scene text end-to-end reading systems. Experiments done in this dataset demonstrate the key role of script identification in a complete end-to-end system that combines our script identification method with a previously published text detector and an off-the-shelf OCR engine.

READ FULL TEXT
research
02/24/2016

A fine-grained approach to scene text script identification

This paper focuses on the problem of script identification in unconstrai...
research
12/09/2019

Patch Aggregator for Scene Text Script Identification

Script identification in the wild is of great importance in a multi-ling...
research
01/01/2018

Script Identification in Natural Scene Image and Video Frame using Attention based Convolutional-LSTM Network

Script identification facilitates many important applications in documen...
research
07/01/2019

ICDAR2019 Robust Reading Challenge on Multi-lingual Scene Text Detection and Recognition -- RRC-MLT-2019

With the growing cosmopolitan culture of modern cities, the need of robu...
research
12/01/2021

On-Device Spatial Attention based Sequence Learning Approach for Scene Text Script Identification

Automatic identification of script is an essential component of a multil...
research
10/03/2020

End-to-End Training of CNN Ensembles for Person Re-Identification

We propose an end-to-end ensemble method for person re-identification (R...
research
03/05/2022

An End-to-End Approach for Seam Carving Detection using Deep Neural Networks

Seam carving is a computational method capable of resizing images for bo...

Please sign up or login with your details

Forgot password? Click here to reset