TeLCoS: OnDevice Text Localization with Clustering of Script

04/16/2021
by   Rachit S Munjal, et al.
0

Recent research in the field of text localization in a resource constrained environment has made extensive use of deep neural networks. Scene text localization and recognition on low-memory mobile devices have a wide range of applications including content extraction, image categorization and keyword based image search. For text recognition of multi-lingual localized text, the OCR systems require prior knowledge of the script of each text instance. This leads to word script identification being an essential step for text recognition. Most existing methods treat text localization, script identification and text recognition as three separate tasks. This makes script identification an overhead in the recognition pipeline. To reduce this overhead, we propose TeLCoS: OnDevice Text Localization with Clustering of Script, a multi-task dual branch lightweight CNN network that performs real-time on device Text Localization and High-level Script Clustering simultaneously. The network drastically reduces the number of calls to a separate script identification module, by grouping and identifying some majorly used scripts through a single feed-forward pass over the localization network. We also introduce a novel structural similarity based channel pruning mechanism to build an efficient network with only 1.15M parameters. Experiments on benchmark datasets suggest that our method achieves state-of-the-art performance, with execution latency of 60 ms for the entire pipeline on the Exynos 990 chipset device.

READ FULL TEXT

page 1

page 2

page 3

page 4

page 6

page 8

research
06/21/2019

A Multitask Network for Localization and Recognition of Text in Images

We present an end-to-end trainable multi-task network that addresses the...
research
01/30/2018

E2E-MLT - an Unconstrained End-to-End Method for Multi-Language Scene Text

An end-to-end method for multi-language scene text localization, recogni...
research
05/17/2021

STRIDE : Scene Text Recognition In-Device

Optical Character Recognition (OCR) systems have been widely used in var...
research
12/01/2021

On-Device Spatial Attention based Sequence Learning Approach for Scene Text Script Identification

Automatic identification of script is an essential component of a multil...
research
10/22/2020

TLGAN: document Text Localization using Generative Adversarial Nets

Text localization from the digital image is the first step for the optic...
research
04/14/2015

Efficient Scene Text Localization and Recognition with Local Character Refinement

An unconstrained end-to-end text localization and recognition method is ...
research
12/09/2019

Patch Aggregator for Scene Text Script Identification

Script identification in the wild is of great importance in a multi-ling...

Please sign up or login with your details

Forgot password? Click here to reset