Towards Boosting the Accuracy of Non-Latin Scene Text Recognition

01/10/2022
by Sanjana Gunna, et al.

Scene-text recognition is markedly more accurate for Latin languages than for non-Latin languages, owing to factors such as the availability of multiple fonts, simplistic vocabulary statistics, updated data generation tools, and differences in writing systems. This paper examines the possible reasons for the low accuracy by comparing English datasets with non-Latin ones. We compare features such as the size (width and height) of the word images and word length statistics. Over the last decade, generating synthetic datasets with powerful deep learning techniques has greatly improved scene-text recognition. We perform several controlled experiments on English by varying (i) the number of fonts used to create the synthetic data and (ii) the number of word images created. We find that both factors are critical for scene-text recognition systems. The English synthetic datasets use over 1400 fonts, while Arabic and other non-Latin datasets use fewer than 100 fonts for data generation. Since some of these languages are used across different regions, we gather additional fonts through a region-based search to improve the scene-text recognition models for Arabic and Devanagari. We improve the Word Recognition Rates (WRRs) on the Arabic MLT-17 and MLT-19 datasets by 24.54 [...] achieve WRR gains of 7.88 [...] datasets.
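
The abstract reports its results as Word Recognition Rates and WRR gains. As a minimal sketch, assuming WRR is the percentage of word images whose predicted string exactly matches the label and that a "gain" is an absolute difference in percentage points, the metric could be computed as below. The function names and toy data are hypothetical and not taken from the paper.

```python
from typing import Sequence


def word_recognition_rate(predictions: Sequence[str],
                          ground_truth: Sequence[str]) -> float:
    """Percentage of words predicted exactly right (assumed WRR definition)."""
    if len(predictions) != len(ground_truth):
        raise ValueError("predictions and ground_truth must have the same length")
    if not ground_truth:
        raise ValueError("cannot compute WRR on an empty dataset")
    # A word counts as correct only if the whole string matches the label.
    correct = sum(p == g for p, g in zip(predictions, ground_truth))
    return 100.0 * correct / len(ground_truth)


def wrr_gain(baseline_wrr: float, improved_wrr: float) -> float:
    """Absolute WRR gain in percentage points (assumption, see lead-in)."""
    return improved_wrr - baseline_wrr


if __name__ == "__main__":
    # Toy labels and model outputs, purely illustrative.
    labels = ["exit", "open", "road", "shop"]
    baseline_preds = ["exit", "opem", "road", "shap"]
    improved_preds = ["exit", "opem", "road", "shop"]

    base = word_recognition_rate(baseline_preds, labels)
    new = word_recognition_rate(improved_preds, labels)
    print(f"baseline WRR: {base:.2f}")
    print(f"improved WRR: {new:.2f}")
    print(f"gain: {wrr_gain(base, new):.2f}")
```

Exact string matching is a strict criterion: a single wrong character makes the whole word count as an error, which is one reason WRR is sensitive to the font and data coverage discussed in the abstract.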

research
01/10/2022

Transfer Learning for Scene Text Recognition in Indian Languages

Scene text recognition in low-resource Indian languages is challenging b...
research
11/07/2017

Unconstrained Scene Text and Video Text Recognition for Arabic Script

Building robust recognizers for Arabic has always been challenging. We d...
research
04/09/2021

Benchmarking Scene Text Recognition in Devanagari, Telugu and Malayalam

Inspired by the success of Deep Learning based approaches to English sce...
research
09/27/2018

Cursive Scene Text Analysis by Deep Convolutional Linear Pyramids

The camera captured images have various aspects to investigate. Generall...
research
11/26/2021

Traditional Chinese Synthetic Datasets Verified with Labeled Data for Scene Text Recognition

Scene text recognition (STR) has been widely studied in academia and ind...
research
01/02/2019

Lipi Gnani - A Versatile OCR for Documents in any Language Printed in Kannada Script

A Kannada OCR, named Lipi Gnani, has been designed and developed from sc...
research
07/09/2018

Universal Word Segmentation: Implementation and Interpretation

Word segmentation is a low-level NLP task that is non-trivial for a cons...
