Unconstrained Scene Text and Video Text Recognition for Arabic Script

11/07/2017
by Mohit Jain, et al.

Building robust recognizers for Arabic has always been challenging. We demonstrate the effectiveness of an end-to-end trainable CNN-RNN hybrid architecture in recognizing Arabic text in videos and natural scenes. We outperform the previous state of the art on two publicly available video text datasets, ALIF and ACTIV. For the scene text recognition task, we introduce a new Arabic scene text dataset and establish baseline results. For scripts like Arabic, a major challenge in developing robust recognizers is the lack of large quantities of annotated data. We overcome this by synthesising millions of Arabic text images from a large vocabulary of Arabic words and phrases. Our implementation is built on top of the model introduced in [37], which has proven quite effective for English scene text recognition. The model follows a segmentation-free, sequence-to-sequence transcription approach: the network transcribes a sequence of convolutional features extracted from the input image into a sequence of target labels. This removes the need to segment the input image into its constituent characters or glyphs, which is often difficult for Arabic script. Further, the ability of RNNs to model contextual dependencies yields superior recognition results.
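
To make the segmentation-free transcription step concrete, the following is a minimal sketch of how per-frame network outputs could be collapsed into a label sequence. It assumes a CTC-style output layer with a reserved "blank" class and greedy (best-path) decoding; the abstract does not name the decoding scheme, so this is an illustrative assumption and not necessarily the authors' method. The names `BLANK`, `ALPHABET`, `greedy_ctc_decode`, and the toy scores in `frames` are all hypothetical.

```python
from itertools import groupby

BLANK = 0  # hypothetical class index reserved for the CTC "blank" label
ALPHABET = ["\u0627", "\u0628"]  # toy label set: alef (class 1), beh (class 2)


def greedy_ctc_decode(frame_scores, alphabet):
    """Best-path decoding: argmax per frame, merge repeats, drop blanks."""
    # Pick the highest-scoring class index for every frame.
    best_path = [
        max(range(len(scores)), key=scores.__getitem__)
        for scores in frame_scores
    ]
    # Merge runs of identical indices, then discard the blank label.
    collapsed = [idx for idx, _ in groupby(best_path) if idx != BLANK]
    # Class k maps to alphabet[k - 1] because class 0 is the blank.
    return "".join(alphabet[idx - 1] for idx in collapsed)


# Made-up scores for seven frames over the classes [blank, alef, beh].
frames = [
    [0.10, 0.20, 0.70],  # beh
    [0.20, 0.10, 0.70],  # beh (repeat, merged with the previous frame)
    [0.80, 0.10, 0.10],  # blank
    [0.10, 0.80, 0.10],  # alef
    [0.20, 0.70, 0.10],  # alef (repeat)
    [0.90, 0.05, 0.05],  # blank
    [0.10, 0.10, 0.80],  # beh
]

decoded = greedy_ctc_decode(frames, ALPHABET)
print(decoded)  # beh, alef, beh
```

No frame is ever assigned to a specific character region of the image: the label sequence emerges from collapsing the per-frame predictions, which is why no explicit character or glyph segmentation is needed under this assumption.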


research
08/14/2013

Arabic Text Recognition in Video Sequences

In this paper, we propose a robust approach for text extraction and reco...
research
09/27/2018

Cursive Scene Text Analysis by Deep Convolutional Linear Pyramids

The camera captured images have various aspects to investigate. Generall...
research
09/04/2020

A Hybrid Deep Learning Model for Arabic Text Recognition

Arabic text recognition is a challenging task because of the cursive nat...
research
11/09/2012

NF-SAVO: Neuro-Fuzzy system for Arabic Video OCR

In this paper we propose a robust approach for text extraction and recog...
research
04/09/2021

Benchmarking Scene Text Recognition in Devanagari, Telugu and Malayalam

Inspired by the success of Deep Learning based approaches to English sce...
research
01/10/2022

Towards Boosting the Accuracy of Non-Latin Scene Text Recognition

Scene-text recognition is remarkably better in Latin languages than the ...
research
06/06/2023

Take the Hint: Improving Arabic Diacritization with Partially-Diacritized Text

Automatic Arabic diacritization is useful in many applications, ranging ...
