Cursive Scene Text Analysis by Deep Convolutional Linear Pyramids

09/27/2018
by Saad Bin Ahmed, et al.
Camera-captured images can be investigated from many angles, and the emphasis of research generally depends on the regions of interest. The focus may be on color segmentation, object detection, or scene text analysis. Image analysis, visibility analysis, and layout analysis come easily to humans, but the same tasks are challenging when performed by machines. Learning machines learn from the properties of the samples they are given. Numerous approaches to scene text extraction and recognition have been designed in recent years, and efforts to improve accuracy are ongoing. Convolutional approaches have provided reasonable results on non-cursive text appearing in natural images. The work presented in this manuscript exploits the strength of linear pyramids by treating each pyramid as a feature of the provided sample. Each pyramid image is processed through various empirically selected kernels. Performance was investigated on Arabic text, using each image pyramid of the EASTR-42k dataset. An error rate of 0.17 is reported for Arabic scene text recognition.
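The idea of building an image pyramid and passing each level through a kernel can be illustrated with a minimal sketch. This is not the authors' implementation: the abstract does not define how the linear pyramids are built or which kernels are used, so the 2x2 block-averaging downsampler, the box kernel, the toy 8x8 image, and all function names below are assumptions made only for illustration.

```python
from typing import List

Image = List[List[float]]


def downsample(img: Image) -> Image:
    """Halve height and width by averaging each 2x2 block (assumed scheme)."""
    h, w = len(img) // 2, len(img[0]) // 2
    return [
        [
            (img[2 * r][2 * c] + img[2 * r][2 * c + 1]
             + img[2 * r + 1][2 * c] + img[2 * r + 1][2 * c + 1]) / 4.0
            for c in range(w)
        ]
        for r in range(h)
    ]


def build_pyramid(img: Image, levels: int) -> List[Image]:
    """Return up to `levels` images, each half the size of the previous one."""
    pyramid = [img]
    for _ in range(levels - 1):
        last = pyramid[-1]
        if len(last) < 2 or len(last[0]) < 2:
            break  # too small to halve again
        pyramid.append(downsample(last))
    return pyramid


def convolve_valid(img: Image, kernel: Image) -> Image:
    """Valid-mode 2D cross-correlation, the operation CNN layers compute."""
    kh, kw = len(kernel), len(kernel[0])
    out_h, out_w = len(img) - kh + 1, len(img[0]) - kw + 1
    return [
        [
            sum(img[r + i][c + j] * kernel[i][j]
                for i in range(kh) for j in range(kw))
            for c in range(out_w)
        ]
        for r in range(out_h)
    ]


# Toy 8x8 grayscale "image" and a hypothetical 2x2 box kernel.
image = [[float(r * 8 + c) for c in range(8)] for r in range(8)]
box_kernel = [[0.25, 0.25], [0.25, 0.25]]

pyramid = build_pyramid(image, levels=3)
# One feature map per pyramid level, as in "each pyramid as a feature".
features = [convolve_valid(level, box_kernel) for level in pyramid]
sizes = [(len(f), len(f[0])) for f in features]
```

In a real pipeline the kernels would be learned or selected empirically, as the abstract states, and a library routine would replace these hand-written loops.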

research
11/07/2017

Unconstrained Scene Text and Video Text Recognition for Arabic Script

Building robust recognizers for Arabic has always been challenging. We d...
research
01/26/2016

COCO-Text: Dataset and Benchmark for Text Detection and Recognition in Natural Images

This paper describes the COCO-Text dataset. In recent years large-scale ...
research
01/10/2022

Towards Boosting the Accuracy of Non-Latin Scene Text Recognition

Scene-text recognition is remarkably better in Latin languages than the ...
research
11/09/2012

NF-SAVO: Neuro-Fuzzy system for Arabic Video OCR

In this paper we propose a robust approach for text extraction and recog...
research
07/21/2017

Text Recognition in Scene Image and Video Frame using Color Channel Selection

In recent years, recognition of text from natural scene image and video ...
research
07/09/2019

BADAM: A Public Dataset for Baseline Detection in Arabic-script Manuscripts

The application of handwritten text recognition to historical works is h...
research
12/15/2014

CITlab ARGUS for Arabic Handwriting

In the recent years it turned out that multidimensional recurrent neural...
