A Fast Hierarchical Method for Multi-script and Arbitrary Oriented Scene Text Extraction

07/28/2014
by   Lluis Gómez, et al.
0

Typography and layout lead to the hierarchical organisation of text in words, text lines, paragraphs. This inherent structure is a key property of text in any script and language, which has nonetheless been minimally leveraged by existing text detection methods. This paper addresses the problem of text segmentation in natural scenes from a hierarchical perspective. Contrary to existing methods, we make explicit use of text structure, aiming directly to the detection of region groupings corresponding to text within a hierarchy produced by an agglomerative similarity clustering process over individual regions. We propose an optimal way to construct such an hierarchy introducing a feature space designed to produce text group hypotheses with high recall and a novel stopping rule combining a discriminative classifier and a probabilistic measure of group meaningfulness based in perceptual organization. Results obtained over four standard datasets, covering text in variable orientations and different languages, demonstrate that our algorithm, while being trained in a single mixed dataset, outperforms state of the art methods in unconstrained scenarios.

READ FULL TEXT

page 1

page 3

page 4

page 7

page 8

page 9

page 10

research
11/16/2019

SA-Text: Simple but Accurate Detector for Text of Arbitrary Shapes

We introduce a new framework for text detection named SA-Text meaning "S...
research
05/05/2022

Exploiting Global and Local Hierarchies for Hierarchical Text Classification

Hierarchical text classification aims to leverage label hierarchy in mul...
research
11/21/2018

Scene Text Detection with Supervised Pyramid Context Network

Scene text detection methods based on deep learning have achieved remark...
research
09/28/2022

Leveraging machine learning for less developed languages: Progress on Urdu text detection

Text detection in natural scene images has applications for autonomous d...
research
11/26/2015

OntoSeg: a Novel Approach to Text Segmentation using Ontological Similarity

Text segmentation (TS) aims at dividing long text into coherent segments...
research
01/21/2020

A Hierarchical Location Normalization System for Text

It's natural these days for people to know the local events from massive...
research
04/11/2017

EAST: An Efficient and Accurate Scene Text Detector

Previous approaches for scene text detection have already achieved promi...

Please sign up or login with your details

Forgot password? Click here to reset