DeepAI AI Chat
Log In Sign Up

Robust Text Detection in Natural Scene Images

by   Xu-Cheng Yin, et al.
English 设为首页

Text detection in natural scene images is an important prerequisite for many content-based image analysis tasks. In this paper, we propose an accurate and robust method for detecting texts in natural scene images. A fast and effective pruning algorithm is designed to extract Maximally Stable Extremal Regions (MSERs) as character candidates using the strategy of minimizing regularized variations. Character candidates are grouped into text candidates by the ingle-link clustering algorithm, where distance weights and threshold of the clustering algorithm are learned automatically by a novel self-training distance metric learning algorithm. The posterior probabilities of text candidates corresponding to non-text are estimated with an character classifier; text candidates with high probabilities are then eliminated and finally texts are identified with a text classifier. The proposed system is evaluated on the ICDAR 2011 Robust Reading Competition dataset; the f measure is over 76 of 71 show that our proposed method can outperform the other competitive method with the f measure increase of over 9 percent. Finally, we have setup an online demo of our proposed scene text detection system at


page 4

page 9


Automatic text extraction and character segmentation using maximally stable extremal regions

Text detection and segmentation is an important prerequisite for many co...

Scene Text Eraser

The character information in natural scene images contains various perso...

Incidental Scene Text Understanding: Recent Progresses on ICDAR 2015 Robust Reading Competition Challenge 4

Different from focused texts present in natural images, which are captur...

Urdu text in natural scene images: a new dataset and preliminary text detection

Text detection in natural scene images for content analysis is an intere...

Text Flow: A Unified Text Detection System in Natural Scene Images

The prevalent scene text detection approach follows four sequential step...

Semi-Bagging Based Deep Neural Architecture to Extract Text from High Entropy Images

Extracting texts of various size and shape from images containing multip...

Code Repositories



view repo