Learning Markov Clustering Networks for Scene Text Detection

05/22/2018
by   Zichuan Liu, et al.
0

A novel framework named Markov Clustering Network (MCN) is proposed for fast and robust scene text detection. MCN predicts instance-level bounding boxes by firstly converting an image into a Stochastic Flow Graph (SFG) and then performing Markov Clustering on this graph. Our method can detect text objects with arbitrary size and orientation without prior knowledge of object size. The stochastic flow graph encode objects' local correlation and semantic information. An object is modeled as strongly connected nodes, which allows flexible bottom-up detection for scale-varying and rotated objects. MCN generates bounding boxes without using Non-Maximum Suppression, and it can be fully parallelized on GPUs. The evaluation on public benchmarks shows that our method outperforms the existing methods by a large margin in detecting multioriented text objects. MCN achieves new state-of-art performance on challenging MSRA-TD500 dataset with precision of 0.88, recall of 0.79 and F-score of 0.83. Also, MCN achieves realtime inference with frame rate of 34 FPS, which is 1.5× speedup when compared with the fastest scene text detection algorithm.

READ FULL TEXT

page 1

page 7

page 8

research
09/30/2018

Correlation Propagation Networks for Scene Text Detection

In this work, we propose a novel hybrid method for scene text detection ...
research
07/25/2022

Optimal Boxes: Boosting End-to-End Scene Text Recognition by Adjusting Annotated Bounding Boxes via Reinforcement Learning

Text detection and recognition are essential components of a modern OCR ...
research
01/04/2018

PixelLink: Detecting Scene Text via Instance Segmentation

Most state-of-the-art scene text detection algorithms are deep learning ...
research
02/25/2018

Multi-Oriented Scene Text Detection via Corner Localization and Region Segmentation

Previous deep learning based state-of-the-art scene text detection metho...
research
01/25/2022

Main Product Detection with Graph Networks for Fashion

Computer vision has established a foothold in the online fashion retail ...
research
11/30/2017

ArbiText: Arbitrary-Oriented Text Detection in Unconstrained Scene

Arbitrary-oriented text detection in the wild is a very challenging task...
research
05/13/2021

LGPMA: Complicated Table Structure Recognition with Local and Global Pyramid Mask Alignment

Table structure recognition is a challenging task due to the various str...

Please sign up or login with your details

Forgot password? Click here to reset