Detecting Oriented Text in Natural Images by Linking Segments

03/19/2017
by   Baoguang Shi, et al.
0

Most state-of-the-art text detection methods are specific to horizontal Latin text and are not fast enough for real-time applications. We introduce Segment Linking (SegLink), an oriented text detection method. The main idea is to decompose text into two locally detectable elements, namely segments and links. A segment is an oriented box covering a part of a word or text line; A link connects two adjacent segments, indicating that they belong to the same word or text line. Both elements are detected densely at multiple scales by an end-to-end trained, fully-convolutional neural network. Final detections are produced by combining segments connected by links. Compared with previous methods, SegLink improves along the dimensions of accuracy, speed, and ease of training. It achieves an f-measure of 75.0 Incidental (Challenge 4) benchmark, outperforming the previous best by a large margin. It runs at over 20 FPS on 512x512 images. Moreover, without modification, SegLink is able to detect long lines of non-Latin text, such as Chinese.

READ FULL TEXT

page 1

page 2

page 4

page 7

page 8

research
02/26/2020

PuzzleNet: Scene Text Detection by Segment Context Graph Learning

Recently, a series of decomposition-based scene text detection methods h...
research
09/11/2020

TP-LSD: Tri-Points Based Line Segment Detector

This paper proposes a novel deep convolutional model, Tri-Points Based L...
research
04/22/2016

Synthetic Data for Text Localisation in Natural Images

In this paper we introduce a new method for text detection in natural im...
research
03/26/2021

YOLinO: Generic Single Shot Polyline Detection in Real Time

The detection of polylines in images is usually either bound to branchle...
research
10/08/2018

High-quality Ellipse Detection Based on Arc-support Line Segments

Over the years many ellipse detection algorithms spring up and are studi...
research
04/22/2021

Fully Convolutional Line Parsing

We present a one-stage Fully Convolutional Line Parsing network (F-Clip)...
research
03/09/2022

On Linking Level Segments

An increasingly common area of study in procedural content generation is...

Please sign up or login with your details

Forgot password? Click here to reset