Efficient Video Scene Text Spotting: Unifying Detection, Tracking, and Recognition

03/08/2019
by   Zhanzhan Cheng, et al.
0

This paper proposes an unified framework for efficiently spotting scene text in videos. The method localizes and tracks text in each frame, and recognizes each tracked text stream one-time. Specifically, we first train a spatial-temporal text detector for localizing text regions in the sequential frames. Secondly, a well-designed text tracker is trained for grouping the localized text regions into corresponding cropped text streams. To efficiently spot video text, we recognize each tracked text stream one-time with a text region quality scoring mechanism instead of identifying the cropped text regions one-by-one. Experiments on two public benchmarks demonstrate that our method achieves impressive performance.

READ FULL TEXT
research
03/29/2021

Tracking Based Semi-Automatic Annotation for Scene Text Videos

Recently, video scene text detection has received increasing attention d...
research
03/20/2022

End-to-End Video Text Spotting with Transformer

Recent video text spotting methods usually require the three-staged pipe...
research
03/31/2023

Video text tracking for dense and small text based on pp-yoloe-r and sort algorithm

Although end-to-end video text spotting methods based on Transformer can...
research
11/19/2020

Towards Spatio-Temporal Video Scene Text Detection via Temporal Clustering

With only bounding-box annotations in the spatial domain, existing video...
research
06/21/2020

Lyric Video Analysis Using Text Detection and Tracking

We attempt to recognize and track lyric words in lyric videos. Lyric vid...
research
05/26/2020

A New Unified Method for Detecting Text from Marathon Runners and Sports Players in Video

Detecting text located on the torsos of marathon runners and sports play...
research
09/11/2017

Exploring Geometric Property Thresholds For Filtering Non-Text Regions In A Connected Component Based Text Detection Application

Automated text detection is a difficult computer vision task. In order t...

Please sign up or login with your details

Forgot password? Click here to reset