Cursive Caption Text Detection in Videos

01/09/2023
by Ali Mirza, et al.

Textual content appearing in videos is a useful index for semantic retrieval of videos from archives, generation of alerts on live streams, and high-level applications such as opinion mining and content summarization. A key component of such systems is the detection of textual content in video frames, which is the subject of this study. This paper presents a robust technique for detecting textual content in video frames, targeting text in cursive script with Urdu as a case study. Textual regions are detected by fine-tuning object detectors based on deep convolutional neural networks for the specific case of text detection. Since videos commonly contain caption text in multiple scripts, cursive text is distinguished from Latin text using a script-identification module. Finally, detection and script identification are combined in a single end-to-end trainable system. Experiments on a comprehensive dataset of around 11,000 video frames report an F-measure of 0.91.
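As an illustration of how an F-measure for text detection can be computed, the following minimal sketch matches detected boxes to ground-truth boxes by intersection over union (IoU) and derives precision, recall and F-measure. The abstract does not state the evaluation protocol, so the greedy one-to-one matching, the 0.5 IoU threshold, and the names `iou` and `f_measure` are assumptions made for this example and may differ from what the authors used.

```python
from typing import List, Tuple

# A box is (x1, y1, x2, y2) with x1 < x2 and y1 < y2 (assumed convention).
Box = Tuple[float, float, float, float]


def iou(a: Box, b: Box) -> float:
    """Intersection over union of two axis-aligned boxes."""
    ix1, iy1 = max(a[0], b[0]), max(a[1], b[1])
    ix2, iy2 = min(a[2], b[2]), min(a[3], b[3])
    inter = max(0.0, ix2 - ix1) * max(0.0, iy2 - iy1)
    area_a = (a[2] - a[0]) * (a[3] - a[1])
    area_b = (b[2] - b[0]) * (b[3] - b[1])
    union = area_a + area_b - inter
    return inter / union if union > 0 else 0.0


def f_measure(
    detections: List[Box],
    ground_truth: List[Box],
    iou_threshold: float = 0.5,  # assumed threshold, not taken from the paper
) -> Tuple[float, float, float]:
    """Return (precision, recall, F-measure) using greedy one-to-one matching."""
    matched = set()  # indices of ground-truth boxes already claimed
    true_positives = 0
    for det in detections:
        best_j, best_iou = -1, 0.0
        for j, gt in enumerate(ground_truth):
            if j in matched:
                continue
            overlap = iou(det, gt)
            if overlap > best_iou:
                best_j, best_iou = j, overlap
        if best_j >= 0 and best_iou >= iou_threshold:
            matched.add(best_j)
            true_positives += 1
    precision = true_positives / len(detections) if detections else 0.0
    recall = true_positives / len(ground_truth) if ground_truth else 0.0
    total = precision + recall
    f = 2 * precision * recall / total if total > 0 else 0.0
    return precision, recall, f


if __name__ == "__main__":
    gt_boxes = [(0, 0, 10, 10), (20, 20, 30, 30)]
    det_boxes = [(0, 0, 10, 10), (50, 50, 60, 60)]
    print(f_measure(det_boxes, gt_boxes))
```

In this toy case one of two detections matches one of two ground-truth boxes, so precision, recall and F-measure all equal 0.5. A dataset-level score such as the one reported would pool matches over all frames before computing the ratios.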


