Extracting textual overlays from social media videos using neural networks

04/27/2018
by   Adam Słucki, et al.
0

Textual overlays are often used in social media videos as people who watch them without the sound would otherwise miss essential information conveyed in the audio stream. This is why extraction of those overlays can serve as an important meta-data source, e.g. for content classification or retrieval tasks. In this work, we present a robust method for extracting textual overlays from videos that builds up on multiple neural network architectures. The proposed solution relies on several processing steps: keyframe extraction, text detection and text recognition. The main component of our system, i.e. the text recognition module, is inspired by a convolutional recurrent neural network architecture and we improve its performance using synthetically generated dataset of over 600,000 images with text prepared by authors specifically for this task. We also develop a filtering method that reduces the amount of overlapping text phrases using Levenshtein distance and further boosts system's performance. The final accuracy of our solution reaches over 80A pair with state-of-the-art methods.

READ FULL TEXT

page 2

page 5

page 11

research
03/06/2022

Enhanced Sentiment Extraction Architecture for Social Media Content Analysis Using Capsule Networks

Recent research has produced efficient algorithms based on deep learning...
research
10/29/2022

NTULM: Enriching Social Media Text Representations with Non-Textual Units

On social media, additional context is often present in the form of anno...
research
01/09/2023

Cursive Caption Text Detection in Videos

Textual content appearing in videos represents an interesting index for ...
research
07/21/2017

Recurrent Neural Networks for Online Video Popularity Prediction

In this paper, we address the problem of popularity prediction of online...
research
04/24/2014

Unsupervised Text Extraction from G-Maps

This paper represents an text extraction method from Google maps, GIS ma...
research
09/14/2021

Tribrid: Stance Classification with Neural Inconsistency Detection

We study the problem of performing automatic stance classification on so...
research
04/27/2017

A Survey of Neural Network Techniques for Feature Extraction from Text

This paper aims to catalyze the discussions about text feature extractio...

Please sign up or login with your details

Forgot password? Click here to reset