Lyric Video Analysis Using Text Detection and Tracking

06/21/2020
by Shota Sakaguchi, et al.

We attempt to recognize and track lyric words in lyric videos. A lyric video is a music video that displays the lyric words of a song. Its main characteristic is that the lyric words appear in the frames in synchrony with the music. Recognizing and tracking the lyric words is difficult because (1) the words are often decorated and geometrically distorted and (2) the words move arbitrarily and drastically within the video frame. The purpose of this paper is to analyze the motion of the lyric words in lyric videos, as a first step toward automatic lyric video generation. To analyze this motion, we first apply a state-of-the-art scene text detector and recognizer to each video frame. Then, lyric-frame matching is performed to establish the optimal correspondence between the lyric words and the frames. After determining the motion trajectories of individual lyric words from this correspondence, we analyze the trajectories by k-medoids clustering and dynamic time warping (DTW).
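The final analysis step named in the abstract, clustering word trajectories with k-medoids under a DTW distance, can be illustrated with a minimal sketch. This is not the authors' implementation: the representation of a trajectory as a list of (x, y) positions, the Euclidean local cost, the simple alternating medoid update, the deterministic initialization, and the names `dtw_distance` and `k_medoids` are all assumptions made for illustration.

```python
import math


def dtw_distance(a, b):
    """DTW distance between two trajectories, each a list of (x, y) points.

    Assumption: the local cost is the Euclidean distance between points.
    """
    n, m = len(a), len(b)
    inf = float("inf")
    # cost[i][j] = minimal accumulated cost of aligning a[:i] with b[:j]
    cost = [[inf] * (m + 1) for _ in range(n + 1)]
    cost[0][0] = 0.0
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            d = math.hypot(a[i - 1][0] - b[j - 1][0], a[i - 1][1] - b[j - 1][1])
            cost[i][j] = d + min(cost[i - 1][j], cost[i][j - 1], cost[i - 1][j - 1])
    return cost[n][m]


def assign(dist, medoids):
    """Label every item with the index of its nearest medoid."""
    return [
        min(range(len(medoids)), key=lambda c: dist[i][medoids[c]])
        for i in range(len(dist))
    ]


def k_medoids(dist, k, max_iter=100):
    """Alternating k-medoids on a precomputed distance matrix.

    Assumption: the first k items serve as initial medoids, which keeps
    the sketch deterministic; real use would try several initializations.
    """
    medoids = list(range(k))
    for _ in range(max_iter):
        labels = assign(dist, medoids)
        new_medoids = []
        for c in range(k):
            members = [i for i, lab in enumerate(labels) if lab == c]
            if not members:
                # Keep the old medoid if its cluster became empty.
                new_medoids.append(medoids[c])
                continue
            # The new medoid minimizes the total distance to its cluster.
            best = min(members, key=lambda i: sum(dist[i][j] for j in members))
            new_medoids.append(best)
        if new_medoids == medoids:
            break
        medoids = new_medoids
    return medoids, assign(dist, medoids)


# Hypothetical toy trajectories: two horizontal and two vertical motions,
# with different lengths to show that DTW handles unequal durations.
trajectories = [
    [(0, 0), (1, 0), (2, 0), (3, 0)],
    [(0, 0), (0, 1), (0, 2), (0, 3)],
    [(0, 0.1), (1.5, 0.1), (3, 0.1)],
    [(0.1, 0), (0.1, 1.5), (0.1, 3)],
]
dist = [[dtw_distance(p, q) for q in trajectories] for p in trajectories]
medoids, labels = k_medoids(dist, k=2)
```

DTW is a natural fit here because words stay on screen for different numbers of frames, so trajectories differ in length. k-medoids needs only pairwise distances rather than vector means, which is why it can be paired with DTW directly.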


research
11/21/2022

Video Background Music Generation: Dataset, Method and Evaluation

Music is essential when editing videos, but selecting music manually is ...
research
01/06/2014

Bangla Text Recognition from Video Sequence: A New Focus

Extraction and recognition of Bangla text from video frame images is cha...
research
03/29/2021

Tracking Based Semi-Automatic Annotation for Scene Text Videos

Recently, video scene text detection has received increasing attention d...
research
03/08/2019

Efficient Video Scene Text Spotting: Unifying Detection, Tracking, and Recognition

This paper proposes a unified framework for efficiently spotting scene ...
research
06/14/2020

Adaptively Meshed Video Stabilization

Video stabilization is essential for improving visual quality of shaky v...
research
06/12/2018

A Timed Version of the Plactic Monoid

Timed words are words where letters of the alphabet come with time stamp...
research
08/12/2023

A One-dimensional HEVC video steganalysis method using the Optimality of Predicted Motion Vectors

Among steganalysis techniques, detection against motion vector (MV) doma...