AlignNet: A Unifying Approach to Audio-Visual Alignment

02/12/2020
by Jianren Wang, et al.

We present AlignNet, a model that synchronizes videos with reference audio under non-uniform and irregular misalignments. AlignNet learns the end-to-end dense correspondence between each frame of a video and the audio. Our method is designed according to simple and well-established principles: attention, pyramidal processing, warping, and an affinity function. Together with the model, we release Dance50, a dancing dataset for training and evaluation. Qualitative, quantitative, and subjective evaluation results on dance-music alignment and speech-lip alignment demonstrate that our method far outperforms state-of-the-art methods. Project video and code are available at https://jianrenw.github.io/AlignNet.
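
To make the idea of a dense correspondence and warping concrete, here is a minimal sketch in plain Python. It is not the AlignNet implementation: it assumes the per-frame correspondence is already given (in the paper it would be predicted by the network), and the function name `warp_audio_to_video`, the scalar features, and the toy data are all hypothetical. It only shows how a non-uniform mapping from video frames to fractional audio positions can be used to resample an audio feature sequence by linear interpolation.

```python
from typing import List, Sequence


def warp_audio_to_video(audio_feats: Sequence[float],
                        correspondence: Sequence[float]) -> List[float]:
    """Resample audio features at the (fractional) audio position
    assigned to each video frame, using linear interpolation.

    Illustrative only: `correspondence[i]` is assumed to be the audio
    position matched to video frame i.
    """
    if not audio_feats:
        raise ValueError("audio_feats must be non-empty")
    last = len(audio_feats) - 1
    warped = []
    for pos in correspondence:
        # Clamp positions that fall outside the audio track.
        pos = min(max(pos, 0.0), float(last))
        lo = int(pos)              # index of the left neighbour
        hi = min(lo + 1, last)     # index of the right neighbour
        frac = pos - lo            # interpolation weight toward `hi`
        warped.append((1.0 - frac) * audio_feats[lo] + frac * audio_feats[hi])
    return warped


# Hypothetical toy data: five audio feature values and a non-uniform
# correspondence for four video frames.
audio = [0.0, 10.0, 20.0, 30.0, 40.0]
mapping = [0.0, 0.5, 2.0, 3.75]
aligned = warp_audio_to_video(audio, mapping)
```

Because the mapping is stored per frame rather than as a single global offset, the same routine covers both a constant shift and the irregular, locally varying misalignments the abstract describes.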

research
07/28/2020

A Hybrid Approach to Audio-to-Score Alignment

Audio-to-score alignment aims at generating an accurate mapping between ...
research
03/30/2022

End to End Lip Synchronization with a Temporal AutoEncoder

We study the problem of syncing the lip movement in a video with the aud...
research
02/18/2019

End-to-end Lyrics Alignment for Polyphonic Music Using an Audio-to-Character Recognition Model

Time-aligned lyrics can enrich the music listening experience by enablin...
research
11/15/2020

Learning Frame Similarity using Siamese networks for Audio-to-Score Alignment

Audio-to-score alignment aims at generating an accurate mapping between ...
research
08/19/2018

Dynamic Temporal Alignment of Speech to Lips

Many speech segments in movies are re-recorded in a studio during postpr...
research
10/08/2021

Phone-to-audio alignment without text: A Semi-supervised Approach

The task of phone-to-audio alignment has many applications in speech res...
research
02/09/2022

AIVC: Artificial Intelligence based Video Codec

This paper introduces AIVC, an end-to-end neural video codec. It is base...
