Exact, Parallelizable Dynamic Time Warping Alignment with Linear Memory

08/04/2020
by   Christopher Tralie, et al.
0

Audio alignment is a fundamental preprocessing step in many MIR pipelines. For two audio clips with M and N frames, respectively, the most popular approach, dynamic time warping (DTW), has O(MN) requirements in both memory and computation, which is prohibitive for frame-level alignments at reasonable rates. To address this, a variety of memory efficient algorithms exist to approximate the optimal alignment under the DTW cost. To our knowledge, however, no exact algorithms exist that are guaranteed to break the quadratic memory barrier. In this work, we present a divide and conquer algorithm that computes the exact globally optimal DTW alignment using O(M+N) memory. Its runtime is still O(MN), trading off memory for a 2x increase in computation. However, the algorithm can be parallelized up to a factor of min(M, N) with the same memory constraints, so it can still run more efficiently than the textbook version with an adequate GPU. We use our algorithm to compute exact alignments on a collection of orchestral music, which we use as ground truth to benchmark the alignment accuracy of several popular approximate alignment schemes at scales that were not previously possible.

READ FULL TEXT

page 3

page 9

research
07/28/2020

A Hybrid Approach to Audio-to-Score Alignment

Audio-to-score alignment aims at generating an accurate mapping between ...
research
07/15/2018

The Globally Optimal Reparameterization Algorithm: an Alternative to Fast Dynamic Time Warping for Action Recognition in Video Sequences

Signal alignment has become a popular problem in robotics due in part to...
research
03/19/2023

Deep Declarative Dynamic Time Warping for End-to-End Learning of Alignment Paths

This paper addresses learning end-to-end models for time series data tha...
research
11/15/2020

Learning Frame Similarity using Siamese networks for Audio-to-Score Alignment

Audio-to-score alignment aims at generating an accurate mapping between ...
research
11/13/2017

Audio-to-score alignment of piano music using RNN-based automatic music transcription

We propose a framework for audio-to-score alignment on piano performance...
research
12/08/2018

Estimates of the Reconstruction Error in Partially Redressed Warped Frames Expansions

In recent work, redressed warped frames have been introduced for the ana...
research
01/18/2022

Improve Sentence Alignment by Divide-and-conquer

In this paper, we introduce a divide-and-conquer algorithm to improve se...

Please sign up or login with your details

Forgot password? Click here to reset