A Review of Audio Features and Statistical Models Exploited for Voice Pattern Design

02/24/2015
by   Ngoc Q. K. Duong, et al.
0

Audio fingerprinting, also named as audio hashing, has been well-known as a powerful technique to perform audio identification and synchronization. It basically involves two major steps: fingerprint (voice pattern) design and matching search. While the first step concerns the derivation of a robust and compact audio signature, the second step usually requires knowledge about database and quick-search algorithms. Though this technique offers a wide range of real-world applications, to the best of the authors' knowledge, a comprehensive survey of existing algorithms appeared more than eight years ago. Thus, in this paper, we present a more up-to-date review and, for emphasizing on the audio signal processing aspect, we focus our state-of-the-art survey on the fingerprint design step for which various audio features and their tractable statistical models are discussed.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
05/16/2023

Robust and lightweight audio fingerprint for Automatic Content Recognition

This research paper presents a novel audio fingerprinting system for Aut...
research
07/15/2020

A survey and an extensive evaluation of popular audio declipping methods

Dynamic range limitations in signal processing often lead to clipping, o...
research
08/29/2023

Audio Deepfake Detection: A Survey

Audio deepfake detection is an emerging active topic. A growing number o...
research
10/22/2020

Neural Audio Fingerprint for High-specific Audio Retrieval based on Contrastive Learning

Most of existing audio fingerprinting systems have limitations to be use...
research
11/29/2022

Neural Vocoder Feature Estimation for Dry Singing Voice Separation

Singing voice separation (SVS) is a task that separates singing voice au...
research
12/30/2009

Writer Identification Using Inexpensive Signal Processing Techniques

We propose to use novel and classical audio and text signal-processing a...
research
10/05/2021

Manifold learning-supported estimation of relative transfer functions for spatial filtering

Many spatial filtering algorithms used for voice capture in, e.g., telec...

Please sign up or login with your details

Forgot password? Click here to reset