A Novel Windowing Technique for Efficient Computation of MFCC for Speaker Recognition

06/12/2012
by Md Sahidullah, et al.

In this paper, we propose a novel family of windowing techniques for computing Mel-frequency cepstral coefficients (MFCCs) for automatic speaker recognition from speech. The proposed method is based on a fundamental property of the discrete-time Fourier transform (DTFT) related to differentiation in the frequency domain. A classical windowing scheme, such as the Hamming window, is modified to obtain derivatives of the discrete-time Fourier transform coefficients. We show mathematically that the slope and phase of the power spectrum are inherently incorporated in the newly computed cepstrum. Speaker recognition systems based on the proposed family of window functions are shown to attain substantial and consistent performance improvements over the baseline single-tapered Hamming window as well as the recently proposed multitaper windowing technique.
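The abstract does not spell out the proposed window family, so the following is only a minimal sketch of the DTFT property it cites, not the authors' implementation. For X(ω) = Σ x[n] e^{-jωn}, differentiating with respect to ω gives dX/dω = -j Σ n x[n] e^{-jωn}. Multiplying a Hamming window by the sample index n therefore yields the frequency derivative of the windowed spectrum directly. The frame length, random test signal, frequency grid, and helper name `dtft` are assumptions chosen for illustration.

```python
import numpy as np

def dtft(x, omegas):
    """Evaluate X(w) = sum_n x[n] * exp(-j*w*n) at each frequency in omegas."""
    n = np.arange(len(x))
    return np.exp(-1j * np.outer(omegas, n)) @ x

rng = np.random.default_rng(0)
N = 160                          # hypothetical frame length (e.g. 20 ms at 8 kHz)
frame = rng.standard_normal(N)   # stand-in for one speech frame
n = np.arange(N)
w = np.hamming(N)                # classical Hamming window

omegas = np.linspace(0.1, 3.0, 50)   # arbitrary frequencies in rad/sample
h = 1e-6                             # step for the numerical derivative

# Reference: central-difference derivative of the Hamming-windowed DTFT.
dX_numeric = (dtft(frame * w, omegas + h) - dtft(frame * w, omegas - h)) / (2 * h)

# Same derivative from a single transform with the modified window n * w[n].
dX_window = -1j * dtft(frame * (n * w), omegas)

# Relative disagreement between the two estimates (expected to be tiny).
rel_err = np.max(np.abs(dX_numeric - dX_window)) / np.max(np.abs(dX_window))
print(f"relative error: {rel_err:.2e}")
```

In a practical front end the same idea would presumably be applied with an FFT of the frame multiplied by the modified window, so the derivative costs one extra transform per frame rather than a finite-difference approximation.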


