Preliminary Study on SSCF-derived Polar Coordinate for ASR

11/30/2022
by   Sotheara Leang, et al.
0

The transition angles are defined to describe the vowel-to-vowel transitions in the acoustic space of the Spectral Subband Centroids, and the findings show that they are similar among speakers and speaking rates. In this paper, we propose to investigate the usage of polar coordinates in favor of angles to describe a speech signal by characterizing its acoustic trajectory and using them in Automatic Speech Recognition. According to the experimental results evaluated on the BRAF100 dataset, the polar coordinates achieved significantly higher accuracy than the angles in the mixed and cross-gender speech recognitions, demonstrating that these representations are superior at defining the acoustic trajectory of the speech signal. Furthermore, the accuracy was significantly improved when they were utilized with their first and second-order derivatives (Δ, ΔΔ), especially in cross-female recognition. However, the results showed they were not much more gender-independent than the conventional Mel-frequency Cepstral Coefficients (MFCCs).

READ FULL TEXT

page 1

page 2

page 3

page 4

research
12/14/2016

Recurrent Deep Stacking Networks for Speech Recognition

This paper presented our work on applying Recurrent Deep Stacking Networ...
research
07/10/2019

Large-Scale Mixed-Bandwidth Deep Neural Network Acoustic Modeling for Automatic Speech Recognition

In automatic speech recognition (ASR), wideband (WB) and narrowband (NB)...
research
02/06/2020

Robust Multi-channel Speech Recognition using Frequency Aligned Network

Conventional speech enhancement technique such as beamforming has known ...
research
05/09/2019

Analysis of Deep Clustering as Preprocessing for Automatic Speech Recognition of Sparsely Overlapping Speech

Significant performance degradation of automatic speech recognition (ASR...
research
12/12/2019

On Neural Phone Recognition of Mixed-Source ECoG Signals

The emerging field of neural speech recognition (NSR) using electrocorti...
research
10/08/2020

Gender domain adaptation for automatic speech recognition task

This paper is focused on the finetuning of acoustic models for speaker a...
research
02/16/2021

Voice Gender Scoring and Independent Acoustic Characterization of Perceived Masculinity and Femininity

Previous research has found that voices can provide reliable information...

Please sign up or login with your details

Forgot password? Click here to reset