Lombard Effect for Bilingual Speakers in Cantonese and English: importance of spectro-temporal features

04/14/2022
by Maximilian Karl Scharf, et al.

Computational models of speech recognition have a long tradition in hearing research, where they are used to better understand the mechanisms underlying speech perception and the contribution of different signal features. Because speech must be recognized in a diverse range of situations, these models need to generalize across many acoustic conditions, speakers, and languages. This contribution examines the importance of different features for predicting the recognition of plain and Lombard speech in English and Cantonese, in stationary and modulated noise. Cantonese is a tonal language that encodes information in spectro-temporal features, whereas the Lombard effect is known to be associated with spectral changes in the speech signal. These contrasting properties of tonal languages and the Lombard effect form an interesting basis for assessing speech recognition models. Here, a model based on automatic speech recognition (ASR) that uses either spectral or spectro-temporal features is evaluated against empirical data. The results indicate that spectro-temporal features are crucial for predicting the speaker-specific speech recognition threshold SRT_50 in both Cantonese and English, and for accounting for the improvement of speech recognition in modulated noise, whereas effects due to Lombard speech can already be predicted by spectral features.


