I-vector Based Within Speaker Voice Quality Identification on connected speech

02/15/2021
by Chuyao Feng et al.
Voice disorders affect a large portion of the population, especially heavy voice users such as teachers and call-center workers. Most voice disorders can be treated effectively with behavioral voice therapy, which teaches patients to replace problematic, habituated voice production mechanics with optimal voice production techniques, yielding improved voice quality. However, treatment often fails because, between therapy sessions, when clinician feedback is unavailable, patients have difficulty distinguishing their habitual voice from the target technique on their own. With the long-term aim of extending clinician feedback to extra-clinical settings, we therefore built two systems that automatically differentiate voice qualities produced by the same individual. We hypothesized that (1) a system based on i-vectors could classify these qualities as if they represented different speakers and (2) such a system would outperform one based on traditional voice signal processing algorithms. Training recordings were provided by thirteen amateur actors, each producing five perceptually different voice qualities in connected speech: normal, breathy, fry, twang, and hyponasal. As hypothesized, the i-vector system outperformed the acoustic measure system in classification accuracy (97.5% versus 77.2%). This result is expected because the i-vector system maps features to an integrated space that represents each voice quality better than the 22-feature space of the baseline system. An i-vector based system therefore has potential for clinical application in voice therapy and voice training.
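The idea of classifying voice qualities "as if they represent different speakers" can be illustrated with a minimal sketch. It assumes each utterance has already been reduced to a fixed-length vector (standing in for an i-vector) and scores it by cosine similarity against one averaged vector per voice quality, a back-end commonly used in i-vector speaker identification. The abstract does not state which scoring method the authors used, so the function names, the centroid approach, and the toy vectors in the usage note are all illustrative assumptions rather than the paper's implementation.

```python
import numpy as np


def length_normalize(x):
    """Scale a vector (or each row of a matrix) to unit Euclidean length."""
    x = np.asarray(x, dtype=float)
    norms = np.linalg.norm(x, axis=-1, keepdims=True)
    return x / np.maximum(norms, 1e-12)  # guard against division by zero


def fit_centroids(vectors, labels):
    """Build one model vector per label by averaging its normalized vectors.

    Hypothetical stand-in for enrolling each voice quality as a 'speaker'.
    """
    vectors = length_normalize(vectors)
    labels = np.asarray(labels)
    classes = sorted(set(labels.tolist()))
    centroids = np.stack([vectors[labels == c].mean(axis=0) for c in classes])
    return classes, length_normalize(centroids)


def classify(vector, classes, centroids):
    """Return the label whose centroid has the highest cosine similarity."""
    scores = centroids @ length_normalize(vector)
    return classes[int(np.argmax(scores))]
```

As a usage example, calling `fit_centroids` on a handful of labeled vectors (for instance two-dimensional toy vectors labeled "breathy" and "fry") returns the sorted label list and the unit-length centroids, and `classify` then assigns a new vector to the closest quality. Real i-vectors would come from a separately trained extractor and typically have several hundred dimensions.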

