Interpretable Dysarthric Speaker Adaptation based on Optimal-Transport

03/14/2022
by Rosanna Turrisi, et al.

This work addresses the mismatch between the distributions of training data (source) and testing data (target) in the challenging context of dysarthric speech recognition. We focus on Speaker Adaptation (SA) in command speech recognition, where data from multiple sources (i.e., multiple speakers) are available. Specifically, we propose an unsupervised Multi-Source Domain Adaptation (MSDA) algorithm based on optimal transport, called MSDA via Weighted Joint Optimal Transport (MSDA-WJDOT). We achieve a Command Error Rate relative reduction of 16% [...] best competitor method, respectively. The strength of the proposed approach is that, unlike any other existing SA method, it offers an interpretable model that can also be exploited, in this context, to diagnose dysarthria without any specific training. Indeed, it provides a closeness measure between the target and the source speakers that reflects their similarity in terms of speech characteristics. Based on the similarity between the target speaker and the healthy/dysarthric source speakers, we then define a healthy/dysarthric score for the target speaker, which we leverage to perform dysarthria detection. This approach does not require any additional training and achieves 95% accuracy in dysarthria diagnosis.
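To make the scoring idea concrete, the following is a minimal sketch of how a healthy/dysarthric score could be derived from target-to-source closeness weights. It is not the paper's definition: the abstract does not give the exact formula, so the weight-summing rule, the normalization, the decision threshold, and the names `dysarthria_scores` and `detect_dysarthria` are all assumptions made for illustration.

```python
from typing import Sequence, Tuple


def dysarthria_scores(
    weights: Sequence[float], is_dysarthric: Sequence[bool]
) -> Tuple[float, float]:
    """Return (healthy_score, dysarthric_score) for one target speaker.

    weights: hypothetical closeness weights between the target speaker and
        each source speaker (non-negative; normalized here to sum to 1).
    is_dysarthric: True where the corresponding source speaker is dysarthric.

    Assumption: each score is the total normalized weight assigned to the
    source speakers of that group. The paper may define the scores differently.
    """
    if len(weights) != len(is_dysarthric):
        raise ValueError("weights and is_dysarthric must have the same length")
    if any(w < 0 for w in weights):
        raise ValueError("weights must be non-negative")
    total = sum(weights)
    if total <= 0:
        raise ValueError("at least one weight must be positive")

    # Mass placed on dysarthric sources, as a fraction of the total mass.
    dysarthric = sum(w for w, d in zip(weights, is_dysarthric) if d) / total
    return 1.0 - dysarthric, dysarthric


def detect_dysarthria(weights: Sequence[float], is_dysarthric: Sequence[bool]) -> bool:
    """Flag the target as dysarthric when the dysarthric score is larger.

    Assumption: a simple comparison of the two scores; no threshold is
    taken from the paper.
    """
    healthy, dysarthric = dysarthria_scores(weights, is_dysarthric)
    return dysarthric > healthy


if __name__ == "__main__":
    # Illustrative weights only, not results from the paper.
    example_weights = [0.125, 0.125, 0.5, 0.25]
    example_labels = [False, False, True, True]
    print(dysarthria_scores(example_weights, example_labels))
    print(detect_dysarthria(example_weights, example_labels))
```

Under these assumptions no extra classifier is trained: the decision reuses only the weights produced during adaptation, which is the property the abstract highlights.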
