Speech Detection For Child-Clinician Conversations In Danish For Low-Resource In-The-Wild Conditions: A Case Study

04/25/2022
by Sneha Das, et al.

Use of speech models for automatic speech processing tasks can improve efficiency in screening, analysis, diagnosis and treatment in medicine and psychiatry. However, the performance of speech pre-processing tasks like segmentation and diarization can drop considerably on in-the-wild clinical data, particularly when the target dataset comprises atypical speech. In this paper we study the performance of a pre-trained speech model on a dataset comprising child-clinician conversations in Danish with respect to the classification threshold. Since we do not have access to sufficient labelled data, we propose few-instance threshold adaptation, wherein we employ the first minutes of the speech conversation to obtain the optimum classification threshold. Through our work in this paper, we learned that the model with the default classification threshold performs worse on children from the patient group. Furthermore, the error rates of the model are directly correlated with the severity of diagnosis in the patients. Lastly, our study on few-instance adaptation shows that three minutes of clinician-child conversation is sufficient to obtain the optimum classification threshold.
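The few-instance threshold adaptation described in the abstract can be illustrated with a minimal sketch. The abstract does not specify how the optimum threshold is selected, so the sketch below assumes a simple grid search that minimizes the frame-level error rate over the labelled first minutes of a conversation. The function names (`error_rate`, `adapt_threshold`), the parameters, and the candidate grid are all hypothetical and are not taken from the paper.

```python
from typing import List, Optional, Sequence


def error_rate(probs: Sequence[float], labels: Sequence[int], threshold: float) -> float:
    """Fraction of frames whose thresholded prediction disagrees with the label."""
    if len(probs) != len(labels) or not labels:
        raise ValueError("probs and labels must be non-empty and of equal length")
    wrong = sum(int(p >= threshold) != y for p, y in zip(probs, labels))
    return wrong / len(labels)


def adapt_threshold(
    probs: Sequence[float],
    labels: Sequence[int],
    frames_per_second: float,
    adapt_minutes: float = 3.0,
    candidates: Optional[List[float]] = None,
) -> float:
    """Pick a threshold using only the first `adapt_minutes` of a conversation.

    Assumption: the selection criterion is the frame-level error rate and the
    search is a plain grid search; the paper may use a different criterion.
    """
    n = int(adapt_minutes * 60 * frames_per_second)
    head_probs, head_labels = probs[:n], labels[:n]
    if not head_labels:
        raise ValueError("no labelled frames in the adaptation window")
    if candidates is None:
        # Hypothetical grid: 0.01, 0.02, ..., 0.99
        candidates = [i / 100 for i in range(1, 100)]
    # min() returns the first candidate that attains the lowest error rate.
    return min(candidates, key=lambda t: error_rate(head_probs, head_labels, t))


if __name__ == "__main__":
    # Synthetic example: the model is under-confident on speech frames, so a
    # default threshold of 0.5 would label every frame as non-speech.
    labels = [1, 0] * 60                      # 120 frames, alternating speech / non-speech
    probs = [0.3 if y == 1 else 0.1 for y in labels]
    t = adapt_threshold(probs, labels, frames_per_second=1, adapt_minutes=1)
    print("adapted threshold:", t)
    print("error at default 0.5:", error_rate(probs, labels, 0.5))
    print("error at adapted threshold:", error_rate(probs, labels, t))
```

In this sketch only the labelled adaptation window is used to choose the threshold; the remainder of the conversation would then be classified with the adapted value.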

research
08/20/2023

Indonesian Automatic Speech Recognition with XLSR-53

This study focuses on the development of Indonesian Automatic Speech Rec...
research
06/01/2023

Towards hate speech detection in low-resource languages: Comparing ASR to acoustic word embeddings on Wolof and Swahili

We consider hate speech detection through keyword spotting on radio broa...
research
05/31/2021

Parkinsonian Chinese Speech Analysis towards Automatic Classification of Parkinson's Disease

Speech disorders often occur at the early stage of Parkinson's disease (...
research
10/09/2021

Personalized Automatic Speech Recognition Trained on Small Disordered Speech Datasets

This study investigates the performance of personalized automatic speech...
research
10/27/2022

Simulating realistic speech overlaps improves multi-talker ASR

Multi-talker automatic speech recognition (ASR) has been studied to gene...
research
03/21/2022

Multi-class versus One-class classifier in spontaneous speech analysis oriented to Alzheimer Disease diagnosis

Most of medical developments require the ability to identify samples tha...
research
05/30/2018

Learning multiple non-mutually-exclusive tasks for improved classification of inherently ordered labels

Medical image classification involves thresholding of labels that repres...