Identification of primary and collateral tracks in stuttered speech

03/02/2020
by   Rachid Riad, et al.
0

Disfluent speech has been previously addressed from two main perspectives: the clinical perspective focusing on diagnostic, and the Natural Language Processing (NLP) perspective aiming at modeling these events and detect them for downstream tasks. In addition, previous works often used different metrics depending on whether the input features are text or speech, making it difficult to compare the different contributions. Here, we introduce a new evaluation framework for disfluency detection inspired by the clinical and NLP perspective together with the theory of performance from <cit.> which distinguishes between primary and collateral tracks. We introduce a novel forced-aligned disfluency dataset from a corpus of semi-directed interviews, and present baseline results directly comparing the performance of text-based features (word and span information) and speech-based (acoustic-prosodic information). Finally, we introduce new audio features inspired by the word-based span features. We show experimentally that using these features outperformed the baselines for speech-based predictions on the present dataset.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
04/30/2020

The role of context in neural pitch accent detection in English

Prosody is a rich information source in natural language, serving as a m...
research
06/24/2021

QASR: QCRI Aljazeera Speech Resource – A Large Scale Annotated Arabic Speech Corpus

We introduce the largest transcribed Arabic speech corpus, QASR, collect...
research
08/02/2022

Audio Deepfake Detection Based on a Combination of F0 Information and Real Plus Imaginary Spectrogram Features

Recently, pioneer research works have proposed a large number of acousti...
research
09/07/2021

Countering Online Hate Speech: An NLP Perspective

Online hate speech has caught everyone's attention from the news related...
research
05/23/2022

KOLD: Korean Offensive Language Dataset

Although large attention has been paid to the detection of hate speech, ...
research
06/11/2022

Svadhyaya system for the Second Diagnosing COVID-19 using Acoustics Challenge 2021

This report describes the system used for detecting COVID-19 positives u...
research
07/01/2020

Automated Empathy Detection for Oncology Encounters

Empathy involves understanding other people's situation, perspective, an...

Please sign up or login with your details

Forgot password? Click here to reset