SEP-28k: A Dataset for Stuttering Event Detection From Podcasts With People Who Stutter

02/24/2021
by   Colin Lea, et al.
0

The ability to automatically detect stuttering events in speech could help speech pathologists track an individual's fluency over time or help improve speech recognition systems for people with atypical speech patterns. Despite increasing interest in this area, existing public datasets are too small to build generalizable dysfluency detection systems and lack sufficient annotations. In this work, we introduce Stuttering Events in Podcasts (SEP-28k), a dataset containing over 28k clips labeled with five event types including blocks, prolongations, sound repetitions, word repetitions, and interjections. Audio comes from public podcasts largely consisting of people who stutter interviewing other people who stutter. We benchmark a set of acoustic models on SEP-28k and the public FluencyBank dataset and highlight how simply increasing the amount of training data improves relative detection performance by 28% and 24% F1 on each. Annotations from over 32k clips across both datasets will be publicly released.

READ FULL TEXT
research
03/10/2022

KSoF: The Kassel State of Fluency Dataset – A Therapy Centered Dataset of Stuttering

Stuttering is a complex speech disorder that negatively affects an indiv...
research
11/09/2015

Detecting events and key actors in multi-person videos

Multi-person event recognition is a challenging task, often with many pe...
research
04/07/2022

Detecting Dysfluencies in Stuttering Therapy Using wav2vec 2.0

Stuttering is a varied speech disorder that harms an individual's commun...
research
02/21/2019

The NIGENS General Sound Events Database

Computational auditory scene analysis is gaining interest in the last ye...
research
06/08/2023

Latent Phrase Matching for Dysarthric Speech

Many consumer speech recognition systems are not tuned for people with s...
research
10/27/2022

On Out-of-Distribution Detection for Audio with Deep Nearest Neighbors

Out-of-distribution (OOD) detection is concerned with identifying data p...
research
08/11/2023

Large-Scale Learning on Overlapped Speech Detection: New Benchmark and New General System

Overlapped Speech Detection (OSD) is an important part of speech applica...

Please sign up or login with your details

Forgot password? Click here to reset