End-to-End and Self-Supervised Learning for ComParE 2022 Stuttering Sub-Challenge

07/20/2022
by   Shakeel Ahmad Sheikh, et al.
0

In this paper, we present end-to-end and speech embedding based systems trained in a self-supervised fashion to participate in the ACM Multimedia 2022 ComParE Challenge, specifically the stuttering sub-challenge. In particular, we exploit the embeddings from the pre-trained Wav2Vec2.0 model for stuttering detection (SD) on the KSoF dataset. After embedding extraction, we benchmark with several methods for SD. Our proposed self-supervised based SD system achieves a UAR of 36.9 which is 31.32 (DeepSpectrum) challenge baseline (CBL). Moreover, we show that concatenating layer embeddings with Mel-frequency cepstral coefficients (MFCCs) features further improves the UAR of 33.81 respectively over the CBL. Finally, we demonstrate that the summing information across all the layers of Wav2Vec2.0 surpasses the CBL by a relative margin of 45.91 Computational Paralinguistics ChallengE

READ FULL TEXT

page 1

page 2

page 3

page 4

research
06/23/2022

Comparing supervised and self-supervised embedding for ExVo Multi-Task learning track

The ICML Expressive Vocalizations (ExVo) Multi-task challenge 2022, focu...
research
06/30/2021

Using Self-Supervised Feature Extractors with Attention for Automatic COVID-19 Detection from Speech

The ComParE 2021 COVID-19 Speech Sub-challenge provides a test-bed for t...
research
04/23/2023

End-to-End Feasible Optimization Proxies for Large-Scale Economic Dispatch

The paper proposes a novel End-to-End Learning and Repair (E2ELR) archit...
research
07/02/2023

End-to-End Out-of-distribution Detection with Self-supervised Sampling

Out-of-distribution (OOD) detection empowers the model trained on the cl...
research
07/22/2022

Scale dependant layer for self-supervised nuclei encoding

Recent developments in self-supervised learning give us the possibility ...
research
12/17/2021

Watermarking Images in Self-Supervised Latent Spaces

We revisit watermarking techniques based on pre-trained deep networks, i...
research
05/13/2022

The ACM Multimedia 2022 Computational Paralinguistics Challenge: Vocalisations, Stuttering, Activity, Mosquitoes

The ACM Multimedia 2022 Computational Paralinguistics Challenge addresse...

Please sign up or login with your details

Forgot password? Click here to reset