On the Robustness of Arabic Speech Dialect Identification

06/01/2023
by   Peter Sullivan, et al.
0

Arabic dialect identification (ADI) tools are an important part of the large-scale data collection pipelines necessary for training speech recognition models. As these pipelines require application of ADI tools to potentially out-of-domain data, we aim to investigate how vulnerable the tools may be to this domain shift. With self-supervised learning (SSL) models as a starting point, we evaluate transfer learning and direct classification from SSL features. We undertake our evaluation under rich conditions, with a goal to develop ADI systems from pretrained models and ultimately evaluate performance on newly collected data. In order to understand what factors contribute to model decisions, we carry out a careful human study of a subset of our data. Our analysis confirms that domain shift is a major challenge for ADI models. We also find that while self-training does alleviate this challenges, it may be insufficient for realistic conditions.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
08/28/2017

MIT-QCRI Arabic Dialect Identification System for the 2017 Multi-Genre Broadcast Challenge

In order to successfully annotate the Arabic speech con- tent found in o...
research
09/21/2017

Speech Recognition Challenge in the Wild: Arabic MGB-3

This paper describes the Arabic MGB-3 Challenge - Arabic Speech Recognit...
research
06/05/2023

N-Shot Benchmarking of Whisper on Diverse Arabic Speech Recognition

Whisper, the recently developed multilingual weakly supervised model, is...
research
08/26/2022

Self-Supervised Human Activity Recognition with Localized Time-Frequency Contrastive Representation Learning

In this paper, we propose a self-supervised learning solution for human ...
research
06/24/2021

QASR: QCRI Aljazeera Speech Resource – A Large Scale Annotated Arabic Speech Corpus

We introduce the largest transcribed Arabic speech corpus, QASR, collect...
research
09/20/2023

Leveraging Data Collection and Unsupervised Learning for Code-switched Tunisian Arabic Automatic Speech Recognition

Crafting an effective Automatic Speech Recognition (ASR) solution for di...

Please sign up or login with your details

Forgot password? Click here to reset