Exploring British Accents: Modeling the Trap-Bath Split with Functional Data Analysis

08/27/2020
by Aranya Koshy, et al.
The sound of our speech is influenced by the places we come from. Great Britain contains a wide variety of distinctive accents which are of interest to linguists. In particular, the "a" vowel in words like "class" is pronounced differently in the North and the South. Speech recordings of this vowel can be represented as formant curves or as Mel-frequency cepstral coefficient (MFCC) curves. Functional data analysis and generalized additive models offer techniques to model the variation in these curves. Our first aim is to model the difference between typical Northern and Southern vowels by training two classifiers on the North-South Class Vowels dataset collected for this paper (Koshy 2020). Our second aim is to visualize the geographical variation of accents in Great Britain. For this we use speech recordings from a second dataset, the British National Corpus (BNC) audio edition (Coleman et al. 2012). The trained models are used to predict the accent of speakers in the BNC, and we then model the geographical patterns in these predictions using a soap film smoother. This work demonstrates a flexible and interpretable approach to modeling phonetic accent variation in speech recordings.
