Acoustic absement in detail: Quantifying acoustic differences across time-series representations of speech data

04/12/2023
by   Matthew C. Kelley, et al.
0

The speech signal is a consummate example of time-series data. The acoustics of the signal change over time, sometimes dramatically. Yet, the most common type of comparison we perform in phonetics is between instantaneous acoustic measurements, such as formant values. In the present paper, I discuss the concept of absement as a quantification of differences between two time-series. I then provide an experimental example of absement applied to phonetic analysis for human and/or computer speech recognition. The experiment is a template-based speech recognition task, using dynamic time warping to compare the acoustics between recordings of isolated words. A recognition accuracy of 57.9 using absement as a tool, as well as the implications of using acoustics-only models of spoken word recognition with the word as the smallest discrete linguistic unit.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
06/10/2019

Word-level Speech Recognition with a Dynamic Lexicon

We propose a direct-to-word sequence model with a dynamic lexicon. Our w...
research
02/18/2019

Learned In Speech Recognition: Contextual Acoustic Word Embeddings

End-to-end acoustic-to-word speech recognition models have recently gain...
research
10/11/2020

A Case-Study on the Impact of Dynamic Time Warping in Time Series Regression

It is well understood that Dynamic Time Warping (DTW) is effective in re...
research
06/16/2021

Collaborative Training of Acoustic Encoders for Speech Recognition

On-device speech recognition requires training models of different sizes...
research
02/18/2018

Visual-Only Recognition of Normal, Whispered and Silent Speech

Silent speech interfaces have been recently proposed as a way to enable ...
research
05/08/2019

On the representation of speech and music

In most automatic speech recognition (ASR) systems, the audio signal is ...
research
06/21/2021

Speech prosody and remote experiments: a technical report

The aim of this paper is twofold. First, we present a review of differen...

Please sign up or login with your details

Forgot password? Click here to reset