Hearings and mishearings: decrypting the spoken word

09/01/2020
by Anita Mehta, et al.

We propose a model of the speech perception of individual words in the presence of mishearings. This phenomenological approach is based on concepts used in linguistics, and provides a formalism that is universal across languages. We put forward an efficient two-parameter form for the word length distribution, and introduce a simple representation of mishearings, which we use in our subsequent modelling of word recognition. In a context-free scenario, word recognition often occurs via anticipation when, part-way into a word, we can correctly guess its full form. We give a quantitative estimate of this anticipation threshold when no mishearings occur, in terms of model parameters. As might be expected, the whole anticipation effect disappears when there are sufficiently many mishearings. Our global approach to the problem of speech perception is in the spirit of an optimisation problem. We show for instance that speech perception is easy when the word length is less than a threshold, to be identified with a static transition, and hard otherwise. We extend this to the dynamics of word recognition, proposing an intuitive approach highlighting the distinction between individual, isolated mishearings and clusters of contiguous mishearings. At least in some parameter range, a dynamical transition is manifest well before the static transition is reached, as is the case for many other examples of complex systems.
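
The anticipation effect described above, in the case with no mishearings, can be illustrated with a toy calculation. The sketch below is not the authors' model: it assumes a small hypothetical lexicon (`TOY_LEXICON`) and a hypothetical helper `anticipation_point`, which returns the number of letters a listener must hear before only one word in the lexicon remains compatible with what has been heard so far.

```python
from typing import List, Optional

# Hypothetical toy lexicon, chosen only for illustration (entries must be unique).
TOY_LEXICON: List[str] = ["hear", "hearing", "heart", "hello", "help", "word"]


def anticipation_point(word: str, lexicon: List[str]) -> Optional[int]:
    """Return the smallest prefix length at which `word` is the only
    lexicon entry consistent with the letters heard so far.

    Returns None if the word never becomes unique, for example when it is
    a prefix of a longer entry or is absent from the lexicon.
    Assumes every letter is heard correctly (no mishearings).
    """
    for n in range(1, len(word) + 1):
        prefix = word[:n]
        # All lexicon entries still compatible with the heard prefix.
        candidates = [w for w in lexicon if w.startswith(prefix)]
        if candidates == [word]:
            return n
    return None


if __name__ == "__main__":
    for w in TOY_LEXICON:
        print(w, anticipation_point(w, TOY_LEXICON))
```

In this toy setting, "hello" can be guessed after four of its five letters, whereas "hear" is never unique because longer entries share its full form. The paper's estimate of the anticipation threshold is expressed in terms of its model parameters, which this sketch does not attempt to reproduce.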

