On the difficulty of a distributional semantics of spoken language

03/23/2018
by   Grzegorz Chrupała, et al.
0

The bulk of research in the area of speech processing concerns itself with supervised approaches to transcribing spoken language into text. In the domain of unsupervised learning most work on speech has focused on discovering relatively low level constructs such as phoneme inventories or word-like units. This is in contrast to research on written language, where there is a large body of work on unsupervised induction of semantic representations of words and whole sentences and longer texts. In this study we examine the challenges of adapting these approaches from written to spoken language. We conjecture that unsupervised learning of spoken language semantics becomes possible if we abstract from the surface variability. We simulate this setting by using a dataset of utterances spoken by a realistic but uniform synthetic voice. We evaluate two simple unsupervised models which, to varying degrees of success, learn semantic representations of speech fragments. Finally we suggest possible routes toward transferring our methods to the domain of unrestricted natural speech.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
05/08/2023

Putting Natural in Natural Language Processing

Human language is firstly spoken and only secondarily written. Text, h...
research
10/23/2022

Bootstrapping meaning through listening: Unsupervised learning of spoken sentence embeddings

Inducing semantic representations directly from speech signals is a high...
research
03/15/2023

Cascading and Direct Approaches to Unsupervised Constituency Parsing on Spoken Sentences

Past work on unsupervised parsing is constrained to written form. In thi...
research
09/14/2023

CiwaGAN: Articulatory information exchange

Humans encode information into sounds by controlling articulators and de...
research
07/05/2022

Making sense of spoken plurals

Distributional semantics offers new ways to study the semantics of morph...
research
08/21/2000

Processing Self Corrections in a speech to speech system

Speech repairs occur often in spontaneous spoken dialogues. The ability ...
research
02/25/2022

Learning English with Peppa Pig

Attempts to computationally simulate the acquisition of spoken language ...

Please sign up or login with your details

Forgot password? Click here to reset