Contextualized Spoken Word Representations from Convolutional Autoencoders

07/06/2020
by   Prakamya Mishra, et al.
0

A lot of work has been done recently to build sound language models for the textual data, but not much such has been done in the case of speech/audio type data. In the case of text, words can be represented by a unique fixed-length vector. Such models for audio type data can not only lead to great advances in the speech-related natural language processing tasks but can also reduce the need for converting speech to text for performing the same. This paper proposes a novel model architecture that produces syntactically, contextualized, and semantically adequate representation of varying length spoken words. The performance of the spoken word embeddings generated by the proposed model was validated by (1) inspecting the vector space generated, and (2) evaluating its performance on the downstream task of next spoken word prediction in a speech.

READ FULL TEXT
research
03/23/2018

Speech2Vec: A Sequence-to-Sequence Framework for Learning Word Embeddings from Speech

In this paper, we propose a novel deep neural network architecture, Spee...
research
11/23/2020

STEPs-RL: Speech-Text Entanglement for Phonetically Sound Representation Learning

In this paper, we present a novel multi-modal deep neural network archit...
research
06/01/2020

Transcription-Enriched Joint Embeddings for Spoken Descriptions of Images and Videos

In this work, we propose an effective approach for training unique embed...
research
09/17/2023

Augmenting text for spoken language understanding with Large Language Models

Spoken semantic parsing (SSP) involves generating machine-comprehensible...
research
09/01/2020

Hearings and mishearings: decrypting the spoken word

We propose a model of the speech perception of individual words in the p...
research
09/04/2023

Minimal Effective Theory for Phonotactic Memory: Capturing Local Correlations due to Errors in Speech

Spoken language evolves constrained by the economy of speech, which depe...
research
06/23/2021

Mixtures of Deep Neural Experts for Automated Speech Scoring

The paper copes with the task of automatic assessment of second language...

Please sign up or login with your details

Forgot password? Click here to reset