Bridging the Gap: Using Deep Acoustic Representations to Learn Grounded Language from Percepts and Raw Speech

12/27/2021
by   Gaoussou Youssouf Kebe, et al.
0

Learning to understand grounded language, which connects natural language to percepts, is a critical research area. Prior work in grounded language acquisition has focused primarily on textual inputs. In this work we demonstrate the feasibility of performing grounded language acquisition on paired visual percepts and raw speech inputs. This will allow interactions in which language about novel tasks and environments is learned from end users, reducing dependence on textual inputs and potentially mitigating the effects of demographic bias found in widely available speech recognition systems. We leverage recent work in self-supervised speech representation models and show that learned representations of speech can make language grounding systems more inclusive towards specific groups while maintaining or even increasing general performance.

READ FULL TEXT
research
06/16/2020

AVLnet: Learning Audio-Visual Language Representations from Instructional Videos

Current methods for learning visually grounded language from videos ofte...
research
05/21/2022

Self-Supervised Speech Representation Learning: A Review

Although supervised deep learning has revolutionized speech and audio pr...
research
07/29/2020

Presentation and Analysis of a Multimodal Dataset for Grounded LanguageLearning

Grounded language acquisition – learning how language-based interactions...
research
07/26/2016

Grounded Lexicon Acquisition - Case Studies in Spatial Language

This paper discusses grounded acquisition experiments of increasing comp...
research
10/28/2020

A Visuospatial Dataset for Naturalistic Verb Learning

We introduce a new dataset for training and evaluating grounded language...
research
07/20/2021

Neural Variational Learning for Grounded Language Acquisition

We propose a learning system in which language is grounded in visual per...
research
06/06/2022

Norm Participation Grounds Language

The striking recent advances in eliciting seemingly meaningful language ...

Please sign up or login with your details

Forgot password? Click here to reset