Learning Zero-Shot Multifaceted Visually Grounded Word Embeddings via Multi-Task Training

04/15/2021
by Hassan Shahmohammadi, et al.

Language grounding aims to link the symbolic representation of language (e.g., words) to the rich perceptual knowledge of the outside world. The general approach is to embed both textual and visual information into a common space (the grounded space) confined by an explicit relationship between the two modalities. We argue that this approach sacrifices the abstract knowledge obtained from linguistic co-occurrence statistics in the process of acquiring perceptual information. The focus of this paper is to solve this issue by implicitly grounding the word embeddings. Rather than learning two mappings into a joint space, our approach integrates modalities by determining a reversible grounded mapping between the textual and the grounded space by means of multi-task learning. Evaluations on intrinsic and extrinsic tasks show that our embeddings are highly beneficial for both abstract and concrete words. They are strongly correlated with human judgments and outperform previous works on a wide range of benchmarks. Our grounded embeddings are publicly available here.
