Multimodal Word Sense Disambiguation in Creative Practice

07/15/2020
by Manuel Ladron De Guevara, et al.

Language is ambiguous; many terms and expressions can convey the same idea. This is especially true in creative practice, where ideas and design intents are highly subjective. We present a dataset, Ambiguous Descriptions of Art Images (ADARI), of contemporary workpieces, which aims to provide a foundational resource for subjective image description and multimodal word sense disambiguation in the context of creative practice. The dataset contains a total of 240k images labeled with 260k descriptive sentences. It is additionally organized into the sub-domains of architecture, art, design, fashion, furniture, product design, and technology. In subjective image description, labels are not deterministic: for example, the ambiguous label "dynamic" might correspond to hundreds of different images. To understand this complexity, we analyze the ambiguity and relevance of text with respect to images using the state-of-the-art pre-trained BERT model for sentence classification. We provide a baseline for multi-label classification tasks and demonstrate the potential of multimodal approaches for understanding ambiguity in design intentions. We hope that the ADARI dataset and baselines constitute a first step towards subjective label classification.
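To make the multi-label framing mentioned in the abstract concrete, the following is a minimal sketch in plain Python. It is not the authors' baseline and does not use BERT or any ADARI data: the label vocabulary, the logits, and the helper names (`multi_hot`, `binary_cross_entropy`, `predict`) are all hypothetical and chosen only for illustration. It shows how a single image can carry several subjective labels at once, with each label scored independently instead of competing in a single softmax.

```python
import math

# Hypothetical label vocabulary for illustration only; not taken from ADARI.
VOCAB = ["dynamic", "minimal", "organic", "playful"]


def multi_hot(labels, vocab=VOCAB):
    """Encode a set of labels as a 0/1 vector over the vocabulary."""
    present = set(labels)
    return [1.0 if word in present else 0.0 for word in vocab]


def sigmoid(x):
    """Map a raw score (logit) to an independent per-label probability."""
    return 1.0 / (1.0 + math.exp(-x))


def binary_cross_entropy(logits, targets):
    """Mean per-label binary cross-entropy over one example."""
    eps = 1e-12  # guards against log(0)
    total = 0.0
    for z, t in zip(logits, targets):
        p = sigmoid(z)
        total -= t * math.log(p + eps) + (1.0 - t) * math.log(1.0 - p + eps)
    return total / len(targets)


def predict(logits, vocab=VOCAB, threshold=0.5):
    """Return every label whose probability reaches the threshold."""
    return [word for word, z in zip(vocab, logits) if sigmoid(z) >= threshold]


# One image described as both "dynamic" and "organic" (made-up example).
targets = multi_hot(["dynamic", "organic"])
logits = [2.0, -1.5, 0.3, -3.0]  # assumed model outputs, one per label
loss = binary_cross_entropy(logits, targets)
predicted = predict(logits)
```

Because each label has its own sigmoid, the prediction step can return zero, one, or many labels per image, which is the property that matters when a word such as "dynamic" applies to many visually different images.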


