From Images to Sentences through Scene Description Graphs using Commonsense Reasoning and Knowledge

11/10/2015
by   Somak Aditya, et al.
0

In this paper we propose the construction of linguistic descriptions of images. This is achieved through the extraction of scene description graphs (SDGs) from visual scenes using an automatically constructed knowledge base. SDGs are constructed using both vision and reasoning. Specifically, commonsense reasoning is applied on (a) detections obtained from existing perception methods on given images, (b) a "commonsense" knowledge base constructed using natural language processing of image annotations and (c) lexical ontological knowledge from resources such as WordNet. Amazon Mechanical Turk(AMT)-based evaluations on Flickr8k, Flickr30k and MS-COCO datasets show that in most cases, sentences auto-constructed from SDGs obtained by our method give a more relevant and thorough description of an image than a recent state-of-the-art image caption based approach. Our Image-Sentence Alignment Evaluation results are also comparable to that of the recent state-of-the art approaches.

READ FULL TEXT

page 8

page 11

page 13

page 14

page 15

page 16

page 17

research
10/10/2020

Beyond Language: Learning Commonsense from Images for Reasoning

This paper proposes a novel approach to learn commonsense from images, i...
research
10/30/2021

Automatic Knowledge Augmentation for Generative Commonsense Reasoning

Generative commonsense reasoning is the capability of a language model t...
research
12/16/2021

SGEITL: Scene Graph Enhanced Image-Text Learning for Visual Commonsense Reasoning

Answering complex questions about images is an ambitious goal for machin...
research
11/20/2014

CIDEr: Consensus-based Image Description Evaluation

Automatically describing an image with a sentence is a long-standing cha...
research
09/18/2019

RestKB: A Library of Commonsense Knowledge about Dining at a Restaurant

This paper presents a library of commonsense knowledge, RestKB, develope...
research
01/11/2021

A Commonsense Reasoning Framework for Explanatory Emotion Attribution, Generation and Re-classification

In this work we present an explainable system for emotion attribution an...
research
02/13/2023

Text2shape Deep Retrieval Model: Generating Initial Cases for Mechanical Part Redesign under the Context of Case-Based Reasoning

Retrieving the similar solutions from the historical case base for new d...

Please sign up or login with your details

Forgot password? Click here to reset