How a General-Purpose Commonsense Ontology can Improve Performance of Learning-Based Image Retrieval

05/24/2017
by   Rodrigo Toro Icarte, et al.
0

The knowledge representation community has built general-purpose ontologies which contain large amounts of commonsense knowledge over relevant aspects of the world, including useful visual information, e.g.: "a ball is used by a football player", "a tennis player is located at a tennis court". Current state-of-the-art approaches for visual recognition do not exploit these rule-based knowledge sources. Instead, they learn recognition models directly from training examples. In this paper, we study how general-purpose ontologies---specifically, MIT's ConceptNet ontology---can improve the performance of state-of-the-art vision systems. As a testbed, we tackle the problem of sentence-based image retrieval. Our retrieval approach incorporates knowledge from ConceptNet on top of a large pool of object detectors derived from a deep learning technique. In our experiments, we show that ConceptNet can improve performance on a common benchmark dataset. Key to our performance is the use of the ESPGAME dataset to select visually relevant relations from ConceptNet. Consequently, a main conclusion of this work is that general-purpose commonsense ontologies improve performance on visual reasoning tasks when properly filtered to select meaningful visual relations.

READ FULL TEXT

page 1

page 6

research
11/25/2021

GPR1200: A Benchmark for General-Purpose Content-Based Image Retrieval

Even though it has extensively been shown that retrieval specific traini...
research
08/14/2018

Applying the Closed World Assumption to SUMO-based Ontologies

In commonsense knowledge representation, the Open World Assumption is ad...
research
10/23/2022

Retrieval Augmentation for Commonsense Reasoning: A Unified Approach

A common thread of retrieval-augmented methods in the existing literatur...
research
05/17/2023

CooK: Empowering General-Purpose Language Models with Modular and Collaborative Knowledge

Large language models (LLMs) are increasingly adopted for knowledge-inte...
research
08/03/2023

DOLCE: A Descriptive Ontology for Linguistic and Cognitive Engineering

DOLCE, the first top-level (foundational) ontology to be axiomatized, ha...
research
10/16/2022

COFAR: Commonsense and Factual Reasoning in Image Search

One characteristic that makes humans superior to modern artificially int...
research
08/01/2017

Improved Representation Learning for Predicting Commonsense Ontologies

Recent work in learning ontologies (hierarchical and partially-ordered s...

Please sign up or login with your details

Forgot password? Click here to reset