A semantics-driven methodology for high-quality image annotation

07/26/2023
by   Fausto Giunchiglia, et al.
0

Recent work in Machine Learning and Computer Vision has highlighted the presence of various types of systematic flaws inside ground truth object recognition benchmark datasets. Our basic tenet is that these flaws are rooted in the many-to-many mappings which exist between the visual information encoded in images and the intended semantics of the labels annotating them. The net consequence is that the current annotation process is largely under-specified, thus leaving too much freedom to the subjective judgment of annotators. In this paper, we propose vTelos, an integrated Natural Language Processing, Knowledge Representation, and Computer Vision methodology whose main goal is to make explicit the (otherwise implicit) intended annotation semantics, thus minimizing the number and role of subjective choices. A key element of vTelos is the exploitation of the WordNet lexico-semantic hierarchy as the main means for providing the meaning of natural language labels and, as a consequence, for driving the annotation of images based on the objects and the visual properties they depict. The methodology is validated on images populating a subset of the ImageNet hierarchy.

READ FULL TEXT

page 2

page 7

research
02/17/2022

Visual Ground Truth Construction as Faceted Classification

Recent work in Machine Learning and Computer Vision has provided evidenc...
research
03/02/2022

A Split Semantic Detection Algorithm for Psychological Sandplay Image

Psychological sandplay, as an important psychological analysis tool, is ...
research
04/18/2023

Incremental Image Labeling via Iterative Refinement

Data quality is critical for multimedia tasks, while various types of sy...
research
02/26/2022

Building a visual semantics aware object hierarchy

The semantic gap is defined as the difference between the linguistic rep...
research
12/14/2021

Two Contrasting Data Annotation Paradigms for Subjective NLP Tasks

Labelled data is the foundation of most natural language processing task...
research
07/29/2020

Between Subjectivity and Imposition: Power Dynamics in Data Annotation for Computer Vision

The interpretation of data is fundamental to machine learning. This pape...
research
10/15/2018

The Focus-Aspect-Polarity Model for Predicting Subjective Noun Attributes in Images

Subjective visual interpretation is a challenging yet important topic in...

Please sign up or login with your details

Forgot password? Click here to reset