Bridging Gap between Image Pixels and Semantics via Supervision: A Survey

07/29/2021
by   Jiali Duan, et al.
19

The fact that there exists a gap between low-level features and semantic meanings of images, called the semantic gap, is known for decades. Resolution of the semantic gap is a long standing problem. The semantic gap problem is reviewed and a survey on recent efforts in bridging the gap is made in this work. Most importantly, we claim that the semantic gap is primarily bridged through supervised learning today. Experiences are drawn from two application domains to illustrate this point: 1) object detection and 2) metric learning for content-based image retrieval (CBIR). To begin with, this paper offers a historical retrospective on supervision, makes a gradual transition to the modern data-driven methodology and introduces commonly used datasets. Then, it summarizes various supervision methods to bridge the semantic gap in the context of object detection and metric learning.

READ FULL TEXT

page 2

page 4

page 7

page 8

research
06/19/2017

Recent Advance in Content-based Image Retrieval: A Literature Survey

The explosive increase and ubiquitous accessibility of visual data on th...
research
10/17/2022

Bridging the Gap between Local Semantic Concepts and Bag of Visual Words for Natural Scene Image Retrieval

This paper addresses the problem of semantic-based image retrieval of na...
research
01/26/2020

An Effective Automatic Image Annotation Model Via Attention Model and Data Equilibrium

Nowadays, a huge number of images are available. However, retrieving a r...
research
03/28/2015

Socializing the Semantic Gap: A Comparative Survey on Image Tag Assignment, Refinement and Retrieval

Where previous reviews on content-based image retrieval emphasize on wha...
research
02/23/2022

An End-to-End Cascaded Image Deraining and Object Detection Neural Network

While the deep learning-based image deraining methods have made great pr...
research
08/18/2023

Generalized Sum Pooling for Metric Learning

A common architectural choice for deep metric learning is a convolutiona...
research
07/14/2023

Generalizable Embeddings with Cross-batch Metric Learning

Global average pooling (GAP) is a popular component in deep metric learn...

Please sign up or login with your details

Forgot password? Click here to reset