Image Annotation based on Deep Hierarchical Context Networks

12/21/2020
by   Mingyuan Jiu, et al.

Context modeling is one of the most fertile subfields of visual recognition; it aims to design discriminant image representations that incorporate their intrinsic and extrinsic relationships. However, its potential is currently underexplored, and most existing solutions are either context-free or restricted to simple handcrafted geometric relationships. In this paper we introduce DHCN, a novel Deep Hierarchical Context Network that leverages different sources of context, including geometric and semantic relationships. The proposed method is based on the minimization of an objective function that mixes a fidelity term, a context criterion and a regularizer. The solution of this objective function defines the architecture of a bi-level hierarchical context network: the first level captures scene geometry, while the second corresponds to semantic relationships. We solve this representation learning problem by training the underlying deep network, whose parameters correspond to the most influential bi-level contextual relationships, and we evaluate its performance on image annotation using the challenging ImageCLEF benchmark.
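To make the three-term structure of such an objective concrete, here is a minimal NumPy sketch. It is not the formulation from the paper, which the abstract does not spell out. The function name `toy_context_objective`, the matrices `S`, `P_geo` and `P_sem`, the weights `alpha` and `beta`, and the specific form of each term are all assumptions chosen for illustration, and the two context sources are simply summed rather than arranged in a bi-level hierarchy.

```python
import numpy as np


def toy_context_objective(K, S, P_geo, P_sem, alpha=0.1, beta=1.0):
    """Toy objective = fidelity + alpha * context + beta * regularizer.

    Every term below is an illustrative assumption, not the paper's definition.
    K     : (n, n) learned similarity between n image regions (hypothetical)
    S     : (n, n) context-free similarity, e.g. from visual features (hypothetical)
    P_geo : (n, n) geometric neighborhood matrix (hypothetical)
    P_sem : (n, n) semantic neighborhood matrix (hypothetical)
    """
    # Fidelity: keep the learned similarity close to the context-free one.
    fidelity = np.sum((K - S) ** 2)

    # Context: reward (negative cost) high similarity between regions whose
    # neighbors, under either relationship, are also similar.
    context = -sum(np.trace(K @ P @ K.T @ P.T) for P in (P_geo, P_sem))

    # Regularizer: squared Frobenius norm keeps K bounded.
    regularizer = np.sum(K ** 2)

    return float(fidelity + alpha * context + beta * regularizer)


if __name__ == "__main__":
    n = 3
    S = np.eye(n)
    K = S.copy()
    no_context = np.zeros((n, n))
    print(toy_context_objective(K, S, no_context, no_context))
```

In this sketch, when `K` equals `S` and both neighborhood matrices are zero, the fidelity and context terms vanish and the value reduces to `beta` times the squared Frobenius norm of `K`. Raising `alpha` gives the context term more influence relative to the other two.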

research
12/29/2019

Deep Context-Aware Kernel Networks

Context plays a crucial role in visual recognition as it provides comple...
research
03/23/2018

Learning Deep Context-Network Architectures for Image Annotation

Context plays an important role in visual pattern recognition as it prov...
research
04/05/2021

Hierarchical Pyramid Representations for Semantic Segmentation

Understanding the context of complex and cluttered scenes is a challengi...
research
10/17/2019

Topical Keyphrase Extraction with Hierarchical Semantic Networks

Topical keyphrase extraction is used to summarize large collections of t...
research
02/11/2019

Semantic Hierarchical Priors for Intrinsic Image Decomposition

Intrinsic Image Decomposition (IID) is a challenging and interesting com...
research
03/30/2016

Unsupervised Learning of Visual Representations by Solving Jigsaw Puzzles

In this paper we study the problem of image representation learning with...
research
05/29/2023

A Hierarchical Context-aware Modeling Approach for Multi-aspect and Multi-granular Pronunciation Assessment

Automatic Pronunciation Assessment (APA) plays a vital role in Computer-...
