Learning Structured Semantic Embeddings for Visual Recognition

06/05/2017
by Dong Li, et al.

Numerous embedding models have recently been explored to incorporate semantic knowledge into visual recognition. Existing methods typically focus on minimizing the distance between corresponding images and texts in the embedding space but do not explicitly optimize the underlying structure of that space. Our key observation is that modeling pairwise image-image relationships improves the discriminative ability of the embedding model. In this paper, we propose structured discriminative and difference constraints for learning visual-semantic embeddings. First, we use discriminative constraints to capture the intra- and inter-class relationships of image embeddings; these constraints encourage separability between image instances of different classes. Second, we align the difference vector between a pair of image embeddings with that of the corresponding word embeddings; these difference constraints regularize the image embeddings to preserve the semantic relationships among word embeddings. Extensive evaluations demonstrate the effectiveness of the proposed structured embeddings for single-label classification, multi-label classification, and zero-shot recognition.
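The two constraints described in the abstract can be illustrated with a small NumPy sketch. The abstract does not give the loss formulas, so everything below is an assumption made for illustration: the function names, the squared-error form of the difference term, the contrastive-style form of the discriminative term, and the `margin` parameter are hypothetical and may differ from the formulation in the paper.

```python
import numpy as np


def difference_loss(img_emb, word_emb, labels):
    """Assumed difference constraint (not the paper's exact loss).

    Penalizes the squared gap between the difference vector of each pair
    of image embeddings and the difference vector of the word embeddings
    of their classes.

    img_emb:  (n, d) image embeddings
    word_emb: (c, d) one word embedding per class
    labels:   (n,) integer class ids indexing into word_emb
    """
    img_diff = img_emb[:, None, :] - img_emb[None, :, :]   # (n, n, d)
    words = word_emb[labels]                               # (n, d)
    word_diff = words[:, None, :] - words[None, :, :]      # (n, n, d)
    return float(np.mean(np.sum((img_diff - word_diff) ** 2, axis=-1)))


def discriminative_loss(img_emb, labels, margin=1.0):
    """Assumed discriminative constraint (contrastive-style stand-in).

    Pulls same-class image embeddings together and pushes different-class
    embeddings at least `margin` apart.
    """
    diff = img_emb[:, None, :] - img_emb[None, :, :]
    dist = np.sqrt(np.sum(diff ** 2, axis=-1))             # (n, n)
    same = labels[:, None] == labels[None, :]
    off_diag = ~np.eye(len(labels), dtype=bool)            # skip self-pairs
    pull = np.sum(dist[same & off_diag] ** 2)
    push = np.sum(np.maximum(0.0, margin - dist[~same]) ** 2)
    n_pairs = max(int(off_diag.sum()), 1)
    return float((pull + push) / n_pairs)


# Toy data: two classes whose word embeddings lie 3 units apart.
labels = np.array([0, 0, 1, 1])
word_emb = np.array([[0.0, 0.0], [3.0, 0.0]])
aligned = word_emb[labels]           # images sit exactly on their class words
collapsed = np.zeros((4, 2))         # all images mapped to a single point
```

In this toy setting, both terms vanish for `aligned`, because image differences match word differences and the classes are farther apart than the margin. Both terms are positive for `collapsed`, where class structure is lost. A training objective would presumably combine terms like these with an image-text matching loss, but the weighting is not stated in the abstract.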


