Weakly-supervised learning of visual relations

07/29/2017
by   Julia Peyre, et al.
0

This paper introduces a novel approach for modeling visual relations between pairs of objects. We call relation a triplet of the form (subject, predicate, object) where the predicate is typically a preposition (eg. 'under', 'in front of') or a verb ('hold', 'ride') that links a pair of objects (subject, object). Learning such relations is challenging as the objects have different spatial configurations and appearances depending on the relation in which they occur. Another major challenge comes from the difficulty to get annotations, especially at box-level, for all possible triplets, which makes both learning and evaluation difficult. The contributions of this paper are threefold. First, we design strong yet flexible visual features that encode the appearance and spatial configuration for pairs of objects. Second, we propose a weakly-supervised discriminative clustering model to learn relations from image-level labels only. Third we introduce a new challenging dataset of unusual relations (UnRel) together with an exhaustive annotation, that enables accurate evaluation of visual relation retrieval. We show experimentally that our model results in state-of-the-art results on the visual relationship dataset significantly improving performance on previously unseen relations (zero-shot learning), and confirm this observation on our newly introduced UnRel dataset.

READ FULL TEXT

page 1

page 4

page 7

page 8

page 12

page 14

page 15

page 16

research
06/16/2020

Explanation-based Weakly-supervised Learning of Visual Relations with Graph Networks

Visual relationship detection is fundamental for holistic image understa...
research
08/07/2017

PPR-FCN: Weakly Supervised Visual Relation Detection via Parallel Pairwise R-FCN

We aim to tackle a novel vision task called Weakly Supervised Visual Rel...
research
12/13/2018

Detecting rare visual relations using analogies

We seek to detect visual relations in images of the form of triplets t =...
research
08/24/2023

SCoRD: Subject-Conditional Relation Detection with Text-Augmented Data

We propose Subject-Conditional Relation Detection SCoRD, where condition...
research
05/02/2019

Improving Visual Relation Detection using Depth Maps

State of the art visual relation detection methods have been relying on ...
research
07/28/2017

A Weakly Supervised Approach to Train Temporal Relation Classifiers and Acquire Regular Event Pairs Simultaneously

Capabilities of detecting temporal relations between two events can bene...
research
11/22/2019

Visual Relationship Detection with Low Rank Non-Negative Tensor Decomposition

We address the problem of Visual Relationship Detection (VRD) which aims...

Please sign up or login with your details

Forgot password? Click here to reset