Explanation-based Weakly-supervised Learning of Visual Relations with Graph Networks

06/16/2020
by   Federico Baldassarre, et al.
4

Visual relationship detection is fundamental for holistic image understanding. However, localizing and classifying (subject, predicate, object) triplets constitutes a hard learning objective due to the combinatorial explosion of possible relationships, their long-tail distribution in natural images, and an expensive annotation process. This paper introduces a novel weakly-supervised method for visual relationship detection that relies only on image-level predicate annotations. A graph neural network is trained to classify the predicates in an image from the graph representation of all objects, implicitly encoding an inductive bias for pairwise relationships. We then frame relationship detection as the explanation of such a predicate classifier, i.e. we reconstruct a complete relationship by recovering the subject and the object of a predicted predicate. Using this novel technique and minimal labels, we present comparable results to recent fully-supervised and weakly-supervised methods on three diverse and challenging datasets: HICO-DET for human-object interaction, Visual Relationship Detection for generic object-to-object relationships, and UnRel for unusual relationships.

READ FULL TEXT

page 2

page 6

page 8

page 11

page 19

page 20

page 25

page 30

research
07/29/2017

Weakly-supervised learning of visual relations

This paper introduces a novel approach for modeling visual relations bet...
research
03/07/2022

Unpaired Image Captioning by Image-level Weakly-Supervised Visual Concept Recognition

The goal of unpaired image captioning (UIC) is to describe images withou...
research
08/07/2017

PPR-FCN: Weakly Supervised Visual Relation Detection via Parallel Pairwise R-FCN

We aim to tackle a novel vision task called Weakly Supervised Visual Rel...
research
11/10/2020

Detecting Human-Object Interaction with Mixed Supervision

Human object interaction (HOI) detection is an important task in image u...
research
10/11/2020

Constructing a Visual Relationship Authenticity Dataset

A visual relationship denotes a relationship between two objects in an i...
research
05/28/2017

Care about you: towards large-scale human-centric visual relationship detection

Visual relationship detection aims to capture interactions between pairs...
research
05/29/2020

Fixed-size Objects Encoding for Visual Relationship Detection

In this paper, we propose a fixed-size object encoding method (FOE-VRD) ...

Please sign up or login with your details

Forgot password? Click here to reset