Probabilistic Modeling of Semantic Ambiguity for Scene Graph Generation

03/09/2021
by   Gengcong Yang, et al.
0

To generate "accurate" scene graphs, almost all existing methods predict pairwise relationships in a deterministic manner. However, we argue that visual relationships are often semantically ambiguous. Specifically, inspired by linguistic knowledge, we classify the ambiguity into three types: Synonymy Ambiguity, Hyponymy Ambiguity, and Multi-view Ambiguity. The ambiguity naturally leads to the issue of implicit multi-label, motivating the need for diverse predictions. In this work, we propose a novel plug-and-play Probabilistic Uncertainty Modeling (PUM) module. It models each union region as a Gaussian distribution, whose variance measures the uncertainty of the corresponding visual content. Compared to the conventional deterministic methods, such uncertainty modeling brings stochasticity of feature representation, which naturally enables diverse predictions. As a byproduct, PUM also manages to cover more fine-grained relationships and thus alleviates the issue of bias towards frequent relationships. Extensive experiments on the large-scale Visual Genome benchmark show that combining PUM with newly proposed ResCAGCN can achieve state-of-the-art performances, especially under the mean recall metric. Furthermore, we prove the universal effectiveness of PUM by plugging it into some existing models and provide insightful analysis of its ability to generate diverse yet plausible visual relationships.

READ FULL TEXT

page 1

page 3

page 8

research
03/08/2019

Knowledge-Embedded Routing Network for Scene Graph Generation

To understand a scene in depth not only involves locating/recognizing in...
research
08/04/2023

Improving Scene Graph Generation with Superpixel-Based Interaction Learning

Recent advances in Scene Graph Generation (SGG) typically model the rela...
research
03/06/2013

Qualitative Measures of Ambiguity

This paper introduces a qualitative measure of ambiguity and analyses it...
research
11/11/2022

Probabilistic Debiasing of Scene Graphs

The quality of scene graphs generated by the state-of-the-art (SOTA) mod...
research
03/21/2023

The Treasure Beneath Multiple Annotations: An Uncertainty-aware Edge Detector

Deep learning-based edge detectors heavily rely on pixel-wise labels whi...
research
07/08/2022

GEMS: Scene Expansion using Generative Models of Graphs

Applications based on image retrieval require editing and associating in...
research
06/12/2023

Deep Model Compression Also Helps Models Capture Ambiguity

Natural language understanding (NLU) tasks face a non-trivial amount of ...

Please sign up or login with your details

Forgot password? Click here to reset