Modeling Context Between Objects for Referring Expression Understanding

08/01/2016
by   Varun K. Nagaraja, et al.
0

Referring expressions usually describe an object using properties of the object and relationships of the object with other objects. We propose a technique that integrates context between objects to understand referring expressions. Our approach uses an LSTM to learn the probability of a referring expression, with input features from a region and a context region. The context regions are discovered using multiple-instance learning (MIL) since annotations for context objects are generally not available for training. We utilize max-margin based MIL objective functions for training the LSTM. Experiments on the Google RefExp and UNC RefExp datasets show that modeling context between objects provides better performance than modeling only object properties. We also qualitatively show that our technique can ground a referring expression to its referred region along with the supporting context region.

READ FULL TEXT

page 2

page 5

page 7

page 12

page 13

page 14

research
07/31/2016

Modeling Context in Referring Expressions

Humans refer to objects in their environments all the time, especially i...
research
05/26/2018

Using Syntax to Ground Referring Expressions in Natural Images

We introduce GroundNet, a neural network for referring expression recogn...
research
07/08/2019

Variational Context: Exploiting Visual and Textual Context for Grounding Referring Expressions

We focus on grounding (i.e., localizing or linking) referring expression...
research
12/05/2017

Grounding Referring Expressions in Images by Variational Context

We focus on grounding (i.e., localizing or linking) referring expression...
research
06/27/2012

Learning Object Arrangements in 3D Scenes using Human Context

We consider the problem of learning object arrangements in a 3D scene. T...
research
06/09/2019

Referring Expression Grounding by Marginalizing Scene Graph Likelihood

We focus on the task of grounding referring expressions in images, e.g.,...
research
12/30/2022

Gray–Wyner and Mutual Information Regions for Doubly Symmetric Binary Sources and Gaussian Sources

Nonconvex optimization plays a key role in multi-user information theory...

Please sign up or login with your details

Forgot password? Click here to reset