A Joint Speaker-Listener-Reinforcer Model for Referring Expressions

12/30/2016
by Licheng Yu, et al.

Referring expressions are natural language constructions used to identify particular objects within a scene. In this paper, we propose a unified framework for the tasks of referring expression comprehension and generation. Our model is composed of three modules: speaker, listener, and reinforcer. The speaker generates referring expressions, the listener comprehends them, and the reinforcer introduces a reward function that guides the sampling of more discriminative expressions. The speaker and listener modules are trained jointly in an end-to-end learning framework, allowing the modules to be aware of one another during learning while also benefiting from the discriminative reinforcer's feedback. We demonstrate that this unified framework and training scheme achieve state-of-the-art results for both comprehension and generation on three referring expression datasets. Project and demo page: https://vision.cs.unc.edu/refer
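
The reinforcer's role, rewarding sampled expressions that are more discriminative, can be illustrated with a toy sketch. The abstract does not specify the training algorithm, so the code below assumes a REINFORCE-style policy gradient with a moving-average baseline, which is one common way to optimize a reward over sampled outputs. The "speaker" here is only a categorical distribution over three fixed candidate strings, and `toy_reward`, `train_speaker`, the candidates, and all hyperparameters are hypothetical stand-ins rather than the paper's implementation.

```python
import math
import random


def softmax(logits):
    """Convert logits to probabilities in a numerically stable way."""
    m = max(logits)
    exps = [math.exp(x - m) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]


def toy_reward(expression):
    """Hypothetical reward: 1.0 if the expression contains a
    distinguishing attribute (here, the word 'left'), else 0.0."""
    return 1.0 if "left" in expression else 0.0


def train_speaker(expressions, reward_fn, steps=2000, lr=0.1, seed=0):
    """Toy speaker: a categorical policy over fixed candidate expressions,
    updated with a REINFORCE-style gradient (an assumption, see lead-in)."""
    rng = random.Random(seed)
    logits = [0.0] * len(expressions)
    baseline = 0.0  # moving-average baseline to reduce gradient variance
    for _ in range(steps):
        probs = softmax(logits)
        # Sample one expression from the current policy.
        idx = rng.choices(range(len(expressions)), weights=probs)[0]
        reward = reward_fn(expressions[idx])
        advantage = reward - baseline
        baseline += 0.05 * (reward - baseline)
        # d log p(idx) / d logit_j = 1[j == idx] - probs[j]
        for j in range(len(logits)):
            grad = (1.0 if j == idx else 0.0) - probs[j]
            logits[j] += lr * advantage * grad
    return softmax(logits)


candidates = ["the man", "the man on the left", "a person"]
final_probs = train_speaker(candidates, toy_reward)
best = candidates[final_probs.index(max(final_probs))]
print(best, [round(p, 3) for p in final_probs])
```

In this sketch the reward shifts probability mass toward the candidate that the stand-in reward treats as discriminative. In the framework described above, the reward would come from a learned module and the speaker would generate word sequences rather than choose among fixed strings.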


