Pragmatic Issue-Sensitive Image Captioning

04/29/2020
by   Allen Nie, et al.
5

Image captioning systems have recently improved dramatically, but they still tend to produce captions that are insensitive to the communicative goals that captions should meet. To address this, we propose Issue-Sensitive Image Captioning (ISIC). In ISIC, a captioning system is given a target image and an issue, which is a set of images partitioned in a way that specifies what information is relevant. The goal of the captioner is to produce a caption that resolves this issue. To model this task, we use an extension of the Rational Speech Acts model of pragmatic language use. Our extension is built on top of state-of-the-art pretrained neural image captioners and explicitly reasons about issues in our sense. We establish experimentally that these models generate captions that are both highly descriptive and issue-sensitive, and we show how ISIC can complement and enrich the related task of Visual Question Answering.

READ FULL TEXT

page 1

page 5

page 6

page 8

research
05/22/2018

Joint Image Captioning and Question Answering

Answering visual questions need acquire daily common knowledge and model...
research
09/18/2020

Image Captioning with Attention for Smart Local Tourism using EfficientNet

Smart systems have been massively developed to help humans in various ta...
research
04/15/2018

Pragmatically Informative Image Captioning with Character-Level Reference

We combine a neural image captioner with a Rational Speech Acts (RSA) mo...
research
01/04/2022

Interactive Attention AI to translate low light photos to captions for night scene understanding in women safety

There is amazing progress in Deep Learning based models for Image captio...
research
11/09/2020

CapWAP: Captioning with a Purpose

The traditional image captioning task uses generic reference captions to...
research
09/02/2018

Chittron: An Automatic Bangla Image Captioning System

Automatic image caption generation aims to produce an accurate descripti...
research
07/26/2018

Rethinking the Form of Latent States in Image Captioning

RNNs and their variants have been widely adopted for image captioning. I...

Please sign up or login with your details

Forgot password? Click here to reset