iCap: Interative Image Captioning with Predictive Text

01/31/2020
by   Zhengxiong Jia, et al.
0

In this paper we study a brand new topic of interactive image captioning with human in the loop. Different from automated image captioning where a given test image is the sole input in the inference stage, we have access to both the test image and a sequence of (incomplete) user-input sentences in the interactive scenario. We formulate the problem as Visually Conditioned Sentence Completion (VCSC). For VCSC, we propose asynchronous bidirectional decoding for image caption completion (ABD-Cap). With ABD-Cap as the core module, we build iCap, a web-based interactive image captioning system capable of predicting new text with respect to live input from a user. A number of experiments covering both automated evaluations and real user studies show the viability of our proposals.

READ FULL TEXT

page 1

page 6

research
10/06/2018

A Comprehensive Study of Deep Learning for Image Captioning

Generating a description of an image is called image captioning. Image c...
research
10/06/2018

A Comprehensive Survey of Deep Learning for Image Captioning

Generating a description of an image is called image captioning. Image c...
research
11/07/2021

Machine-in-the-Loop Rewriting for Creative Image Captioning

Machine-in-the-loop writing aims to enable humans to collaborate with mo...
research
06/04/2016

Automated Image Captioning for Rapid Prototyping and Resource Constrained Environments

Significant performance gains in deep learning coupled with the exponent...
research
05/24/2023

Exploring Diverse In-Context Configurations for Image Captioning

After discovering that Language Models (LMs) can be good in-context few-...
research
02/28/2022

Interactive Machine Learning for Image Captioning

We propose an approach for interactive learning for an image captioning ...
research
04/05/2023

Towards Self-Explainability of Deep Neural Networks with Heatmap Captioning and Large-Language Models

Heatmaps are widely used to interpret deep neural networks, particularly...

Please sign up or login with your details

Forgot password? Click here to reset