Improving Image Captioning with Control Signal of Sentence Quality

06/07/2022
by Zhangzi Zhu, et al.

In image captioning datasets, each image is paired with several captions. Although the quality of these captions varies, existing captioning models treat them all equally during training. In this paper, we propose a new control signal of sentence quality, which is fed to the captioning model as an additional input. With this control signal, a captioning model is aware of the quality level of each target sentence and can handle sentences of different quality differently. Moreover, we propose a novel reinforcement training method designed specifically for the control signal of sentence quality: Quality-oriented Self-Annotated Training (Q-SAT). Equipped with the R-Drop strategy, models controlled by the highest quality level substantially outperform baseline models on accuracy-based evaluation metrics, which validates the effectiveness of the proposed methods.
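As a rough illustration of how a quality control signal could be attached to training captions, the sketch below buckets the captions of one image into quality levels and prepends a control token to each. The abstract does not say how quality is measured or how the signal is encoded, so everything here is an assumption made for illustration: the word-overlap proxy score, the number of levels, the `<qN>` token format, and the function names `overlap_score`, `assign_quality_levels`, and `add_control_token` are all hypothetical and are not the authors' method.

```python
from typing import List


def overlap_score(caption: str, others: List[str]) -> float:
    """Hypothetical quality proxy: mean Jaccard word overlap with sibling captions."""
    words = set(caption.lower().split())
    scores = []
    for other in others:
        other_words = set(other.lower().split())
        union = words | other_words
        scores.append(len(words & other_words) / len(union) if union else 0.0)
    return sum(scores) / len(scores) if scores else 0.0


def assign_quality_levels(captions: List[str], num_levels: int = 3) -> List[int]:
    """Rank the captions of one image by the proxy score and bucket the ranks.

    Level 0 is the lowest quality bucket, num_levels - 1 the highest.
    """
    scores = [
        overlap_score(c, captions[:i] + captions[i + 1:])
        for i, c in enumerate(captions)
    ]
    # Indices sorted from lowest to highest proxy score.
    order = sorted(range(len(captions)), key=lambda i: scores[i])
    levels = [0] * len(captions)
    for rank, idx in enumerate(order):
        levels[idx] = min(num_levels - 1, rank * num_levels // len(captions))
    return levels


def add_control_token(caption: str, level: int) -> str:
    """Prepend an assumed control token so the model sees the quality level as input."""
    return f"<q{level}> {caption}"


captions = [
    "a dog runs on the grass",
    "a dog runs on grass",
    "a brown dog runs on the grass",
    "nice photo",
    "a dog on the grass",
]
levels = assign_quality_levels(captions)
training_targets = [add_control_token(c, l) for c, l in zip(captions, levels)]
```

Under this sketch, a model trained on `training_targets` would be conditioned on the highest level token at inference time, which mirrors the abstract's statement that models controlled by the highest quality level are the ones evaluated.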

