Reinforcing an Image Caption Generator Using Off-Line Human Feedback

11/21/2019
by   Paul Hongsuck Seo, et al.
0

Human ratings are currently the most accurate way to assess the quality of an image captioning model, yet most often the only used outcome of an expensive human rating evaluation is a few overall statistics over the evaluation dataset. In this paper, we show that the signal from instance-level human caption ratings can be leveraged to improve captioning models, even when the amount of caption ratings is several orders of magnitude less than the caption training data. We employ a policy gradient method to maximize the human ratings as rewards in an off-policy reinforcement learning setting, where policy gradients are estimated by samples from a distribution that focuses on the captions in a caption ratings dataset. Our empirical evidence indicates that the proposed method learns to generalize the human raters' judgments to a previously unseen set of images, as judged by a different set of human judges, and additionally on a different, multi-dimensional side-by-side human evaluation procedure.

READ FULL TEXT

page 1

page 7

research
09/08/2019

Quality Estimation for Image Captions Based on Large-scale Human Evaluations

Automatic image captioning has improved significantly in the last few ye...
research
11/27/2019

To Trust, or Not to Trust? A Study of Human Bias in Automated Video Interview Assessments

Supervised systems require human labels for training. But, are humans th...
research
03/13/2017

Users prefer Guetzli JPEG over same-sized libjpeg

We report on pairwise comparisons by human raters of JPEG images from li...
research
05/06/2019

Image Captioning with Clause-Focused Metrics in a Multi-Modal Setting for Marketing

Automatically generating descriptive captions for images is a well-resea...
research
06/18/2012

TrueLabel + Confusions: A Spectrum of Probabilistic Models in Analyzing Multiple Ratings

This paper revisits the problem of analyzing multiple ratings given by d...
research
03/17/2017

Towards Diverse and Natural Image Descriptions via a Conditional GAN

Despite the substantial progress in recent years, the image captioning t...
research
08/31/2019

Humor Detection: A Transformer Gets the Last Laugh

Much previous work has been done in attempting to identify humor in text...

Please sign up or login with your details

Forgot password? Click here to reset