Semantic Compositional Networks for Visual Captioning

11/23/2016
by Zhe Gan, et al.

A Semantic Compositional Network (SCN) is developed for image captioning, in which semantic concepts (i.e., tags) are detected from the image, and the probability of each tag is used to compose the parameters of a long short-term memory (LSTM) network. The SCN extends each weight matrix of the LSTM to an ensemble of tag-dependent weight matrices. The degree to which each member of the ensemble is used to generate an image caption is tied to the image-dependent probability of the corresponding tag. In addition to captioning images, we also extend the SCN to generate captions for video clips. We qualitatively analyze semantic composition in SCNs and quantitatively evaluate the algorithm on three benchmark datasets: COCO, Flickr30k, and Youtube2Text. Experimental results show that the proposed method significantly outperforms prior state-of-the-art approaches across multiple evaluation metrics.
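The composition step described in the abstract can be illustrated with a small sketch. It assumes the simplest reading of "tied to the image-dependent probability of the corresponding tag": a probability-weighted sum of the per-tag weight matrices. The abstract does not give the exact parameterization, so the function names (`compose_weights`, `matvec`) and the toy values below are hypothetical and are not taken from the authors' implementation.

```python
from typing import List

Matrix = List[List[float]]


def compose_weights(tag_probs: List[float], tag_weights: List[Matrix]) -> Matrix:
    """Blend one weight matrix per tag into a single matrix.

    Assumption: each tag's matrix contributes in proportion to the
    probability that the tag is present in the image.
    """
    if len(tag_probs) != len(tag_weights):
        raise ValueError("need exactly one weight matrix per tag")
    rows, cols = len(tag_weights[0]), len(tag_weights[0][0])
    composed = [[0.0] * cols for _ in range(rows)]
    for prob, weight in zip(tag_probs, tag_weights):
        for i in range(rows):
            for j in range(cols):
                composed[i][j] += prob * weight[i][j]
    return composed


def matvec(matrix: Matrix, vector: List[float]) -> List[float]:
    """Multiply a matrix by a vector (one linear map inside an LSTM gate)."""
    return [sum(w * x for w, x in zip(row, vector)) for row in matrix]


# Hypothetical toy setup: two tags, each with its own 2x2 weight matrix.
tag_probs = [0.75, 0.25]
tag_weights = [
    [[1.0, 0.0], [0.0, 1.0]],  # matrix tied to tag 0
    [[0.0, 2.0], [2.0, 0.0]],  # matrix tied to tag 1
]

# The image-specific matrix leans toward tag 0 because it is more probable.
composed = compose_weights(tag_probs, tag_weights)
projected = matvec(composed, [1.0, 2.0])
```

In this sketch a different image yields different tag probabilities and therefore a different effective weight matrix, which is the sense in which the LSTM parameters depend on the image. In a full LSTM the same composition would presumably be applied to every gate's input and recurrent matrices.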


