TPsgtR: Neural-Symbolic Tensor Product Scene-Graph-Triplet Representation for Image Captioning

11/22/2019
by   Chiranjib Sur, et al.
0

Image captioning can be improved if the structure of the graphical representations can be formulated with conceptual positional binding. In this work, we have introduced a novel technique for caption generation using the neural-symbolic encoding of the scene-graphs, derived from regional visual information of the images and we call it Tensor Product Scene-Graph-Triplet Representation (TP_sgtR). While, most of the previous works concentrated on identification of the object features in images, we introduce a neuro-symbolic embedding that can embed identified relationships among different regions of the image into concrete forms, instead of relying on the model to compose for any/all combinations. These neural symbolic representation helps in better definition of the neural symbolic space for neuro-symbolic attention and can be transformed to better captions. With this approach, we introduced two novel architectures (TP_sgtR-TDBU and TP_sgtR-sTDBU) for comparison and experiment result demonstrates that our approaches outperformed the other models, and generated captions are more comprehensive and natural.

READ FULL TEXT

page 1

page 6

research
02/09/2021

SG2Caps: Revisiting Scene Graphs for Image Captioning

The mainstream image captioning models rely on Convolutional Neural Netw...
research
01/27/2020

aiTPR: Attribute Interaction-Tensor Product Representation for Image Caption

Region visual features enhance the generative capability of the machines...
research
09/26/2017

Tensor Product Generation Networks

We present a new tensor product generation network (TPGN) that generates...
research
04/15/2022

Guiding Attention using Partial-Order Relationships for Image Captioning

The use of attention models for automated image captioning has enabled m...
research
07/29/2021

ReFormer: The Relational Transformer for Image Captioning

Image captioning is shown to be able to achieve a better performance by ...
research
12/17/2018

Feature Fusion Effects of Tensor Product Representation on (De)Compositional Network for Caption Generation for Images

Progress in image captioning is gradually getting complex as researchers...
research
05/17/2023

Tensor Products and Hyperdimensional Computing

Following up on a previous analysis of graph embeddings, we generalize a...

Please sign up or login with your details

Forgot password? Click here to reset