End-to-End Attention-based Image Captioning

04/30/2021
by Carola Sundaramoorthy, et al.

In this paper, we address the problem of image captioning for molecular translation, where the output is a predicted chemical notation in InChI format for a given molecular structure. Current approaches mainly follow rule-based or CNN+RNN-based methodologies. However, they seem to underperform on noisy images and on images with a small number of distinguishable features. To overcome this, we propose an end-to-end transformer model. Compared with attention-based techniques, our proposed model performs better on molecular datasets.

research
09/11/2021

Bornon: Bengali Image Captioning with Transformer-based Deep learning approach

Image captioning using Encoder-Decoder based approach where CNN is used ...
research
02/19/2022

Image-to-Graph Transformers for Chemical Structure Recognition

For several decades, chemical knowledge has been published in written te...
research
09/11/2018

End-to-end Image Captioning Exploits Multimodal Distributional Similarity

We hypothesize that end-to-end neural image captioning systems work seem...
research
12/08/2018

Attend More Times for Image Captioning

Most attention-based image captioning models attend to the image once pe...
research
06/16/2019

Image Captioning with Integrated Bottom-Up and Multi-level Residual Top-Down Attention for Game Scene Understanding

Image captioning has attracted considerable attention in recent years. H...
research
04/12/2016

Scan, Attend and Read: End-to-End Handwritten Paragraph Recognition with MDLSTM Attention

We present an attention-based model for end-to-end handwriting recogniti...
research
07/10/2018

Topic-Guided Attention for Image Captioning

Attention mechanisms have attracted considerable interest in image capti...
