Multi-Modal Image Captioning for the Visually Impaired

05/17/2021
by   Hiba Ahsan, et al.
0

One of the ways blind people understand their surroundings is by clicking images and relying on descriptions generated by image captioning systems. Current work on captioning images for the visually impaired do not use the textual data present in the image when generating captions. This problem is critical as many visual scenes contain text. Moreover, up to 21 questions asked by blind people about the images they click pertain to the text present in them. In this work, we propose altering AoANet, a state-of-the-art image captioning model, to leverage the text detected in the image as an input feature. In addition, we use a pointer-generator mechanism to copy the detected text to the caption when tokens need to be reproduced accurately. Our model outperforms AoANet on the benchmark dataset VizWiz, giving a 35 performance improvement on CIDEr and SPICE scores, respectively.

READ FULL TEXT

page 2

page 6

research
03/24/2020

TextCaps: a Dataset for Image Captioning with Reading Comprehension

Image descriptions can help visually impaired people to quickly understa...
research
08/26/2023

Towards Real Time Egocentric Segment Captioning for The Blind and Visually Impaired in RGB-D Theatre Images

In recent years, image captioning and segmentation have emerged as cruci...
research
04/28/2023

Quality-agnostic Image Captioning to Safely Assist People with Vision Impairment

Automated image captioning has the potential to be a useful tool for peo...
research
02/20/2020

Captioning Images Taken by People Who Are Blind

While an important problem in the vision community is to design algorith...
research
06/14/2022

Automated Testing of Image Captioning Systems

Image captioning (IC) systems, which automatically generate a text descr...
research
05/24/2023

Exploring Diverse In-Context Configurations for Image Captioning

After discovering that Language Models (LMs) can be good in-context few-...

Please sign up or login with your details

Forgot password? Click here to reset