UIT-ViIC: A Dataset for the First Evaluation on Vietnamese Image Captioning

02/01/2020
by   Quan Hoang Lam, et al.
0

Image Captioning, the task of automatic generation of image captions, has attracted attentions from researchers in many fields of computer science, being computer vision, natural language processing and machine learning in recent years. This paper contributes to research on Image Captioning task in terms of extending dataset to a different language - Vietnamese. So far, there is no existed Image Captioning dataset for Vietnamese language, so this is the foremost fundamental step for developing Vietnamese Image Captioning. In this scope, we first build a dataset which contains manually written captions for images from Microsoft COCO dataset relating to sports played with balls, we called this dataset UIT-ViIC. UIT-ViIC consists of 19,250 Vietnamese captions for 3,850 images. Following that, we evaluate our dataset on deep neural network models and do comparisons with English dataset and two Vietnamese datasets built by different methods. UIT-ViIC is published on our lab website for research purposes.

READ FULL TEXT

page 2

page 10

research
05/02/2017

STAIR Captions: Constructing a Large-Scale Japanese Image Caption Dataset

In recent years, automatic generation of image descriptions (captions), ...
research
07/04/2022

Are metrics measuring what they should? An evaluation of image captioning task metrics

Image Captioning is a current research task to describe the image conten...
research
08/05/2023

A Comprehensive Analysis of Real-World Image Captioning and Scene Identification

Image captioning is a computer vision task that involves generating natu...
research
08/05/2021

Neural Twins Talk Alternative Calculations

Inspired by how the human brain employs a higher number of neural pathwa...
research
05/13/2018

Image Captioning

This paper discusses and demonstrates the outcomes from our experimentat...
research
03/14/2018

Unpaired Image Captioning by Language Pivoting

Image captioning is a multimodal task involving computer vision and natu...
research
05/07/2023

UIT-OpenViIC: A Novel Benchmark for Evaluating Image Captioning in Vietnamese

Image Captioning is one of the vision-language tasks that still interest...

Please sign up or login with your details

Forgot password? Click here to reset