Towards Adaptable and Interactive Image Captioning with Data Augmentation and Episodic Memory

06/06/2023
by   Aliki Anagnostopoulou, et al.
0

Interactive machine learning (IML) is a beneficial learning paradigm in cases of limited data availability, as human feedback is incrementally integrated into the training process. In this paper, we present an IML pipeline for image captioning which allows us to incrementally adapt a pre-trained image captioning model to a new data distribution based on user input. In order to incorporate user input into the model, we explore the use of a combination of simple data augmentation methods to obtain larger data batches for each newly annotated data instance and implement continual learning methods to prevent catastrophic forgetting from repeated updates. For our experiments, we split a domain-specific image captioning dataset, namely VizWiz, into non-overlapping parts to simulate an incremental input flow for continually adapting the model to new data. We find that, while data augmentation worsens results, even when relatively small amounts of data are available, episodic memory is an effective strategy to retain knowledge from previously seen clusters.

READ FULL TEXT

page 3

page 6

research
02/28/2022

Interactive Machine Learning for Image Captioning

We propose an approach for interactive learning for an image captioning ...
research
06/06/2023

Putting Humans in the Image Captioning Loop

Image Captioning (IC) models can highly benefit from human feedback in t...
research
09/19/2019

ContCap: A comprehensive framework for continual image captioning

While cutting-edge image captioning systems are increasingly describing ...
research
07/13/2020

RATT: Recurrent Attention to Transient Tasks for Continual Image Captioning

Research on continual learning has led to a variety of approaches to mit...
research
02/22/2021

Image Captioning using Deep Stacked LSTMs, Contextual Word Embeddings and Data Augmentation

Image Captioning, or the automatic generation of descriptions for images...
research
09/30/2022

SmallCap: Lightweight Image Captioning Prompted with Retrieval Augmentation

Recent advances in image captioning have focused on scaling the data and...
research
07/06/2023

OLR-WA Online Regression with Weighted Average

Machine Learning requires a large amount of training data in order to bu...

Please sign up or login with your details

Forgot password? Click here to reset