DeepDiary: Automatic Caption Generation for Lifelogging Image Streams

08/12/2016
by   Chenyou Fan, et al.
0

Lifelogging cameras capture everyday life from a first-person perspective, but generate so much data that it is hard for users to browse and organize their image collections effectively. In this paper, we propose to use automatic image captioning algorithms to generate textual representations of these collections. We develop and explore novel techniques based on deep learning to generate captions for both individual images and image streams, using temporal consistency constraints to create summaries that are both more compact and less noisy. We evaluate our techniques with quantitative and qualitative results, and apply captioning to an image retrieval application for finding potentially private images. Our results suggest that our automatic captioning algorithms, while imperfect, may work well enough to help users manage lifelogging photo collections.

READ FULL TEXT

page 6

page 7

research
10/06/2018

A Comprehensive Study of Deep Learning for Image Captioning

Generating a description of an image is called image captioning. Image c...
research
07/15/2022

LineCap: Line Charts for Data Visualization Captioning Models

Data visualization captions help readers understand the purpose of a vis...
research
08/12/2016

When was that made?

In this paper, we explore deep learning methods for estimating when obje...
research
08/09/2015

Image Representations and New Domains in Neural Image Captioning

We examine the possibility that recent promising results in automatic ca...
research
05/20/2019

Image Captioning based on Deep Learning Methods: A Survey

Image captioning is a challenging task and attracting more and more atte...
research
08/21/2020

Behavioural pattern discovery from collections of egocentric photo-streams

The automatic discovery of behaviour is of high importance when aiming t...
research
12/05/2021

Gaudí: Conversational Interactions with Deep Representations to Generate Image Collections

Based on recent advances in realistic language modeling (GPT-3) and cros...

Please sign up or login with your details

Forgot password? Click here to reset