Egocentric Image Captioning for Privacy-Preserved Passive Dietary Intake Monitoring

07/01/2021
by   Jianing Qiu, et al.
0

Camera-based passive dietary intake monitoring is able to continuously capture the eating episodes of a subject, recording rich visual information, such as the type and volume of food being consumed, as well as the eating behaviours of the subject. However, there currently is no method that is able to incorporate these visual clues and provide a comprehensive context of dietary intake from passive recording (e.g., is the subject sharing food with others, what food the subject is eating, and how much food is left in the bowl). On the other hand, privacy is a major concern while egocentric wearable cameras are used for capturing. In this paper, we propose a privacy-preserved secure solution (i.e., egocentric image captioning) for dietary assessment with passive monitoring, which unifies food recognition, volume estimation, and scene understanding. By converting images into rich text descriptions, nutritionists can assess individual dietary intake based on the captions instead of the original images, reducing the risk of privacy leakage from images. To this end, an egocentric dietary image captioning dataset has been built, which consists of in-the-wild images captured by head-worn and chest-worn cameras in field studies in Ghana. A novel transformer-based architecture is designed to caption egocentric dietary images. Comprehensive experiments have been conducted to evaluate the effectiveness and to justify the design of the proposed architecture for egocentric dietary image captioning. To the best of our knowledge, this is the first work that applies image captioning to dietary intake assessment in real life settings.

READ FULL TEXT

page 1

page 3

page 7

research
05/07/2021

An Intelligent Passive Food Intake Assessment System with Egocentric Cameras

Malnutrition is a major public health concern in low-and-middle-income c...
research
01/20/2023

Visual Semantic Relatedness Dataset for Image Captioning

Modern image captioning system relies heavily on extracting knowledge fr...
research
04/23/2018

Object Counts! Bringing Explicit Detections Back into Image Captioning

The use of explicit object detectors as an intermediate step to image ca...
research
05/04/2023

Image Captioners Sometimes Tell More Than Images They See

Image captioning, a.k.a. "image-to-text," which generates descriptive te...
research
09/02/2020

Structure-Aware Generation Network for Recipe Generation from Images

Sharing food has become very popular with the development of social medi...
research
04/29/2020

Image Captioning through Image Transformer

Automatic captioning of images is a task that combines the challenges of...
research
07/27/2020

Decomposed Generation Networks with Structure Prediction for Recipe Generation from Food Images

Recipe generation from food images and ingredients is a challenging task...

Please sign up or login with your details

Forgot password? Click here to reset