Multi-Task Learning for Calorie Prediction on a Novel Large-Scale Recipe Dataset Enriched with Nutritional Information

11/02/2020
by Robin Ruede, et al.

A rapidly growing amount of content posted online, such as food recipes, opens doors to new exciting applications at the intersection of vision and language. In this work, we aim to estimate the calorie amount of a meal directly from an image by learning from recipes people have published on the Internet, thus skipping time-consuming manual data annotation. Since there are few large-scale publicly available datasets captured in unconstrained environments, we propose the pic2kcal benchmark comprising 308,000 images from over 70,000 recipes including photographs, ingredients and instructions. To obtain nutritional information of the ingredients and automatically determine the ground-truth calorie value, we match the items in the recipes with structured information from a food item database. We evaluate various neural networks for regression of the calorie quantity and extend them with the multi-task paradigm. Our learning procedure combines the calorie estimation with prediction of proteins, carbohydrates, and fat amounts as well as a multi-label ingredient classification. Our experiments demonstrate clear benefits of multi-task learning for calorie estimation, surpassing the single-task calorie regression by 9.9%. To encourage further research on this task, we make the code for generating the dataset and the models publicly available.
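
The multi-task objective described in the abstract can be illustrated with a minimal sketch. The abstract does not specify the loss functions or task weights, so everything here is an assumption made for illustration: an L1 loss for the four regression targets, a binary cross-entropy for the multi-label ingredient head, and the hypothetical names `multi_task_loss` and `weights`. It is not the authors' implementation.

```python
import math

# Regression targets named in the abstract; the dictionary keys are hypothetical.
REGRESSION_TASKS = ("kcal", "protein", "carbs", "fat")


def l1_loss(pred, target):
    """Absolute error for a single scalar regression target (assumed loss)."""
    return abs(pred - target)


def binary_cross_entropy(probs, labels, eps=1e-7):
    """Mean binary cross-entropy over all ingredient labels (multi-label case)."""
    total = 0.0
    for p, y in zip(probs, labels):
        p = min(max(p, eps), 1.0 - eps)  # clip to avoid log(0)
        total += -(y * math.log(p) + (1 - y) * math.log(1.0 - p))
    return total / len(labels)


def multi_task_loss(pred, target, weights=None):
    """Weighted sum of the regression losses and the ingredient loss.

    The per-task weights are an assumption; equal weights are used by default.
    """
    if weights is None:
        weights = {name: 1.0 for name in REGRESSION_TASKS + ("ingredients",)}
    loss = 0.0
    for name in REGRESSION_TASKS:
        loss += weights[name] * l1_loss(pred[name], target[name])
    loss += weights["ingredients"] * binary_cross_entropy(
        pred["ingredients"], target["ingredients"]
    )
    return loss


# Illustrative values only, not taken from the dataset.
prediction = {"kcal": 500.0, "protein": 22.0, "carbs": 55.0, "fat": 19.0,
              "ingredients": [0.9, 0.2, 0.7]}
ground_truth = {"kcal": 450.0, "protein": 20.0, "carbs": 60.0, "fat": 18.0,
                "ingredients": [1, 0, 1]}
print(multi_task_loss(prediction, ground_truth))
```

In practice the regression targets live on very different scales (calories versus grams), so the task weights or a per-target normalization would likely need tuning; the equal weights above are only a starting point.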


