Structure-Aware Generation Network for Recipe Generation from Images

09/02/2020
by Hao Wang, et al.

Sharing food has become very popular with the development of social media. In many real-world applications, people are keen to know the underlying recipe of a food item. In this paper, we are interested in automatically generating cooking instructions for food. We investigate the open research task of generating cooking instructions from only food images and ingredients, which is similar to image captioning. Compared with the targets in image captioning datasets, however, the target recipes are long paragraphs and carry no annotations of their structure. To address these limitations, we propose a novel framework, the Structure-aware Generation Network (SGN), for the food recipe generation task. Our approach brings together several novel ideas in a systematic framework: (1) exploiting an unsupervised learning approach to obtain sentence-level tree structure labels before training; (2) generating trees of target recipes from images under the supervision of the tree structure labels learned in (1); and (3) integrating the inferred tree structures into the recipe generation procedure. Our proposed model produces high-quality, coherent recipes and achieves state-of-the-art performance on the benchmark Recipe1M dataset.
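To make the notion of a sentence-level tree more concrete, the following is a minimal illustrative sketch, not the authors' implementation. It assumes a recipe can be represented as a tree whose leaves are individual instruction sentences and whose internal nodes group related steps. The names `TreeNode`, `leaves`, and `bracketed`, as well as the example recipe, are hypothetical and do not come from the paper.

```python
from dataclasses import dataclass, field
from itertools import count
from typing import List, Optional


@dataclass
class TreeNode:
    """A node in a hypothetical sentence-level recipe tree (assumed layout)."""
    sentence: Optional[str] = None  # set on leaves only
    children: List["TreeNode"] = field(default_factory=list)

    def is_leaf(self) -> bool:
        return not self.children


def leaves(node: TreeNode) -> List[str]:
    """Return the recipe sentences in left-to-right reading order."""
    if node.is_leaf():
        return [node.sentence] if node.sentence is not None else []
    out: List[str] = []
    for child in node.children:
        out.extend(leaves(child))
    return out


def bracketed(node: TreeNode) -> str:
    """Linearize the tree shape, replacing each leaf by its sentence index."""
    counter = count()

    def walk(n: TreeNode) -> str:
        if n.is_leaf():
            return str(next(counter))
        return "(" + " ".join(walk(c) for c in n.children) + ")"

    return walk(node)


# Made-up example recipe: the first two steps form one group.
recipe_tree = TreeNode(children=[
    TreeNode(children=[
        TreeNode("Chop the onions."),
        TreeNode("Fry them in oil."),
    ]),
    TreeNode("Serve over rice."),
])

print(leaves(recipe_tree))
print(bracketed(recipe_tree))  # ((0 1) 2)
```

In this sketch the bracketed string captures only the shape of the tree, which is the kind of structural label that steps (1) and (2) above would need to represent in some form; how SGN actually encodes and predicts its trees is described in the full paper, not here.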


research
10/04/2021

Learning Structural Representations for Recipe Generation and Food Retrieval

Food is significant to human daily life. In this paper, we are intereste...
research
12/14/2018

Inverse Cooking: Recipe Generation from Food Images

People enjoy food photography because they appreciate food. Behind each ...
research
07/27/2020

Decomposed Generation Networks with Structure Prediction for Recipe Generation from Food Images

Recipe generation from food images and ingredients is a challenging task...
research
09/01/2023

Diffusion Model with Clustering-based Conditioning for Food Image Generation

Image-based dietary assessment serves as an efficient and accurate solut...
research
01/08/2019

GILT: Generating Images from Long Text

Creating an image reflecting the content of a long text is a complex pro...
research
07/01/2021

Egocentric Image Captioning for Privacy-Preserved Passive Dietary Intake Monitoring

Camera-based passive dietary intake monitoring is able to continuously c...
research
04/21/2020

ParaCNN: Visual Paragraph Generation via Adversarial Twin Contextual CNNs

Image description generation plays an important role in many real-world ...
