TASTEset – Recipe Dataset and Food Entities Recognition Benchmark

04/16/2022
by Ania Wróblewska, et al.

Food Computing is a fast-growing field of research, and natural language processing (NLP) is increasingly essential to it, especially for recognising food entities. However, few well-defined tasks exist that can serve as benchmarks for solutions in this area. To bridge this gap, we introduce a new dataset called TASTEset. In this dataset, Named Entity Recognition (NER) models are expected to find or infer various types of entities helpful in processing recipes, e.g. food products, quantities and their units, names of cooking processes, the physical quality of ingredients, their purpose, and their taste. The dataset consists of 700 recipes with more than 13,000 entities to extract. We provide several state-of-the-art NER baselines, which show that our dataset poses a solid challenge to existing models. The best model achieved an average F_1 score of 0.95, with per-entity-type scores ranging from 0.781 to 0.982. We share the dataset and the task to encourage progress on more in-depth and complex information extraction from recipes.
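The abstract reports F_1 per entity type and on average, but does not say how spans are matched or how the average is taken. As a minimal sketch, assuming exact-match scoring of character spans and an unweighted (macro) average over types, the computation could look like the following. The function name `per_type_f1`, the label names, and the toy recipe line are illustrative assumptions, not taken from the dataset.

```python
from collections import defaultdict


def per_type_f1(gold, pred):
    """Exact-match F1 per entity type.

    gold, pred: sets of (start, end, label) tuples.
    A predicted span counts as correct only if its boundaries
    and label both match a gold span (an assumed scoring scheme).
    """
    tp = defaultdict(int)
    fp = defaultdict(int)
    fn = defaultdict(int)

    for span in pred:
        if span in gold:
            tp[span[2]] += 1
        else:
            fp[span[2]] += 1
    for span in gold:
        if span not in pred:
            fn[span[2]] += 1

    scores = {}
    for label in set(tp) | set(fp) | set(fn):
        p_den = tp[label] + fp[label]
        r_den = tp[label] + fn[label]
        precision = tp[label] / p_den if p_den else 0.0
        recall = tp[label] / r_den if r_den else 0.0
        pr_sum = precision + recall
        scores[label] = 2 * precision * recall / pr_sum if pr_sum else 0.0
    return scores


# Toy ingredient line (hypothetical): "2 cups flour, sifted"
gold = {
    (0, 1, "QUANTITY"),
    (2, 6, "UNIT"),
    (7, 12, "FOOD"),
    (14, 20, "PROCESS"),
}
pred = {
    (0, 1, "QUANTITY"),
    (2, 6, "UNIT"),
    (7, 13, "FOOD"),  # boundary error: includes the trailing comma
}

scores = per_type_f1(gold, pred)
macro = sum(scores.values()) / len(scores)  # unweighted mean over types
print(scores, macro)
```

Under exact matching, the FOOD prediction with the wrong boundary counts as both a false positive and a false negative, which is one reason per-type scores can differ widely even when the average is high.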
