Solving Price Per Unit Problem Around the World: Formulating Fact Extraction as Question Answering

04/12/2022
by   Tarik Arici, et al.
0

Price Per Unit (PPU) is an essential information for consumers shopping on e-commerce websites when comparing products. Finding total quantity in a product is required for computing PPU, which is not always provided by the sellers. To predict total quantity, all relevant quantities given in a product attributes such as title, description and image need to be inferred correctly. We formulate this problem as a question-answering (QA) task rather than named entity recognition (NER) task for fact extraction. In our QA approach, we first predict the unit of measure (UoM) type (e.g., volume, weight or count), that formulates the desired question (e.g., "What is the total volume?") and then use this question to find all the relevant answers. Our model architecture consists of two subnetworks for the two subtasks: a classifier to predict UoM type (or the question) and an extractor to extract the relevant quantities. We use a deep character-level CNN architecture for both subtasks, which enables (1) easy expansion to new stores with similar alphabets, (2) multi-span answering due to its span-image architecture and (3) easy deployment by keeping model-inference latency low. Our QA approach outperforms rule-based methods by 34.4 globally, with largest precision lift of 10.6

READ FULL TEXT

page 1

page 2

page 3

page 4

research
12/04/2021

A Russian Jeopardy! Data Set for Question-Answering Systems

Question answering (QA) is one of the most common NLP tasks that relates...
research
01/11/2017

Question Analysis for Arabic Question Answering Systems

The first step of processing a question in Question Answering(QA) System...
research
09/09/2022

Activity report analysis with automatic single or multispan answer extraction

In the era of loT (Internet of Things) we are surrounded by a plethora o...
research
11/15/2018

Improving Skin Condition Classification with a Question Answering Model

We present a skin condition classification methodology based on a sequen...
research
05/27/2016

Boosting Question Answering by Deep Entity Recognition

In this paper an open-domain factoid question answering system for Polis...
research
04/09/2021

UPB at SemEval-2021 Task 8: Extracting Semantic Information on Measurements as Multi-Turn Question Answering

Extracting semantic information on measurements and counts is an importa...
research
04/21/2023

KitchenScale: Learning to predict ingredient quantities from recipe contexts

Determining proper quantities for ingredients is an essential part of co...

Please sign up or login with your details

Forgot password? Click here to reset