Product Information Extraction using ChatGPT

06/23/2023
by   Alexander Brinkmann, et al.
0

Structured product data in the form of attribute/value pairs is the foundation of many e-commerce applications such as faceted product search, product comparison, and product recommendation. Product offers often only contain textual descriptions of the product attributes in the form of titles or free text. Hence, extracting attribute/value pairs from textual product descriptions is an essential enabler for e-commerce applications. In order to excel, state-of-the-art product information extraction methods require large quantities of task-specific training data. The methods also struggle with generalizing to out-of-distribution attributes and attribute values that were not a part of the training data. Due to being pre-trained on huge amounts of text as well as due to emergent effects resulting from the model size, Large Language Models like ChatGPT have the potential to address both of these shortcomings. This paper explores the potential of ChatGPT for extracting attribute/value pairs from product descriptions. We experiment with different zero-shot and few-shot prompt designs. Our results show that ChatGPT achieves a performance similar to a pre-trained language model but requires much smaller amounts of training data and computation for fine-tuning.

READ FULL TEXT

page 3

page 4

research
06/01/2023

Large Scale Generative Multimodal Attribute Extraction for E-commerce Attributes

E-commerce websites (e.g. Amazon) have a plethora of structured and unst...
research
02/23/2023

Automated Extraction of Fine-Grained Standardized Product Information from Unstructured Multilingual Web Data

Extracting structured information from unstructured data is one of the k...
research
05/02/2023

DreamPaint: Few-Shot Inpainting of E-Commerce Items for Virtual Try-On without 3D Modeling

We introduce DreamPaint, a framework to intelligently inpaint any e-comm...
research
04/29/2022

OA-Mine: Open-World Attribute Mining for E-Commerce Products with Weak Supervision

Automatic extraction of product attributes from their textual descriptio...
research
12/21/2022

ImPaKT: A Dataset for Open-Schema Knowledge Base Construction

Large language models have ushered in a golden age of semantic parsing. ...
research
06/28/2022

Adaptive Multi-view Rule Discovery for Weakly-Supervised Compatible Products Prediction

On e-commerce platforms, predicting if two products are compatible with ...
research
04/28/2023

Hedonic Prices and Quality Adjusted Price Indices Powered by AI

Accurate, real-time measurements of price index changes using electronic...

Please sign up or login with your details

Forgot password? Click here to reset