Towards Open-World Product Attribute Mining: A Lightly-Supervised Approach

05/26/2023
by   Liyan Xu, et al.
0

We present a new task setting for attribute mining on e-commerce products, serving as a practical solution to extract open-world attributes without extensive human intervention. Our supervision comes from a high-quality seed attribute set bootstrapped from existing resources, and we aim to expand the attribute vocabulary of existing seed types, and also to discover any new attribute types automatically. A new dataset is created to support our setting, and our approach Amacer is proposed specifically to tackle the limited supervision. Especially, given that no direct supervision is available for those unseen new attributes, our novel formulation exploits self-supervised heuristic and unsupervised latent attributes, which attains implicit semantic signals as additional supervision by leveraging product context. Experiments suggest that our approach surpasses various baselines by 12 F1, expanding attributes of existing types significantly by up to 12 times, and discovering values from 39

READ FULL TEXT

page 1

page 2

page 3

page 4

research
04/29/2022

OA-Mine: Open-World Attribute Mining for E-Commerce Products with Weak Supervision

Automatic extraction of product attributes from their textual descriptio...
research
09/12/2023

SAGE: Structured Attribute Value Generation for Billion-Scale Product Catalogs

We introduce SAGE; a Generative LLM for inferring attribute values for p...
research
04/19/2021

LaTeX-Numeric: Language-agnostic Text attribute eXtraction for E-commerce Numeric Attributes

In this paper, we present LaTeX-Numeric - a high-precision fully-automat...
research
10/28/2015

Flexibly Mining Better Subgroups

In subgroup discovery, also known as supervised pattern mining, discover...
research
06/12/2021

Scalable Approach for Normalizing E-commerce Text Attributes (SANTA)

In this paper, we present SANTA, a scalable framework to automatically n...
research
08/13/2019

Getting To Know You: User Attribute Extraction from Dialogues

User attributes provide rich and useful information for user understandi...
research
08/29/2022

SemanticAxis: Exploring Multi-attribute Data by Semantics Construction and Ranking Analysis

Mining the distribution of features and sorting items by combined attrib...

Please sign up or login with your details

Forgot password? Click here to reset