Octet: Online Catalog Taxonomy Enrichment with Self-Supervision

06/18/2020
by Yuning Mao, et al.

Taxonomies are widely used in many domains, especially online for item categorization, browsing, and search. Despite the prevalence of online catalog taxonomies, most of them are in practice maintained by humans, which is labor-intensive and difficult to scale. While taxonomy construction from scratch has been studied extensively in the literature, how to effectively enrich existing incomplete taxonomies remains an open yet important research question. Taxonomy enrichment requires not only robustness to emerging terms but also consistency between the existing taxonomy structure and the attachment of new terms. In this paper, we present a self-supervised end-to-end framework, Octet, for Online Catalog Taxonomy EnrichmenT. Octet leverages heterogeneous information unique to online catalog taxonomies, such as user queries, items, and their relations to the taxonomy nodes, while requiring no supervision other than the existing taxonomies. We propose to distantly train a sequence labeling model for term extraction and to employ graph neural networks (GNNs) to capture the taxonomy structure as well as the query-item-taxonomy interactions for term attachment. Extensive experiments in different online domains demonstrate the superiority of Octet over state-of-the-art methods in both automatic and human evaluations. Notably, Octet enriches an online catalog taxonomy in production to twice its original size in the open-world evaluation.
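
To make the idea of distant training for term extraction concrete, here is a minimal sketch of how training labels for a sequence labeling model might be produced without manual annotation. It matches terms already present in a taxonomy against item titles and emits BIO tags. The abstract does not say how Octet generates its labels, so the longest-match strategy, the function name `distant_bio_labels`, and the toy data are all assumptions made for illustration, not the authors' method.

```python
from typing import List, Set, Tuple


def distant_bio_labels(tokens: List[str],
                       known_terms: Set[Tuple[str, ...]]) -> List[str]:
    """Tag tokens with BIO labels by longest-match lookup against known terms.

    Hypothetical helper: `known_terms` holds lowercased, tokenized terms that
    already exist in the taxonomy and act as the only source of supervision.
    """
    labels = ["O"] * len(tokens)
    lowered = [t.lower() for t in tokens]
    max_len = max((len(term) for term in known_terms), default=0)

    i = 0
    while i < len(tokens):
        matched = 0
        # Try the longest candidate span first so "green tea" beats "tea".
        for n in range(min(max_len, len(tokens) - i), 0, -1):
            if tuple(lowered[i:i + n]) in known_terms:
                matched = n
                break
        if matched:
            labels[i] = "B"
            for j in range(i + 1, i + matched):
                labels[j] = "I"
            i += matched
        else:
            i += 1
    return labels


# Toy example (made-up data): existing taxonomy terms and one item title.
terms = {("green", "tea"), ("tea",), ("coffee",)}
title = "Organic Green Tea and Coffee Sampler".split()
labels = distant_bio_labels(title, terms)
for token, label in zip(title, labels):
    print(f"{token}\t{label}")
```

Labels produced this way are noisy, since a known term may be absent from the taxonomy or used in a different sense in a title. In a pipeline of this kind, the tagged titles would serve as training data for a sequence labeling model that is then expected to generalize to terms not yet in the taxonomy.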


