Producing Usable Taxonomies Cheaply and Rapidly at Pinterest Using Discovered Dynamic μ-Topics

01/29/2023
by Abhijit Mahabal, et al.

Creating a taxonomy of interests is expensive and labor-intensive: not only must we identify nodes and interconnect them, but to use the taxonomy we must also connect the nodes to relevant entities such as users, pins, and queries. Connecting to entities is challenging both because of ambiguities inherent to language and because individual interests are dynamic and evolve. Here, we offer an alternative approach that begins with bottom-up discovery of μ-topics called pincepts. The discovery process itself connects these μ-topics dynamically with relevant queries, pins, and users at high precision, automatically adapting to shifting interests. Pincepts cover all areas of user interest and automatically adjust to the specificity of those interests, and are thus suitable for creating various kinds of taxonomies. Human experts associate taxonomy nodes with μ-topics (on average, 3 μ-topics per node), and the μ-topics offer a high-level data layer that allows quick definition, immediate inspection, and easy modification. Even more powerfully, μ-topics allow easy exploration of nearby semantic space, enabling curators to spot and fill gaps. Because the approach leans heavily on curators' domain knowledge, we do not need untrained Mechanical Turk workers, which further reduces cost. These μ-topics thus offer a satisfactory "symbolic" stratum over which to define taxonomies. We have successfully applied this technique to very rapidly iterate on and launch the home decor and fashion styles taxonomy for style-based personalization, prominently featured at the top of Pinterest search results, at 94 long clicks and pin saves.
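The idea of defining a taxonomy node as a handful of μ-topics, and inheriting entity links from them, can be illustrated with a minimal sketch. Everything here is an assumption made for illustration: the class names `MicroTopic` and `TaxonomyNode`, the `resolve` helper, and the sample topics, pins, and queries are hypothetical and are not taken from the paper or from any Pinterest system.

```python
from dataclasses import dataclass, field
from typing import Dict, List, Set


@dataclass
class MicroTopic:
    """Hypothetical stand-in for a discovered μ-topic (pincept).

    In the abstract, discovery links each μ-topic to pins, queries, and
    users; here those links are just hand-written sets of string ids.
    """
    pins: Set[str] = field(default_factory=set)
    queries: Set[str] = field(default_factory=set)
    users: Set[str] = field(default_factory=set)


@dataclass
class TaxonomyNode:
    """A curator-defined node: a label plus the names of a few μ-topics."""
    label: str
    topic_names: List[str]


def resolve(node: TaxonomyNode, topics: Dict[str, MicroTopic], kind: str) -> Set[str]:
    """Return the union of one entity kind over the node's μ-topics."""
    if kind not in ("pins", "queries", "users"):
        raise ValueError(f"unknown entity kind: {kind}")
    result: Set[str] = set()
    for name in node.topic_names:
        topic = topics.get(name)  # skip names with no discovered topic
        if topic is not None:
            result |= getattr(topic, kind)
    return result


# Made-up sample data: three μ-topics backing one style node.
topics = {
    "boho living room": MicroTopic(
        pins={"pin1", "pin2"}, queries={"boho living room ideas"}, users={"u1"}
    ),
    "bohemian decor": MicroTopic(
        pins={"pin2", "pin3"}, queries={"bohemian decor"}, users={"u1", "u2"}
    ),
    "macrame wall hanging": MicroTopic(
        pins={"pin4"}, queries={"macrame wall hanging diy"}, users={"u3"}
    ),
}
bohemian = TaxonomyNode("Bohemian", list(topics))

bohemian_pins = resolve(bohemian, topics, "pins")
bohemian_users = resolve(bohemian, topics, "users")
```

In this sketch the node stores no entity links of its own, so re-running discovery and refreshing the `topics` mapping would update what the node resolves to without any curator edits. That mirrors, in a very simplified way, the abstract's claim that μ-topics adapt automatically to shifting interests.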

research
04/26/2020

Parallel Taxonomy Discovery

Recommender systems aim to personalize the shopping experience of a user...
research
09/07/2018

Term-Mouse-Fixations as an Additional Indicator for Topical User Interests in Domain-Specific Search

Models in Interactive Information Retrieval (IIR) are grounded very much...
research
10/26/2017

Klout Topics for Modeling Interests and Expertise of Users Across Social Networks

This paper presents Klout Topics, a lightweight ontology to describe soc...
research
08/22/2019

Argument Invention from First Principles

Competitive debaters often find themselves facing a challenging task -- ...
research
05/06/2023

Science and Technology Ontology: A Taxonomy of Emerging Topics

Ontologies play a critical role in Semantic Web technologies by providin...
research
05/19/2022

GitRanking: A Ranking of GitHub Topics for Software Classification using Active Sampling

GitHub is the world's largest host of source code, with more than 150M r...
research
03/24/2021

Ontology-Based Recommendation of Editorial Products

Major academic publishers need to be able to analyse their vast catalogu...
