Large Scale Product Categorization using Structured and Unstructured Attributes

03/01/2019
by   Abhinandan Krishnan, et al.
0

Product categorization using text data for eCommerce is a very challenging extreme classification problem with several thousands of classes and several millions of products to classify. Even though multi-class text classification is a well studied problem both in academia and industry, most approaches either deal with treating product content as a single pile of text, or only consider a few product attributes for modelling purposes. Given the variety of products sold on popular eCommerce platforms, it is hard to consider all available product attributes as part of the modeling exercise, considering that products possess their own unique set of attributes based on category. In this paper, we compare hierarchical models to flat models and show that in specific cases, flat models perform better. We explore two Deep Learning based models that extract features from individual pieces of unstructured data from each product and then combine them to create a product signature. We also propose a novel idea of using structured attributes and their values together in an unstructured fashion along with convolutional filters such that the ordering of the attributes and the differing attributes by product categories no longer becomes a modelling challenge. This approach is also more robust to the presence of faulty product attribute names and values and can elegantly generalize to use both closed list and open list attributes.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
09/26/2016

Visual Fashion-Product Search at SK Planet

We build a large-scale visual search system which finds similar product ...
research
09/02/2021

Text Classification for Predicting Multi-level Product Categories

In an online shopping platform, a detailed classification of the product...
research
01/31/2020

Scalable bundling via dense product embeddings

Bundling, the practice of jointly selling two or more products at a disc...
research
07/15/2019

Ranking sentences from product description & bullets for better search

Products in an ecommerce catalog contain information-rich fields like de...
research
02/23/2023

Automated Extraction of Fine-Grained Standardized Product Information from Unstructured Multilingual Web Data

Extracting structured information from unstructured data is one of the k...
research
06/24/2020

AutoKnow: Self-Driving Knowledge Collection for Products of Thousands of Types

Can one build a knowledge graph (KG) for all products in the world? Know...
research
06/09/2016

e-Commerce product classification: our participation at cDiscount 2015 challenge

This report describes our participation in the cDiscount 2015 challenge ...

Please sign up or login with your details

Forgot password? Click here to reset