Text Classification for Predicting Multi-level Product Categories

09/02/2021
by   Hadi Jahanshahi, et al.
0

In an online shopping platform, a detailed classification of the products facilitates user navigation. It also helps online retailers keep track of the price fluctuations in a certain industry or special discounts on a specific product category. Moreover, an automated classification system may help to pinpoint incorrect or subjective categories suggested by an operator. In this study, we focus on product title classification of the grocery products. We perform a comprehensive comparison of six different text classification models to establish a strong baseline for this task, which involves testing both traditional and recent machine learning methods. In our experiments, we investigate the generalizability of the trained models to the products of other online retailers, the dynamic masking of infeasible subcategories for pretrained language models, and the benefits of incorporating product titles in multiple languages. Our numerical results indicate that dynamic masking of subcategories is effective in improving prediction accuracy. In addition, we observe that using bilingual product titles is generally beneficial, and neural network-based models perform significantly better than SVM and XGBoost models. Lastly, we investigate the reasons for the misclassified products and propose future research directions to further enhance the prediction models.

READ FULL TEXT
research
03/01/2019

Large Scale Product Categorization using Structured and Unstructured Attributes

Product categorization using text data for eCommerce is a very challengi...
research
04/06/2020

Deep Learning Based Text Classification: A Comprehensive Review

Deep learning based models have surpassed classical machine learning bas...
research
12/14/2018

Don't Classify, Translate: Multi-Level E-Commerce Product Categorization Via Machine Translation

E-commerce platforms categorize their products into a multi-level taxono...
research
06/09/2016

e-Commerce product classification: our participation at cDiscount 2015 challenge

This report describes our participation in the cDiscount 2015 challenge ...
research
12/26/2019

Text Classification for Azerbaijani Language Using Machine Learning and Embedding

Text classification systems will help to solve the text clustering probl...
research
06/06/2019

Counterfactual Inference for Consumer Choice Across Many Product Categories

This paper proposes a method for estimating consumer preferences among d...
research
10/15/2021

Dropping diversity of products of large US firms: Models and measures

It is widely assumed that in our lifetimes the products available in the...

Please sign up or login with your details

Forgot password? Click here to reset