Multi-Label Product Categorization Using Multi-Modal Fusion Models

06/30/2019
by   Pasawee Wirojwatanakul, et al.
0

In this study, we investigated multi-modal approaches using images, descriptions, and title to categorize e-commerce products on Amazon.com. Specifically, we examined late fusion models, where the modalities are fused at the decision level. Products were each assigned multiple labels, and the hierarchy in the labels were flattened and filtered. For our individual baseline models, we modified a CNN architecture to classify the description and title, and then modified Keras' ResNet-50 to classify the images, achieving F1 scores of 77.0 late fusion model can classify products more accurately than single modal models can, improving the F1 score to 88.2 shortcomings of the other modalities, demonstrating that increasing the number of modalities can be an effective method for improving the accuracy of multi-label classification problems.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
07/07/2022

Multimodal E-Commerce Product Classification Using Hierarchical Fusion

In this work, we present a multi-modal model for commercial product clas...
research
11/29/2016

Is a picture worth a thousand words? A Deep Multi-Modal Fusion Architecture for Product Classification in e-commerce

Classifying products into categories precisely and efficiently is a majo...
research
06/02/2023

Transformer-based Multi-Modal Learning for Multi Label Remote Sensing Image Classification

In this paper, we introduce a novel Synchronized Class Token Fusion (SCT...
research
08/14/2020

A Multimodal Late Fusion Model for E-Commerce Product Classification

The cataloging of product listings is a fundamental problem for most e-c...
research
09/10/2023

Multi-modal Extreme Classification

This paper develops the MUFIN technique for extreme classification (XC) ...
research
04/17/2021

Semi-Supervised Multi-Modal Multi-Instance Multi-Label Deep Network with Optimal Transport

Complex objects are usually with multiple labels, and can be represented...
research
05/06/2023

HateMM: A Multi-Modal Dataset for Hate Video Classification

Hate speech has become one of the most significant issues in modern soci...

Please sign up or login with your details

Forgot password? Click here to reset