Semantic Embedded Deep Neural Network: A Generic Approach to Boost Multi-Label Image Classification Performance

05/09/2023
by   Xin Shen, et al.
0

Fine-grained multi-label classification models have broad applications in Amazon production features, such as visual based label predictions ranging from fashion attribute detection to brand recognition. One challenge to achieve satisfactory performance for those classification tasks in real world is the wild visual background signal that contains irrelevant pixels which confuses model to focus onto the region of interest and make prediction upon the specific region. In this paper, we introduce a generic semantic-embedding deep neural network to apply the spatial awareness semantic feature incorporating a channel-wise attention based model to leverage the localization guidance to boost model performance for multi-label prediction. We observed an Avg.relative improvement of 15.27 baseline approach. Core experiment and ablation studies involve multi-label fashion attribute classification performed on Instagram fashion apparels' image. We compared the model performances among our approach, baseline approach, and 3 alternative approaches to leverage semantic features. Results show favorable performance for our approach.

READ FULL TEXT

page 1

page 2

page 4

research
05/07/2023

Data Efficient Training with Imbalanced Label Sample Distribution for Fashion Detection

Multi-label classification models have a wide range of applications in E...
research
11/20/2018

A Baseline for Multi-Label Image Classification Using Ensemble Deep CNN

Recent studies on multi-label image classification have been focusing on...
research
06/13/2019

The iMaterialist Fashion Attribute Dataset

Large-scale image databases such as ImageNet have significantly advanced...
research
06/07/2020

Thoracic Disease Identification and Localization using Distance Learning and Region Verification

The identification and localization of diseases in medical images using ...
research
02/07/2018

Classification of Things in DBpedia using Deep Neural Networks

The Semantic Web aims at representing knowledge about the real world at ...
research
02/18/2021

FrugalMCT: Efficient Online ML API Selection for Multi-Label Classification Tasks

Multi-label classification tasks such as OCR and multi-object recognitio...
research
11/12/2019

Pose Guided Attention for Multi-label Fashion Image Classification

We propose a compact framework with guided attention for multi-label cla...

Please sign up or login with your details

Forgot password? Click here to reset