Main Product Detection with Graph Networks for Fashion

01/25/2022
by   Vacit Oguz Yazici, et al.
0

Computer vision has established a foothold in the online fashion retail industry. Main product detection is a crucial step of vision-based fashion product feed parsing pipelines, focused in identifying the bounding boxes that contain the product being sold in the gallery of images of the product page. The current state-of-the-art approach does not leverage the relations between regions in the image, and treats images of the same product independently, therefore not fully exploiting visual and product contextual information. In this paper we propose a model that incorporates Graph Convolutional Networks (GCN) that jointly represent all detected bounding boxes in the gallery as nodes. We show that the proposed method is better than the state-of-the-art, especially, when we consider the scenario where title-input is missing at inference time and for cross-dataset evaluation, our method outperforms previous approaches by a large margin.

READ FULL TEXT

page 9

page 15

research
05/29/2023

Fashion Object Detection for Tops Bottoms

Fashion is one of the largest world's industries and computer vision tec...
research
08/10/2016

Fashion Landmark Detection in the Wild

Visual fashion analysis has attracted many attentions in the recent year...
research
12/04/2019

Better Understanding Hierarchical Visual Relationship for Image Caption

The Convolutional Neural Network (CNN) has been the dominant image featu...
research
05/22/2018

Learning Markov Clustering Networks for Scene Text Detection

A novel framework named Markov Clustering Network (MCN) is proposed for ...
research
05/23/2020

RAPiD: Rotation-Aware People Detection in Overhead Fisheye Images

Recent methods for people detection in overhead, fisheye images either u...
research
10/13/2020

Supertagging Combinatory Categorial Grammar with Attentive Graph Convolutional Networks

Supertagging is conventionally regarded as an important task for combina...
research
05/19/2022

Training Vision-Language Transformers from Captions Alone

We show that Vision-Language Transformers can be learned without human l...

Please sign up or login with your details

Forgot password? Click here to reset