Domain invariant hierarchical embedding for grocery products recognition

02/02/2019
by Alessio Tonioni, et al.

Recognizing packaged grocery products based solely on appearance is still an open issue for modern computer vision systems due to peculiar challenges. First, the number of different items to be recognized is huge (i.e., on the order of thousands) and changes rapidly over time. Moreover, there exists a significant domain shift between the images to be recognized at test time, taken in stores by cheap cameras, and those available for training, usually just one or a few studio-quality images per product. We propose an end-to-end architecture comprising a GAN that addresses the domain shift at training time and a deep CNN, trained on the samples generated by the GAN, that learns an embedding of product images enforcing a hierarchy between product categories. At test time, we perform recognition by K-NN search against a database consisting of just one reference image per product. Experiments addressing recognition of products present in the training datasets, as well as different ones unseen at training time, show that our approach compares favourably to state-of-the-art methods on the grocery recognition task and generalizes fairly well to similar ones.
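
To illustrate the test-time step described above, here is a minimal sketch of K-NN search against a database holding one reference embedding per product. It is not the authors' implementation: the abstract does not state the distance metric, so cosine similarity is an assumption, and the names `knn_search`, `reference_db`, and the toy three-dimensional vectors are hypothetical stand-ins for embeddings that a trained CNN would produce.

```python
import math
from typing import Dict, List, Sequence, Tuple


def l2_normalize(vec: Sequence[float]) -> List[float]:
    """Scale a vector to unit length so that a dot product equals cosine similarity."""
    norm = math.sqrt(sum(x * x for x in vec))
    if norm == 0.0:
        raise ValueError("cannot normalize a zero vector")
    return [x / norm for x in vec]


def knn_search(
    query: Sequence[float],
    references: Dict[str, Sequence[float]],
    k: int = 1,
) -> List[Tuple[str, float]]:
    """Return the k reference products most similar to the query embedding.

    Assumption: similarity is cosine similarity; the source abstract
    does not specify which metric is used.
    """
    q = l2_normalize(query)
    scored = []
    for product_id, embedding in references.items():
        r = l2_normalize(embedding)
        scored.append((product_id, sum(a * b for a, b in zip(q, r))))
    # Highest similarity first.
    scored.sort(key=lambda item: item[1], reverse=True)
    return scored[:k]


# Hypothetical database: exactly one reference embedding per product.
reference_db = {
    "cereal_a": [0.9, 0.1, 0.0],
    "cereal_b": [0.7, 0.7, 0.0],
    "soda_c": [0.0, 0.1, 0.9],
}

# Hypothetical embedding of an in-store query image.
query_embedding = [0.8, 0.2, 0.1]
top_matches = knn_search(query_embedding, reference_db, k=2)
for product_id, score in top_matches:
    print(product_id, round(score, 3))
```

Because the database stores a single embedding per product, adding a product unseen at training time only requires inserting one more entry into `reference_db`; no retraining is implied by this sketch. An exhaustive scan like this is linear in the number of products, which is adequate for a toy example but would likely be replaced by an approximate nearest-neighbour index at the scale of thousands of items.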

research
10/03/2018

A deep learning pipeline for product recognition in store shelves

Recognition of grocery products in store shelves poses peculiar challeng...
research
10/03/2018

A deep learning pipeline for product recognition on store shelves

Recognition of grocery products in store shelves poses peculiar challeng...
research
07/23/2020

Towards Recognizing Unseen Categories in Unseen Domains

Current deep visual recognition systems suffer from severe performance d...
research
07/13/2021

eProduct: A Million-Scale Visual Search Benchmark to Address Product Recognition Challenges

Large-scale product recognition is one of the major applications of comp...
research
06/20/2022

Test Time Transform Prediction for Open Set Histopathological Image Recognition

Tissue typology annotation in Whole Slide histological images is a compl...
research
02/24/2017

Changing Model Behavior at Test-Time Using Reinforcement Learning

Machine learning models are often used at test-time subject to constrain...
research
04/23/2022

VISTA: Vision Transformer enhanced by U-Net and Image Colorfulness Frame Filtration for Automatic Retail Checkout

Multi-class product counting and recognition identifies product items fr...
