A Multi Camera Unsupervised Domain Adaptation Pipeline for Object Detection in Cultural Sites through Adversarial Learning and Self-Training

10/03/2022
by   Giovanni Pasqualino, et al.
7

Object detection algorithms allow to enable many interesting applications which can be implemented in different devices, such as smartphones and wearable devices. In the context of a cultural site, implementing these algorithms in a wearable device, such as a pair of smart glasses, allow to enable the use of augmented reality (AR) to show extra information about the artworks and enrich the visitors' experience during their tour. However, object detection algorithms require to be trained on many well annotated examples to achieve reasonable results. This brings a major limitation since the annotation process requires human supervision which makes it expensive in terms of time and costs. A possible solution to reduce these costs consist in exploiting tools to automatically generate synthetic labeled images from a 3D model of the site. However, models trained with synthetic data do not generalize on real images acquired in the target scenario in which they are supposed to be used. Furthermore, object detectors should be able to work with different wearable devices or different mobile devices, which makes generalization even harder. In this paper, we present a new dataset collected in a cultural site to study the problem of domain adaptation for object detection in the presence of multiple unlabeled target domains corresponding to different cameras and a labeled source domain obtained considering synthetic images for training purposes. We present a new domain adaptation method which outperforms current state-of-the-art approaches combining the benefits of aligning the domains at the feature and pixel level with a self-training process. We release the dataset at the following link https://iplab.dmi.unict.it/OBJ-MDA/ and the code of the proposed architecture at https://github.com/fpv-iplab/STMDA-RetinaNet.

READ FULL TEXT

page 3

page 7

page 10

page 11

page 18

research
08/04/2020

Synthetic to Real Unsupervised Domain Adaptation for Single-Stage Artwork Recognition in Cultural Sites

Recognizing artworks in a cultural site using images acquired from the u...
research
12/15/2020

Unsupervised Domain Adaptation from Synthetic to Real Images for Anchorless Object Detection

Synthetic images are one of the most promising solutions to avoid high c...
research
03/09/2021

ST3D: Self-training for Unsupervised Domain Adaptation on 3D ObjectDetection

We present a new domain adaptive self-training pipeline, named ST3D, for...
research
02/03/2020

EGO-CH: Dataset and Fundamental Tasks for Visitors BehavioralUnderstanding using Egocentric Vision

Equipping visitors of a cultural site with a wearable device allows to e...
research
01/12/2023

1st Place Solution for ECCV 2022 OOD-CV Challenge Object Detection Track

OOD-CV challenge is an out-of-distribution generalization task. To solve...
research
04/10/2019

Egocentric Visitors Localization in Cultural Sites

We consider the problem of localizing visitors in a cultural site from e...
research
09/26/2014

Location Recognition Over Large Time Lags

Would it be possible to automatically associate ancient pictures to mode...

Please sign up or login with your details

Forgot password? Click here to reset