Synthetic to Real Unsupervised Domain Adaptation for Single-Stage Artwork Recognition in Cultural Sites

08/04/2020
by   Giovanni Pasqualino, et al.
1

Recognizing artworks in a cultural site using images acquired from the user's point of view (First Person Vision) allows to build interesting applications for both the visitors and the site managers. However, current object detection algorithms working in fully supervised settings need to be trained with large quantities of labeled data, whose collection requires a lot of times and high costs in order to achieve good performance. Using synthetic data generated from the 3D model of the cultural site to train the algorithms can reduce these costs. On the other hand, when these models are tested with real images, a significant drop in performance is observed due to the differences between real and synthetic images. In this study we consider the problem of Unsupervised Domain Adaptation for object detection in cultural sites. To address this problem, we created a new dataset containing both synthetic and real images of 16 different artworks. We hence investigated different domain adaptation techniques based on one-stage and two-stage object detector, image-to-image translation and feature alignment. Based on the observation that single-stage detectors are more robust to the domain shift in the considered settings, we proposed a new method based on RetinaNet and feature alignment that we called DA-RetinaNet. The proposed approach achieves better results than compared methods. To support research in this field we release the dataset at the following link https://iplab.dmi.unict.it/EGO-CH-OBJ-UDA/ and the code of the proposed architecture at https://github.com/fpv-iplab/DA-RetinaNet.

READ FULL TEXT

page 3

page 7

page 9

page 11

page 13

page 15

page 16

research
12/15/2020

Unsupervised Domain Adaptation from Synthetic to Real Images for Anchorless Object Detection

Synthetic images are one of the most promising solutions to avoid high c...
research
02/03/2020

Single-Stage Object Detection from Top-View Grid Maps on Custom Sensor Setups

We present our approach to unsupervised domain adaptation for single-sta...
research
08/29/2023

Detect, Augment, Compose, and Adapt: Four Steps for Unsupervised Domain Adaptation in Object Detection

Unsupervised domain adaptation (UDA) plays a crucial role in object dete...
research
04/10/2019

Egocentric Visitors Localization in Cultural Sites

We consider the problem of localizing visitors in a cultural site from e...
research
02/03/2020

EGO-CH: Dataset and Fundamental Tasks for Visitors BehavioralUnderstanding using Egocentric Vision

Equipping visitors of a cultural site with a wearable device allows to e...
research
09/26/2014

Location Recognition Over Large Time Lags

Would it be possible to automatically associate ancient pictures to mode...

Please sign up or login with your details

Forgot password? Click here to reset