Egocentric Human-Object Interaction Detection Exploiting Synthetic Data

04/14/2022
by   Rosario Leonardi, et al.
9

We consider the problem of detecting Egocentric HumanObject Interactions (EHOIs) in industrial contexts. Since collecting and labeling large amounts of real images is challenging, we propose a pipeline and a tool to generate photo-realistic synthetic First Person Vision (FPV) images automatically labeled for EHOI detection in a specific industrial scenario. To tackle the problem of EHOI detection, we propose a method that detects the hands, the objects in the scene, and determines which objects are currently involved in an interaction. We compare the performance of our method with a set of state-of-the-art baselines. Results show that using a synthetic dataset improves the performance of an EHOI detection system, especially when few real data are available. To encourage research on this topic, we publicly release the proposed dataset at the following url: https://iplab.dmi.unict.it/EHOI_SYNTH/.

READ FULL TEXT

page 3

page 5

page 11

page 12

page 14

page 15

page 16

page 17

research
06/21/2023

Exploiting Multimodal Synthetic Data for Egocentric Human-Object Interaction Detection in an Industrial Scenario

In this paper, we tackle the problem of Egocentric Human-Object Interact...
research
04/14/2022

Panoptic Segmentation using Synthetic and Real Data

Being able to understand the relations between the user and the surround...
research
09/19/2022

MECCANO: A Multimodal Egocentric Dataset for Humans Behavior Understanding in the Industrial-like Domain

Wearable cameras allow to acquire images and videos from the user's pers...
research
10/12/2020

The MECCANO Dataset: Understanding Human-Object Interactions from Egocentric Videos in an Industrial-like Domain

Wearable cameras allow to collect images and videos of humans interactin...
research
04/14/2022

Weakly Supervised Attended Object Detection Using Gaze Data as Annotations

We consider the problem of detecting and recognizing the objects observe...
research
07/22/2022

Neural-Sim: Learning to Generate Training Data with NeRF

Training computer vision models usually requires collecting and labeling...
research
07/04/2018

Transfer Learning From Synthetic To Real Images Using Variational Autoencoders For Precise Position Detection

Capturing and labeling camera images in the real world is an expensive t...

Please sign up or login with your details

Forgot password? Click here to reset