Multi-label affordance mapping from egocentric vision

09/05/2023
by   Lorenzo Mur-Labadia, et al.
0

Accurate affordance detection and segmentation with pixel precision is an important piece in many complex systems based on interactions, such as robots and assitive devices. We present a new approach to affordance perception which enables accurate multi-label segmentation. Our approach can be used to automatically extract grounded affordances from first person videos of interactions using a 3D map of the environment providing pixel level precision for the affordance location. We use this method to build the largest and most complete dataset on affordances based on the EPIC-Kitchen dataset, EPIC-Aff, which provides interaction-grounded, multi-label, metric and spatial affordance annotations. Then, we propose a new approach to affordance segmentation based on multi-label detection which enables multiple affordances to co-exists in the same space, for example if they are associated with the same object. We present several strategies of multi-label detection using several segmentation architectures. The experimental results highlight the importance of the multi-label detection. Finally, we show how our metric representation can be exploited for build a map of interaction hotspots in spatial action-centric zones and use that representation to perform a task-oriented navigation.

READ FULL TEXT
research
10/27/2017

Similarity-based Multi-label Learning

Multi-label classification is an important learning problem with many ap...
research
03/24/2017

Improving Classification by Improving Labelling: Introducing Probabilistic Multi-Label Object Interaction Recognition

This work deviates from easy-to-define class boundaries for object inter...
research
06/09/2018

Autoencoders for Multi-Label Prostate MR Segmentation

Organ image segmentation can be improved by implementing prior knowledge...
research
10/20/2022

PointTAD: Multi-Label Temporal Action Detection with Learnable Query Points

Traditional temporal action detection (TAD) usually handles untrimmed vi...
research
01/03/2018

Joint Optic Disc and Cup Segmentation Based on Multi-label Deep Network and Polar Transformation

Glaucoma is a chronic eye disease that leads to irreversible vision loss...
research
08/17/2023

Eosinophils Instance Object Segmentation on Whole Slide Imaging Using Multi-label Circle Representation

Eosinophilic esophagitis (EoE) is a chronic and relapsing disease charac...
research
01/01/2020

Residual Block-based Multi-Label Classification and Localization Network with Integral Regression for Vertebrae Labeling

Accurate identification and localization of the vertebrae in CT scans is...

Please sign up or login with your details

Forgot password? Click here to reset