
-
Is First Person Vision Challenging for Object Tracking? The TREK-100 Benchmark Dataset
Understanding human-object interactions is fundamental in First Person V...
read it
-
On Embodied Visual Navigation in Real Environments Through Habitat
Visual navigation models based on deep learning can learn effective poli...
read it
-
The MECCANO Dataset: Understanding Human-Object Interactions from Egocentric Videos in an Industrial-like Domain
Wearable cameras allow to collect images and videos of humans interactin...
read it
-
Synthetic to Real Unsupervised Domain Adaptation for Single-Stage Artwork Recognition in Cultural Sites
Recognizing artworks in a cultural site using images acquired from the u...
read it
-
SceneAdapt: Scene-based domain adaptation for semantic segmentation using adversarial learning
Semantic segmentation methods have achieved outstanding performance than...
read it
-
Rolling-Unrolling LSTMs for Action Anticipation from First-Person Video
In this paper, we tackle the problem of egocentric action anticipation, ...
read it
-
Knowledge Distillation for Action Anticipation via Label Smoothing
Human capability to anticipate near future from visual observations and ...
read it
-
EGO-CH: Dataset and Fundamental Tasks for Visitors BehavioralUnderstanding using Egocentric Vision
Equipping visitors of a cultural site with a wearable device allows to e...
read it
-
What Would You Expect? Anticipating Egocentric Actions with Rolling-Unrolling LSTMs and Modality Attention
Egocentric action anticipation consists in understanding which objects t...
read it
-
Egocentric Visitors Localization in Cultural Sites
We consider the problem of localizing visitors in a cultural site from e...
read it
-
Next-Active-Object prediction from Egocentric Videos
Although First Person Vision systems can sense the environment from the ...
read it
-
Scaling Egocentric Vision: The EPIC-KITCHENS Dataset
First-person vision is gaining interest as it offers a unique viewpoint ...
read it