
-
Supervision Levels Scale (SLS)
We propose a three-dimensional discrete and incremental scale to encode ...
read it
-
Fine-Grained Action Retrieval Through Multiple Parts-of-Speech Embeddings
We address the problem of cross-modal fine-grained action retrieval betw...
read it
-
Learning Visual Actions Using Multiple Verb-Only Labels
This work introduces verb-only representations for both recognition and ...
read it
-
Towards an Unequivocal Representation of Actions
This work introduces verb-only representations for actions and interacti...
read it
-
Scaling Egocentric Vision: The EPIC-KITCHENS Dataset
First-person vision is gaining interest as it offers a unique viewpoint ...
read it
-
Trespassing the Boundaries: Labeling Temporal Bounds for Object Interactions in Egocentric Video
Manual annotations of temporal bounds for object interactions (i.e. star...
read it
-
Improving Classification by Improving Labelling: Introducing Probabilistic Multi-Label Object Interaction Recognition
This work deviates from easy-to-define class boundaries for object inter...
read it
-
SEMBED: Semantic Embedding of Egocentric Action Videos
We present SEMBED, an approach for embedding an egocentric object intera...
read it