The ability to automatically detect and track surgical instruments in en...
The Multiplane Image (MPI), containing a set of fronto-parallel RGBA lay...
Object affordance is an important concept in hand-object interaction, pr...
Automated video-based assessment of surgical skills is a promising task ...
We introduce a scalable framework for novel view synthesis from RGB-D im...
Few-shot action recognition aims to recognize novel action classes using...
Object affordance is an important concept in human-object interaction, p...
We study the problem of identifying object instances in a dynamic enviro...
In this survey, we present a comprehensive analysis of 3D hand pose estima...
We aim to improve the performance of regressing hand keypoints and segme...
Detecting the positions of human hands and objects-in-contact (hand-obje...
First-person action recognition is a challenging task in video understan...
Human gaze is cost-efficient physiological data that reveals human...
Every hand-object interaction begins with contact. Despite predicting th...
The attribution method provides a direction for interpreting opaque neur...
Hand segmentation is a crucial task in first-person vision. Since first-...
In this report, we describe the technical details of our submission to t...
People spend an enormous amount of time and effort looking for lost obje...
Identifying and visualizing regions that are significant for a given dee...
Recent advances in computer vision have made it possible to automaticall...
In this work, we address two coupled tasks of gaze prediction and action...
This paper proposes a novel method for understanding daily hand-object m...
We present a new computational model for gaze prediction in egocentric v...
We present a new task that predicts future locations of people observed ...
We propose a new multi-frame method for efficiently computing scene flow...
Describing the color and textural information of a person image is one o...
We propose a privacy-preserving framework for learning visual classifier...
We envision a future time when wearable cameras are worn by the masses a...
We present an accurate stereo matching method using local expansion move...