We present EgoHumans, a new multi-view multi-human video benchmark to ad...
We propose a method for in-hand 3D scanning of an unknown object from a
...
We propose a method for estimating the 6DoF pose of a rigid object with ...
Multi-person pose understanding from RGB videos includes three complex t...
Learning geometry, motion, and appearance priors of object classes is
im...
This paper proposes a do-it-all neural model of human hands, named LISA....
Neural shape models can represent complex 3D shapes with a compact laten...
Given a video captured from a first person perspective and recorded in a...
Object-oriented maps are important for scene understanding since they jo...
We introduce Replica, a dataset of 18 highly photo-realistic 3D indoor s...
Dense pixelwise prediction such as semantic segmentation is an up-to-dat...
Visual scene understanding is an important capability that enables robot...
Creating 3D maps on robots and other mobile devices has become a reality...