We present EgoHumans, a new multi-view multi-human video benchmark to ad...
One of the recent trends in vision problems is to use natural language
c...
We present a physics-based humanoid controller that achieves high-fideli...
Multi-modal trajectory forecasting methods commonly evaluate using
singl...
We tackle the task of text-to-3D creation with pre-trained latent-based ...
Motion-based association for Multi-Object Tracking (MOT) has recently
re...
Indoor localization systems often fuse inertial odometry with map inform...
We tackle the task of NeRF inversion for style-based neural radiance fie...
In this work, we propose a novel paradigm to encode the position of targ...
While recent camera-only 3D detection methods leverage multiple timestep...
Multi-Object Tracking (MOT) has rapidly progressed with the development ...
Top-down methods for monocular human mesh recovery have two stages: (1)
...
A typical pipeline for multi-object tracking (MOT) is to use a detector ...
We tackle the problem of domain adaptation in object detection, where th...
Humans typically perceive the establishment of an action in a video thro...
LiDAR sensors can be used to obtain a wide range of measurement signals ...
Point clouds and RGB images are naturally complementary modalities for 3...
We address the problem of estimating the 3D pose of a network of cameras...
An important step in the task of neural network design, such as
hyper-pa...
Various methods for solving the inverse reinforcement learning (IRL) pro...
Accurate smartphone localization (< 1-meter error) for indoor navigation...
Many smartphone applications use inertial measurement units (IMUs) to se...
3D multi-object tracking is an important component in robotic perception...
DETR is a recently proposed Transformer-based method which views object
...
The ability to both recognize and discover terrain characteristics is an...
We consider the few-shot classification task with an unbalanced dataset,...
3D Multi-object tracking (MOT) is crucial to autonomous systems. Recent ...
3D multi-object tracking (MOT) is essential to applications such as
auto...
Diabetic Retinopathy (DR) is a leading cause of blindness in working age...
Object detection and data association are critical components in multi-o...
We aim to enable robots to visually localize a target person through the...
3D Multi-object tracking (MOT) is crucial to autonomous systems. Recent ...
Blind or no-reference image quality assessment (NR-IQA) is a fundamental...
Predicting the future is a crucial first step to effective control, sinc...
Predicting the future is a crucial first step to effective control, sinc...
We consider the task of re-calibrating the 3D pose of a static surveilla...
This work addresses the task of open world semantic segmentation using R...
The ability to forecast a set of likely yet diverse possible future beha...
3D multi-object tracking (MOT) is an essential component technology for ...
We propose the use of a proportional-derivative (PD) control based polic...
We focus on the word-level visual lipreading, which requires recognizing...
For autonomous vehicles (AVs) to behave appropriately on roads populated...
Monocular 3D scene understanding tasks, such as object size estimation,
...
In crowded social scenarios with a myriad of external stimuli, human bra...
We focus on the problem of estimating the orientation of the ground plan...
Non-maximum suppression (NMS) is essential for state-of-the-art object
d...
We propose a collection of three shift-based primitives for building
eff...
We present HARMONIC, a large multi-modal dataset of human interactions i...
This paper proposes a novel method for understanding daily hand-object
m...
We explore the role of personalization for assistive navigational system...