Methods for video motion prediction either estimate jointly the instanta...
Camera pose estimation is a long-standing computer vision problem that t...
The variety of objects in the real world is nearly unlimited and is thus...
We present Viewset Diffusion: a framework for training image-conditioned...
We consider the problem of reconstructing a dynamic scene observed from ...
We present Farm3D, a method to learn category-specific 3D reconstructors...
Large-scale Vision-Language Models, such as CLIP, learn powerful image-t...
Incremental object detection (IOD) aims to train an object detector in
p...
Most approaches for self-supervised learning (SSL) are optimised on cura...
Reconstructing the 3D shape of an object from a single RGB image is a
lo...
We consider the problem of reconstructing a full 360 photographic model
...
Inferring a meaningful geometric scene representation from a single imag...
We consider the problem of learning a function that can estimate the 3D
...
In this work, we present a novel approach to photothermal super resoluti...
We propose a new approach to learn to segment multiple image objects wit...
Multi-modal retrieval is an important problem for many applications, suc...
Motion, measured via optical flow, provides a powerful cue to discover a...
Unsupervised localization and segmentation are long-standing computer vi...
With increasing focus on augmented and virtual reality applications (XR)...
There has been a recent surge in methods that aim to decompose and segme...
The goal of self-supervised visual representation learning is to learn
s...
Most of us are not experts in specific fields, such as ornithology.
None...
Computer vision has long relied on ImageNet and other large datasets of
...
Learning deformable 3D objects from 2D images is an extremely ill-posed
...
Recent research has shown that numerous human-interpretable directions e...
Is critical input information encoded in specific sparse pathways within...
A large part of the current success of deep learning lies in the
effecti...
Image manipulation can be considered a special case of image generation ...
We propose a method to learn 3D deformable object categories from raw
si...
Combining clustering and representation learning is one of the most prom...
The goal of the IARAI competition traffic4cast was to predict the city-w...
Understanding images without explicit supervision has become an importan...
We show that generative models can be used to capture visual geometry
co...
State-of-the-art methods for unsupervised representation learning can tr...
As deep reinforcement learning driven by visual perception becomes more
...
3D object detection and pose estimation from a single image are two
inhe...
Humans excel in grasping and manipulating objects because of their life-...
Path planning plays an essential role in many areas of robotics. Various...
Variational methods for revealing visual concepts learned by convolution...
Interaction and collaboration between humans and intelligent machines ha...
Prospection is an important part of how humans come up with new task pla...
Webly-supervised learning has recently emerged as an alternative paradig...
Real-time instrument tracking is a crucial requirement for various
compu...
Recurrent neural networks (RNNs) have achieved state-of-the-art performa...
Many prediction tasks contain uncertainty. In some cases, uncertainty is...
We propose a novel hands-free method to interactively segment 3D medical...
Over the last decade, Convolutional Neural Networks (CNN) saw a tremendo...
This paper addresses the problem of estimating the depth map of a scene ...