Object detection has been expanded from a limited number of categories t...
Large vision-language models are generally applicable to many downstream...
Driven by improved architectures and better representation learning
fram...
We explore a new class of diffusion models based on the transformer
arch...
Masked Autoencoding (MAE) has emerged as an effective approach for
pre-t...
The "Roaring 20s" of visual recognition began with the introduction of V...
Using natural language as a supervision for training visual recognition
...
Recent work has shown that self-supervised pre-training leads to improve...
We present Masked Feature Prediction (MaskFeat) for self-supervised
pre-...
Object detection is a central downstream task used to test if pre-traine...
This paper shows that masked autoencoders (MAE) are scalable self-superv...
Recent advances in 3D perception have shown impressive progress in
under...
This paper does not describe a novel method. Instead, it studies a
strai...
Invariance to a broad array of image corruptions, such as warping, noise...
The rapid progress in 3D scene understanding has come with growing deman...
Arguably one of the top success stories of deep learning is transfer
lea...
Neural networks are often represented as graphs of connections between
n...
Differentiable Neural Architecture Search (DNAS) has demonstrated great
...
Existing neural network architectures in computer vision — whether desig...
We present Momentum Contrast (MoCo) for unsupervised visual representati...
The long-tail distribution of the visual world poses great challenges fo...
Neural Architecture Search (NAS) has emerged as a promising technique fo...
Over the past several years progress in designing better neural network
...
Neural networks for image recognition have evolved through extensive man...
In this paper we study 3D convolutional networks for video understanding...
We present a simple, highly modularized network architecture for image
c...
We develop a new edge detection algorithm that tackles two important iss...
Our proposed deeply-supervised nets (DSN) method simultaneously minimize...