Sparse Mixture-of-Experts models (MoEs) have recently gained popularity ...
This paper summarizes model improvements and inference-time optimization...
In this paper, we study contrastive learning from an optimization
perspe...
This work targets automated designing and scaling of Vision Transformers...
This work presents a simple vision transformer design as a strong baseli...
A recent work from Bello shows that training and scaling strategies may ...
The speed-accuracy Pareto curve of object detection systems have advance...
Scale-permuted networks have shown promising results on object bounding ...
Novel computer vision architectures monopolize the spotlight, but the im...
Recently, SpineNet has demonstrated promising results on object detectio...
Convolutional neural networks typically encode an input image into a ser...
In this paper, we introduce the problem of estimating the real world dep...
In this paper, a new deep learning architecture for stereo disparity
est...
Compared to the general semantic segmentation problem, portrait segmenta...
We propose a deep neural network fusion architecture for fast and robust...