The input tokens to Vision Transformers carry little semantic meaning as...
Mixture of Experts (MoE) are rising in popularity as a means to train
ex...
There is a growing discrepancy in computer vision between large-scale mo...
Fine-tuning is a popular way of exploiting knowledge contained in a
pre-...
State-of-the-art detection systems are generally evaluated on their abil...
Style transfer usually refers to the task of applying color and texture
...
We develop a probabilistic technique for colorizing grayscale natural im...