We introduce VisIT-Bench (Visual InsTruction Benchmark), a benchmark for...
Datasets scraped from the internet have been critical to the successes o...
For machine learning systems to be reliable, we must understand their pe...
Conventional CNNs for texture synthesis consist of a sequence of (de)-co...
We study how robust current ImageNet models are to distribution shifts a...
Autoregressive (AR) models have become a popular tool for unsupervised l...
The application of deep recurrent networks to audio transcription has le...