Vision-language (VL) pre-training has recently received considerable
att...
Recent advances in multimodal vision and language modeling have predomin...
Multi-modal reasoning systems rely on a pre-trained object detector to
e...
The current modus operandi in NLP involves downloading and fine-tuning
p...
Current approaches to solving classification tasks in NLP involve fine-t...
Recent research towards understanding neural networks probes models in a...
A significant amount of information in today's world is stored in struct...