We present Integrated Multimodal Perception (IMP), a simple and scalable...
Self-supervised pre-training recently demonstrates success on large-scal...
Effective scaling and a flexible task interface enable large language mo...
We present a framework for learning multimodal representations from unla...
Neuro-symbolic representations have proved effective in learning structu...
We address the problem of phrase grounding by learning a multi-level com...
In this study, we propose a deep neural network for reconstructing
intel...