Memes are a popular form of communicating trends and ideas in social med...
The extraction of structured clinical information from free-text radiolo...
A major goal of multimodal research is to improve machine understanding ...
Recent transformer-based offline video instance segmentation (VIS) appro...
Building models that can be rapidly adapted to numerous tasks using only...
A comprehensive representation of an image requires understanding object...
We present a unified computational theory of perception and memory. In o...
This paper presents a new framework for training image-based classifiers...
A main challenge in scene graph classification is that the appearance of...
We analyse perception and memory using mathematical models for knowledge...
State of the art visual relation detection methods have been relying on
...
We propose an inverse reinforcement learning (IRL) approach using Deep
Q...