Existing dense or paragraph video captioning approaches rely on holistic...
We consider the problem of Visual Question Answering (VQA). Given an ima...
In this work, we introduce VQA 360°, a novel task of visual question answ...
While there are several widely used object detection datasets, current c...
Narrated 360° videos are typically provided in many touring scenarios to ...
For survival, a living agent must have the ability to assess risk (1) by...