We study visually grounded VideoQA in response to the emerging trends of...
This paper strives to solve complex video question answering (VideoQA) w...
We propose to perform video question answering (VideoQA) in a Contrastiv...
Short video platforms have become an important channel for news sharing,...
Video Question Answering (VideoQA) is the task of answering the natural
...
This paper proposes a Video Graph Transformer (VGT) model for Video Quet...
Video Question Answering (VideoQA) is the task of answering questions ab...
Video Question Answering (VideoQA) aims to answer natural language quest...
Video question answering requires the models to understand and reason ab...
We introduce NExT-QA, a rigorously designed video question answering
(Vi...
In this paper, we explore a novel task named visual Relation Grounding i...