-
Answer-checking in Context: A Multi-modal Fully Attention Network for Visual Question Answering
Visual Question Answering (VQA) is challenging due to the complex cross-...
-
Finding the Evidence: Localization-aware Answer Prediction for Text Visual Question Answering
Image text carries essential information to understand the scene and per...

Hantao Huang