To explain the predicted answers and evaluate the reasoning abilities of...
Several multi-hop reading comprehension datasets have been proposed to
r...
The issue of shortcut learning is widely known in NLP and has been an
im...
A multi-hop question answering (QA) dataset aims to test reasoning and
i...