XOR QA: Cross-lingual Open-Retrieval Question Answering

10/22/2020
by Akari Asai, et al.

Multilingual question answering tasks typically assume that answers exist in the same language as the question. Yet in practice, many languages face both information scarcity, where a language has few reference articles, and information asymmetry, where questions reference concepts from other cultures. This work extends open-retrieval question answering to a cross-lingual setting, enabling questions in one language to be answered using content from another language. We construct a large-scale dataset built on questions from TyDi QA that lack same-language answers. Our task formulation, called Cross-lingual Open-Retrieval Question Answering (XOR QA), includes 40k information-seeking questions across 7 diverse non-English languages. Based on this dataset, we introduce three new tasks that involve cross-lingual document retrieval using multilingual and English resources. We establish baselines with state-of-the-art machine translation systems and cross-lingual pretrained models. Experimental results suggest that XOR QA is a challenging task that will facilitate the development of novel techniques for multilingual question answering. Our data and code are available at https://nlp.cs.washington.edu/xorqa.
