Neural Arabic Question Answering

06/12/2019
by   Hussein Mozannar, et al.
0

This paper tackles the problem of open domain factual Arabic question answering (QA) using Wikipedia as our knowledge source. This constrains the answer of any question to be a span of text in Wikipedia. Open domain QA for Arabic entails three challenges: annotated QA datasets in Arabic, large scale efficient information retrieval and machine reading comprehension. To deal with the lack of Arabic QA datasets we present the Arabic Reading Comprehension Dataset (ARCD) composed of 1,395 questions posed by crowdworkers on Wikipedia articles, and a machine translation of the Stanford Question Answering Dataset (Arabic-SQuAD). Our system for open domain question answering in Arabic (SOQAL) is based on two components: (1) a document retriever using a hierarchical TF-IDF approach and (2) a neural reading comprehension model using the pre-trained bi-directional transformer BERT. Our experiments on ARCD indicate the effectiveness of our approach with our BERT-based reader achieving a 61.3 F1 score, and our open domain system SOQAL achieving a 27.6 F1 score.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
02/13/2022

PQuAD: A Persian Question Answering Dataset

We present Persian Question Answering Dataset (PQuAD), a crowdsourced re...
research
11/10/2021

Pre-trained Transformer-Based Approach for Arabic Question Answering : A Comparative Study

Question answering(QA) is one of the most challenging yet widely investi...
research
11/10/2019

Knowledge Guided Text Retrieval and Reading for Open Domain Question Answering

This paper presents a general approach for open-domain question answerin...
research
03/31/2017

Reading Wikipedia to Answer Open-Domain Questions

This paper proposes to tackle open- domain question answering using Wiki...
research
02/05/2019

End-to-End Open-Domain Question Answering with BERTserini

We demonstrate an end-to-end question answering system that integrates B...
research
08/05/2021

Decoupled Transformer for Scalable Inference in Open-domain Question Answering

Large transformer models, such as BERT, achieve state-of-the-art results...
research
10/19/2021

DEEPAGÉ: Answering Questions in Portuguese about the Brazilian Environment

The challenge of climate change and biome conservation is one of the mos...

Please sign up or login with your details

Forgot password? Click here to reset