XLMRQA: Open-Domain Question Answering on Vietnamese Wikipedia-based Textual Knowledge Source

04/14/2022
by   Kiet Van Nguyen, et al.
0

Question answering (QA) is a natural language understanding task within the fields of information retrieval and information extraction that has attracted much attention from the computational linguistics and artificial intelligence research community in recent years because of the strong development of machine reading comprehension-based models. A reader-based QA system is a high-level search engine that can find correct answers to queries or questions in open-domain or domain-specific texts using machine reading comprehension (MRC) techniques. The majority of advancements in data resources and machine-learning approaches in the MRC and QA systems, on the other hand, especially in two resource-rich languages such as English and Chinese. A low-resource language like Vietnamese has witnessed a scarcity of research on QA systems. This paper presents XLMRQA, the first Vietnamese QA system using a supervised transformer-based reader on the Wikipedia-based textual knowledge source (using the UIT-ViQuAD corpus), outperforming the two robust QA systems using deep neural network models: DrQA and BERTserini with 24.46 From the results obtained on the three systems, we analyze the influence of question types on the performance of the QA systems.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
03/06/2023

AmQA: Amharic Question Answering Dataset

Question Answering (QA) returns concise answers or answer lists from nat...
research
02/16/2020

Text-based Question Answering from Information Retrieval and Deep Neural Network Perspectives: A Survey

Text-based Question Answering (QA) is a challenging task which aims at f...
research
09/25/2021

More Than Reading Comprehension: A Survey on Datasets and Metrics of Textual Question Answering

Textual Question Answering (QA) aims to provide precise answers to user'...
research
05/12/2022

DTW at Qur'an QA 2022: Utilising Transfer Learning with Transformers for Question Answering in a Low-resource Domain

The task of machine reading comprehension (MRC) is a useful benchmark to...
research
03/31/2017

Reading Wikipedia to Answer Open-Domain Questions

This paper proposes to tackle open- domain question answering using Wiki...
research
04/24/2020

Question Answering over Curated and Open Web Sources

The last few years have seen an explosion of research on the topic of au...
research
10/09/2021

A Framework for Rationale Extraction for Deep QA models

As neural-network-based QA models become deeper and more complex, there ...

Please sign up or login with your details

Forgot password? Click here to reset