IslamicPCQA: A Dataset for Persian Multi-hop Complex Question Answering in Islamic Text Resources

04/23/2023
by   Arash Ghafouri, et al.
0

Nowadays, one of the main challenges for Question Answering Systems is to answer complex questions using various sources of information. Multi-hop questions are a type of complex questions that require multi-step reasoning to answer. In this article, the IslamicPCQA dataset is introduced. This is the first Persian dataset for answering complex questions based on non-structured information sources and consists of 12,282 question-answer pairs extracted from 9 Islamic encyclopedias. This dataset has been created inspired by the HotpotQA English dataset approach, which was customized to suit the complexities of the Persian language. Answering questions in this dataset requires more than one paragraph and reasoning. The questions are not limited to any prior knowledge base or ontology, and to provide robust reasoning ability, the dataset also includes supporting facts and key sentences. The prepared dataset covers a wide range of Islamic topics and aims to facilitate answering complex Persian questions within this subject matter

READ FULL TEXT

page 7

page 9

page 11

research
09/25/2018

HotpotQA: A Dataset for Diverse, Explainable Multi-hop Question Answering

Existing question answering (QA) datasets fail to train QA systems to pe...
research
10/25/2021

Improving Embedded Knowledge Graph Multi-hop Question Answering by introducing Relational Chain Reasoning

Knowledge Base Question Answering (KBQA) aims to answer userquestions fr...
research
11/11/2021

A Chinese Multi-type Complex Questions Answering Dataset over Wikidata

Complex Knowledge Base Question Answering is a popular area of research ...
research
02/12/2016

TabMCQ: A Dataset of General Knowledge Tables and Multiple-choice Questions

We describe two new related resources that facilitate modelling of gener...
research
10/04/2022

Mintaka: A Complex, Natural, and Multilingual Dataset for End-to-End Question Answering

We introduce Mintaka, a complex, natural, and multilingual dataset desig...
research
01/18/2019

Identifying Unclear Questions in Community Question Answering Websites

Thousands of complex natural language questions are submitted to communi...
research
10/12/2022

OpenCQA: Open-ended Question Answering with Charts

Charts are very popular to analyze data and convey important insights. P...

Please sign up or login with your details

Forgot password? Click here to reset