NorQuAD: Norwegian Question Answering Dataset

05/03/2023
by   Sardana Ivanova, et al.
0

In this paper we present NorQuAD: the first Norwegian question answering dataset for machine reading comprehension. The dataset consists of 4,752 manually created question-answer pairs. We here detail the data collection procedure and present statistics of the dataset. We also benchmark several multilingual and Norwegian monolingual language models on the dataset and compare them against human performance. The dataset will be made freely available.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
02/13/2022

PQuAD: A Persian Question Answering Dataset

We present Persian Question Answering Dataset (PQuAD), a crowdsourced re...
research
09/16/2022

ScreenQA: Large-Scale Question-Answer Pairs over Mobile App Screenshots

We present a new task and dataset, ScreenQA, for screen content understa...
research
09/23/2021

BiRdQA: A Bilingual Dataset for Question Answering on Tricky Riddles

A riddle is a question or statement with double or veiled meanings, foll...
research
09/19/2023

Benchmarks for Pirá 2.0, a Reading Comprehension Dataset about the Ocean, the Brazilian Coast, and Climate Change

Pirá is a reading comprehension dataset focused on the ocean, the Brazil...
research
05/20/2023

VNHSGE: VietNamese High School Graduation Examination Dataset for Large Language Models

The VNHSGE (VietNamese High School Graduation Examination) dataset, deve...
research
01/15/2019

Incremental Reading for Question Answering

Any system which performs goal-directed continual learning must not only...
research
05/11/2023

Overinformative Question Answering by Humans and Machines

When faced with a polar question, speakers often provide overinformative...

Please sign up or login with your details

Forgot password? Click here to reset