FrenchMedMCQA: A French Multiple-Choice Question Answering Dataset for Medical domain

04/09/2023
by   Yanis Labrak, et al.

This paper introduces FrenchMedMCQA, the first publicly available Multiple-Choice Question Answering (MCQA) dataset in French for the medical domain. It is composed of 3,105 questions taken from real exams of the French medical specialization diploma in pharmacy, mixing single-answer and multiple-answer questions. Each instance of the dataset contains an identifier, a question, five possible answers and their manual correction(s). We also propose first baseline models for this MCQA task, both to report on current performance and to highlight the difficulty of the task. A detailed analysis of the results shows that representations adapted to the medical domain or to the MCQA task are necessary: in our case, English specialized models yielded better results than generic French ones, even though FrenchMedMCQA is in French. The corpus, models and tools are available online.
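
The instance layout described in the abstract (an identifier, a question, five possible answers and one or more corrections) can be sketched as a small Python record. This is only an illustration under stated assumptions: the class name `MCQAInstance`, the field names, the option labels "a" to "e" and the `exact_match` helper are all hypothetical and are not taken from the released corpus or tools. The strict set-equality scoring is likewise an assumed way to handle questions with multiple correct answers, not necessarily the measure used for the baselines.

```python
from dataclasses import dataclass
from typing import Dict, FrozenSet, Iterable


@dataclass(frozen=True)
class MCQAInstance:
    """Hypothetical record for one question; field names are assumptions."""
    identifier: str
    question: str
    options: Dict[str, str]   # five answer options keyed by a label
    correct: FrozenSet[str]   # labels of the correct option(s), one or more

    def __post_init__(self) -> None:
        # The abstract states that every instance has five possible answers.
        if len(self.options) != 5:
            raise ValueError("expected exactly five answer options")
        # Single- and multiple-answer questions are both allowed,
        # but every correct label must refer to an existing option.
        if not self.correct or not self.correct <= set(self.options):
            raise ValueError("correct labels must be a non-empty subset of the options")


def exact_match(instance: MCQAInstance, predicted: Iterable[str]) -> bool:
    """Assumed scoring rule: the predicted label set must equal the gold set."""
    return frozenset(predicted) == instance.correct


# Placeholder content only; this is not an item from the dataset.
example = MCQAInstance(
    identifier="placeholder-id",
    question="Placeholder question text",
    options={label: f"Placeholder option {label}" for label in "abcde"},
    correct=frozenset({"a", "c"}),
)
```

With this sketch, `exact_match(example, {"a", "c"})` holds, while a partially correct prediction such as `{"a"}` does not, which is one reason multiple-answer questions are harder to score well on than single-answer ones.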

research
03/27/2022

MedMCQA : A Large-scale Multi-Subject Multi-Choice Dataset for Medical domain Question Answering

This paper introduces MedMCQA, a new large-scale, Multiple-Choice Questi...
research
11/02/2008

Effect of Tuned Parameters on a LSA MCQ Answering Model

This paper presents the current state of a work in progress, whose objec...
research
08/18/2021

MeDiaQA: A Question Answering Dataset on Medical Dialogues

In this paper, we introduce MeDiaQA, a novel question answering(QA) data...
research
05/01/2022

ELQA: A Corpus of Questions and Answers about the English Language

We introduce a community-sourced dataset for English Language Question A...
research
03/13/2023

Generating multiple-choice questions for medical question answering with distractors and cue-masking

Medical multiple-choice question answering (MCQA) is particularly diffic...
research
01/30/2022

A Dataset for Medical Instructional Video Classification and Question Answering

This paper introduces a new challenge and datasets to foster research to...
research
11/08/2019

The TechQA Dataset

We introduce TechQA, a domain-adaptation question answering dataset for ...
