Cross-lingual Approaches for the Detection of Adverse Drug Reactions in German from a Patient's Perspective

08/03/2022
by   Lisa Raithel, et al.
0

In this work, we present the first corpus for German Adverse Drug Reaction (ADR) detection in patient-generated content. The data consists of 4,169 binary annotated documents from a German patient forum, where users talk about health issues and get advice from medical doctors. As is common in social media data in this domain, the class labels of the corpus are very imbalanced. This and a high topic imbalance make it a very challenging dataset, since often, the same symptom can have several causes and is not always related to a medication intake. We aim to encourage further multi-lingual efforts in the domain of ADR detection and provide preliminary experiments for binary classification using different methods of zero- and few-shot learning based on a multi-lingual model. When fine-tuning XLM-RoBERTa first on English patient forum data and then on the new German data, we achieve an F1-score of 37.52 for the positive class. We make the dataset and models publicly available for the community.

READ FULL TEXT

page 7

page 12

page 13

research
11/28/2021

Zero-Shot Cross-Lingual Transfer in Legal Domain Using Transformer Models

Zero-shot cross-lingual transfer is an important feature in modern NLP m...
research
10/02/2020

Cross-Lingual Transfer Learning for Complex Word Identification

Complex Word Identification (CWI) is a task centered on detecting hard-t...
research
05/24/2021

View Distillation with Unlabeled Data for Extracting Adverse Drug Effects from User-Generated Data

We present an algorithm based on multi-layer transformers for identifyin...
research
05/23/2020

From Witch's Shot to Music Making Bones – Resources for Medical Laymen to Technical Language and Vice Versa

Many people share information in social media or forums, like food they ...
research
03/18/2020

X-Stance: A Multilingual Multi-Target Dataset for Stance Detection

We extract a large-scale stance detection dataset from comments written ...
research
06/29/2020

Want to Identify, Extract and Normalize Adverse Drug Reactions in Tweets? Use RoBERTa

This paper presents our approach for task 2 and task 3 of Social Media M...
research
04/07/2020

The Russian Drug Reaction Corpus and Neural Models for Drug Reactions and Effectiveness Detection in User Reviews

The Russian Drug Reaction Corpus (RuDReC) is a new partially annotated c...

Please sign up or login with your details

Forgot password? Click here to reset