Multilingual Simplification of Medical Texts

05/21/2023
by   Sebastian Joseph, et al.
0

Automated text simplification aims to produce simple versions of complex texts. This task is especially useful in the medical domain, where the latest medical findings are typically communicated via complex and technical articles. This creates barriers for laypeople seeking access to up-to-date medical findings, consequently impeding progress on health literacy. Most existing work on medical text simplification has focused on monolingual settings, with the result that such evidence would be available only in just one language (most often, English). This work addresses this limitation via multilingual simplification, i.e., directly simplifying complex texts into simplified texts in multiple languages. We introduce MultiCochrane, the first sentence-aligned multilingual text simplification dataset for the medical domain in four languages: English, Spanish, French, and Farsi. We evaluate fine-tuned and zero-shot models across these languages, with extensive human assessments and analyses. Although models can now generate viable simplified texts, we identify outstanding challenges that this dataset might be used to address.

READ FULL TEXT

page 1

page 6

page 16

page 31

research
05/25/2023

Revisiting non-English Text Simplification: A Unified Multilingual Benchmark

Recent advancements in high-quality, large-scale English resources have ...
research
10/20/2020

AutoMeTS: The Autocomplete for Medical Text Simplification

The goal of text simplification (TS) is to transform difficult text into...
research
04/15/2022

Evaluating Factuality in Text Simplification

Automated simplification models aim to make input texts more readable. S...
research
02/13/2023

AbLit: A Resource for Analyzing and Generating Abridged Versions of English Literature

Creating an abridged version of a text involves shortening it while main...
research
04/12/2021

Paragraph-level Simplification of Medical Texts

We consider the problem of learning to simplify medical texts. This is i...
research
05/29/2021

Constructing Flow Graphs from Procedural Cybersecurity Texts

Following procedural texts written in natural languages is challenging. ...
research
02/17/2023

Med-EASi: Finely Annotated Dataset and Models for Controllable Simplification of Medical Texts

Automatic medical text simplification can assist providers with patient-...

Please sign up or login with your details

Forgot password? Click here to reset