AutoMeTS: The Autocomplete for Medical Text Simplification

10/20/2020
by   Hoang Van, et al.
0

The goal of text simplification (TS) is to transform difficult text into a version that is easier to understand and more broadly accessible to a wide variety of readers. In some domains, such as healthcare, fully automated approaches cannot be used since information must be accurately preserved. Instead, semi-automated approaches can be used that assist a human writer in simplifying text faster and at a higher quality. In this paper, we examine the application of autocomplete to text simplification in the medical domain. We introduce a new parallel medical data set consisting of aligned English Wikipedia with Simple English Wikipedia sentences and examine the application of pretrained neural language models (PNLMs) on this dataset. We compare four PNLMs(BERT, RoBERTa, XLNet, and GPT-2), and show how the additional context of the sentence to be simplified can be incorporated to achieve better results (6.17 an ensemble model that combines the four PNLMs and outperforms the best individual model by 2.1 64.52

READ FULL TEXT

page 1

page 2

page 3

page 4

research
05/21/2023

Multilingual Simplification of Medical Texts

Automated text simplification aims to produce simple versions of complex...
research
05/10/2023

WikiSQE: A Large-Scale Dataset for Sentence Quality Estimation in Wikipedia

Wikipedia can be edited by anyone and thus contains various quality sent...
research
07/02/2020

Processing South Asian Languages Written in the Latin Script: the Dakshina Dataset

This paper describes the Dakshina dataset, a new resource consisting of ...
research
02/11/2023

NapSS: Paragraph-level Medical Text Simplification via Narrative Prompting and Sentence-matching Summarization

Accessing medical literature is difficult for laypeople as the content i...
research
03/30/2022

Neural Pipeline for Zero-Shot Data-to-Text Generation

In data-to-text (D2T) generation, training on in-domain data leads to ov...
research
12/10/2021

LSH methods for data deduplication in a Wikipedia artificial dataset

This paper illustrates locality sensitive hasing (LSH) models for the id...
research
08/19/2023

Evaluating Transfer Learning for Simplifying GitHub READMEs

Software documentation captures detailed knowledge about a software prod...

Please sign up or login with your details

Forgot password? Click here to reset