LSBert: A Simple Framework for Lexical Simplification

06/25/2020
by   Jipeng Qiang, et al.
0

Lexical simplification (LS) aims to replace complex words in a given sentence with their simpler alternatives of equivalent meaning, to simplify the sentence. Recently unsupervised lexical simplification approaches only rely on the complex word itself regardless of the given sentence to generate candidate substitutions, which will inevitably produce a large number of spurious candidates. In this paper, we propose a lexical simplification framework LSBert based on pretrained representation model Bert, that is capable of (1) making use of the wider context when both detecting the words in need of simplification and generating substitue candidates, and (2) taking five high-quality features into account for ranking candidates, including Bert prediction order, Bert-based language model, and the paraphrase database PPDB, in addition to the word frequency and word similarity commonly used in other LS methods. We show that our system outputs lexical simplifications that are grammatically correct and semantically appropriate, and obtains obvious improvement compared with these baselines, outperforming the state-of-the-art by 29.8 Accuracy points on three well-known benchmarks.

READ FULL TEXT

page 4

page 11

research
07/14/2019

A Simple BERT-Based Approach for Lexical Simplification

Lexical simplification (LS) aims to replace complex words in a given sen...
research
01/15/2022

Automatic Lexical Simplification for Turkish

In this paper, we present the first automatic lexical simplification sys...
research
06/08/2021

Swords: A Benchmark for Lexical Substitution with Improved Data Coverage and Quality

We release a new benchmark for lexical substitution, the task of finding...
research
11/17/2017

Phonological (un)certainty weights lexical activation

Spoken word recognition involves at least two basic computations. First ...
research
12/15/2021

Tracing Text Provenance via Context-Aware Lexical Substitution

Text content created by humans or language models is often stolen or mis...
research
12/19/2022

MANTIS at TSAR-2022 Shared Task: Improved Unsupervised Lexical Simplification with Pretrained Encoders

In this paper we present our contribution to the TSAR-2022 Shared Task o...
research
05/14/2023

ParaLS: Lexical Substitution via Pretrained Paraphraser

Lexical substitution (LS) aims at finding appropriate substitutes for a ...

Please sign up or login with your details

Forgot password? Click here to reset