Lexical Complexity Prediction: An Overview

03/08/2023
by Kai North, et al.
The occurrence of unknown words in texts significantly hinders reading comprehension. To improve accessibility for specific target populations, computational modelling has been applied to identify complex words in texts and replace them with simpler alternatives. In this paper, we present an overview of computational approaches to lexical complexity prediction, focusing on work carried out on English data. We survey relevant approaches to this problem, including traditional machine learning classifiers (e.g. SVMs, logistic regression) and deep neural networks, together with the features they rely on, such as word frequency, word length, and features inspired by the psycholinguistics literature, among others. Furthermore, we introduce readers to past competitions and available datasets on this topic. Finally, we include brief sections on applications of lexical complexity prediction, such as readability and text simplification, and on related studies of languages other than English.
