Predicting the Relative Difficulty of Single Sentences With and Without Surrounding Context

by   Elliot Schumacher, et al.

The problem of accurately predicting relative reading difficulty across a set of sentences arises in a number of important natural language applications, such as finding and curating effective usage examples for intelligent language tutoring systems. Yet while significant research has explored document- and passage-level reading difficulty, the special challenges involved in assessing aspects of readability for single sentences have received much less attention, particularly when considering the role of surrounding passages. We introduce and evaluate a novel approach for estimating the relative reading difficulty of a set of sentences, with and without surrounding context. Using different sets of lexical and grammatical features, we explore models for predicting pairwise relative difficulty using logistic regression, and examine rankings generated by aggregating pairwise difficulty labels using a Bayesian rating system to form a final ranking. We also compare rankings derived for sentences assessed with and without context, and find that contextual features can help predict differences in relative difficulty judgments across these two conditions.



There are no comments yet.


page 1

page 2

page 3

page 4


Characterizing the Influence of Features on Reading Difficulty Estimation for Non-native Readers

In recent years, the number of people studying English as a second langu...

Predicting Text Readability from Scrolling Interactions

Judging the readability of text has many important applications, for ins...

Automatic Quality Estimation for Natural Language Generation: Ranting (Jointly Rating and Ranking)

We present a recurrent neural network based system for automatic quality...

On the Use of Context for Predicting Citation Worthiness of Sentences in Scholarly Articles

In this paper, we study the importance of context in predicting the cita...

Enabling Language Models to Fill in the Blanks

We present a simple approach for text infilling, the task of predicting ...

Grammatical Templates: Improving Text Difficulty Evaluation for Language Learners

Language students are most engaged while reading texts at an appropriate...

A Readability Analysis of Campaign Speeches from the 2016 US Presidential Campaign

Readability is defined as the reading level of the speech from grade 1 t...
This week in AI

Get the week's most popular data science and artificial intelligence research sent straight to your inbox every Saturday.