An Empirical Study on Explainable Prediction of Text Complexity: Preliminaries for Text Simplification

07/31/2020
by   Cristina Garbacea, et al.
5

Text simplification is concerned with reducing the language complexity and improving the readability of professional content so that the text is accessible to readers at different ages and educational levels. As a promising practice to improve the fairness and transparency of text information systems, the notion of text simplification has been mixed in existing literature, ranging all the way through assessing the complexity of single words to automatically generating simplified documents. We show that the general problem of text simplification can be formally decomposed into a compact pipeline of tasks to ensure the transparency and explanability of the process. In this paper, we present a systematic analysis of the first two steps in this pipeline: 1) predicting the complexity of a given piece of text, and 2) identifying complex components from the text considered to be complex. We show that these two tasks can be solved separately, using either lexical approaches or the state-of-the-art deep learning methods, or they can be solved jointly through an end-to-end, explainable machine learning predictor. We propose formal evaluation metrics for both tasks, through which we are able to compare the performance of the candidate approaches using multiple datasets from a diversity of domains.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
12/19/2022

(Psycho-)Linguistic Features Meet Transformer Models for Improved Explainable and Controllable Text Simplification

State-of-the-art text simplification (TS) systems adopt end-to-end neura...
research
08/23/2019

Neural data-to-text generation: A comparison between pipeline and end-to-end architectures

Traditionally, most data-to-text applications have been designed using a...
research
06/16/2021

Developing a Fidelity Evaluation Approach for Interpretable Machine Learning

Although modern machine learning and deep learning methods allow for com...
research
10/08/2020

A Cascade Approach to Neural Abstractive Summarization with Content Selection and Fusion

We present an empirical study in favor of a cascade architecture to neur...
research
02/28/2023

Automatic Heteronym Resolution Pipeline Using RAD-TTS Aligners

Grapheme-to-phoneme (G2P) transduction is part of the standard text-to-s...
research
01/22/2021

A systematic literature review on state-of-the-art deep learning methods for process prediction

Process mining enables the reconstruction and evaluation of business pro...
research
05/24/2023

How To Control Text Simplification? An Empirical Study of Control Tokens for Meaning Preserving Controlled Simplification

Text simplification rewrites text to be more readable for a specific aud...

Please sign up or login with your details

Forgot password? Click here to reset