DeepAI AI Chat
Log In Sign Up

Subjective Assessment of Text Complexity: A Dataset for German Language

by   Babak Naderi, et al.

This paper presents TextComplexityDE, a dataset consisting of 1000 sentences in German language taken from 23 Wikipedia articles in 3 different article-genres to be used for developing text-complexity predictor models and automatic text simplification in German language. The dataset includes subjective assessment of different text-complexity aspects provided by German learners in level A and B. In addition, it contains manual simplification of 250 of those sentences provided by native speakers and subjective assessment of the simplified sentences by participants from the target group. The subjective ratings were collected using both laboratory studies and crowdsourcing approach.


page 1

page 2

page 3

page 4


A Corpus for Automatic Readability Assessment and Text Simplification of German

In this paper, we present a corpus for use in automatic readability asse...

Pseudo-Labels Are All You Need

Automatically estimating the complexity of texts for readers has a varie...

Modeling subjective assessments of guilt in newspaper crime narratives

Crime reporting is a prevalent form of journalism with the power to shap...

Automatic Readability Assessment of German Sentences with Transformer Ensembles

Reliable methods for automatic readability assessment have the potential...

A Transfer Learning Based Model for Text Readability Assessment in German

Text readability assessment has a wide range of applications for differe...

Effect of Language Proficiency on Subjective Evaluation of Noise Suppression Algorithms

Speech communication systems based on Voice-over-IP technology are frequ...

Introducing the Welsh Text Summarisation Dataset and Baseline Systems

Welsh is an official language in Wales and is spoken by an estimated 884...