
A Transfer Learning Based Model for Text Readability Assessment in German

07/13/2022
by Salar Mohtaj, et al.

Text readability assessment has a wide range of applications for different target groups, from language learners to people with disabilities. The fast pace of textual content production on the web makes it impossible to measure text complexity without the help of machine learning and natural language processing techniques. Although a variety of research has addressed the readability assessment of English text in recent years, there is still room to improve the models for other languages. In this paper, we propose a new model for text complexity assessment of German text based on transfer learning. Our results show that the model outperforms more classical solutions based on linguistic feature extraction from the input text. The best model, which is based on the BERT pre-trained language model, achieved a Root Mean Square Error (RMSE) of 0.483.
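The RMSE figure quoted in the abstract is the square root of the mean squared difference between predicted and reference complexity scores. As a minimal sketch of how that metric is computed, assuming scores are given as plain lists of numbers: the function name `rmse` and the sample ratings below are hypothetical and are not taken from the paper.

```python
import math


def rmse(predicted, actual):
    """Root Mean Square Error between two equal-length sequences of scores."""
    if len(predicted) != len(actual):
        raise ValueError("sequences must have the same length")
    if not predicted:
        raise ValueError("sequences must not be empty")
    # Square each prediction error, average the squares, then take the root.
    squared_errors = [(p - a) ** 2 for p, a in zip(predicted, actual)]
    return math.sqrt(sum(squared_errors) / len(squared_errors))


# Hypothetical complexity scores for illustration only (not from the paper).
human_ratings = [1.0, 2.0, 3.0, 4.0]
model_scores = [1.5, 2.0, 2.5, 4.0]
print(rmse(model_scores, human_ratings))
```

A lower value means the predicted scores lie closer to the reference ratings; because errors are squared before averaging, large individual misses are penalized more heavily than small ones.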


01/11/2022

A Feature Extraction based Model for Hate Speech Identification

The detection of hate speech online has become an important task, as off...
04/16/2019

Subjective Assessment of Text Complexity: A Dataset for German Language

This paper presents TextComplexityDE, a dataset consisting of 1000 sente...
09/16/2019

MFCC-based Recurrent Neural Network for Automatic Clinical Depression Recognition and Assessment from Speech

Major depression, also known as clinical depression, is a constant sense...
10/15/2021

Scribosermo: Fast Speech-to-Text models for German and other Languages

Recent Speech-to-Text models often require a large amount of hardware re...
09/09/2022

Automatic Readability Assessment of German Sentences with Transformer Ensembles

Reliable methods for automatic readability assessment have the potential...
08/19/2022

Pseudo-Labels Are All You Need

Automatically estimating the complexity of texts for readers has a varie...
01/20/2021

The Challenges of Persian User-generated Textual Content: A Machine Learning-Based Approach

Over recent years a lot of research papers and studies have been publish...