A Transfer Learning Based Model for Text Readability Assessment in German

07/13/2022
by Salar Mohtaj, et al.

Text readability assessment has a wide range of applications for different target groups, from language learners to people with disabilities. The fast pace of textual content production on the web makes it impossible to measure text complexity without the help of machine learning and natural language processing techniques. Although various studies have addressed the readability assessment of English text in recent years, there is still room for improvement in models for other languages. In this paper, we propose a new model for assessing the complexity of German text based on transfer learning. Our results show that the model outperforms more classical solutions based on extracting linguistic features from the input text. The best model, which is based on the BERT pre-trained language model, achieves a Root Mean Square Error (RMSE) of 0.483.
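For readers unfamiliar with the metric reported above, here is a minimal sketch of how RMSE is computed between predicted and reference complexity scores. The function name `rmse` and the example ratings are illustrative assumptions; they are not taken from the paper or its dataset.

```python
import math


def rmse(predicted, actual):
    """Root Mean Square Error between two equal-length sequences of scores."""
    if len(predicted) != len(actual):
        raise ValueError("predicted and actual must have the same length")
    if not predicted:
        raise ValueError("at least one score is required")
    # Square each prediction error, average the squares, then take the root.
    squared_errors = [(p - a) ** 2 for p, a in zip(predicted, actual)]
    return math.sqrt(sum(squared_errors) / len(squared_errors))


# Made-up complexity ratings, for illustration only (not data from the paper).
human_ratings = [1.0, 3.0, 5.0, 2.0]
model_scores = [1.5, 2.5, 5.5, 2.5]

print(rmse(model_scores, human_ratings))  # prints 0.5
```

A lower RMSE means the predicted scores lie closer to the reference ratings on average, with large errors penalized more heavily than small ones.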

research
01/11/2022

A Feature Extraction based Model for Hate Speech Identification

The detection of hate speech online has become an important task, as off...
research
04/16/2019

Subjective Assessment of Text Complexity: A Dataset for German Language

This paper presents TextComplexityDE, a dataset consisting of 1000 sente...
research
09/16/2019

MFCC-based Recurrent Neural Network for Automatic Clinical Depression Recognition and Assessment from Speech

Major depression, also known as clinical depression, is a constant sense...
research
10/15/2021

Scribosermo: Fast Speech-to-Text models for German and other Languages

Recent Speech-to-Text models often require a large amount of hardware re...
research
09/09/2022

Automatic Readability Assessment of German Sentences with Transformer Ensembles

Reliable methods for automatic readability assessment have the potential...
research
07/08/2022

A Medical Information Extraction Workbench to Process German Clinical Text

Background: In the information extraction and natural language processin...
research
01/20/2021

The Challenges of Persian User-generated Textual Content: A Machine Learning-Based Approach

Over recent years a lot of research papers and studies have been publish...