Under the Microscope: Interpreting Readability Assessment Models for Filipino

10/01/2021
by   Joseph Marvin Imperial, et al.
0

Readability assessment is the process of identifying the level of ease or difficulty of a certain piece of text for its intended audience. Approaches have evolved from the use of arithmetic formulas to more complex pattern-recognizing models trained using machine learning algorithms. While using these approaches provide competitive results, limited work is done on analyzing how linguistic variables affect model inference quantitatively. In this work, we dissect machine learning-based readability assessment models in Filipino by performing global and local model interpretation to understand the contributions of varying linguistic features and discuss its implications in the context of the Filipino language. Results show that using a model trained with top features from global interpretation obtained higher performance than the ones using features selected by Spearman correlation. Likewise, we also empirically observed local feature weight boundaries for discriminating reading difficulty at an extremely fine-grained level and their corresponding effects if values are perturbed.

READ FULL TEXT

page 5

page 6

page 8

research
07/31/2021

Diverse Linguistic Features for Assessing Reading Difficulty of Educational Filipino Texts

In order to ensure quality and effective learning, fluency, and comprehe...
research
06/15/2021

Knowledge-Rich BERT Embeddings for Readability Assessment

Automatic readability assessment (ARA) is the task of evaluating the lev...
research
03/29/2016

A Readable Read: Automatic Assessment of Language Learning Materials based on Linguistic Complexity

Corpora and web texts can become a rich language learning resource if we...
research
02/01/2022

Firm-based relatedness using machine learning

The relatedness between an economic actor (for instance a country, or a ...
research
07/09/2021

Learning Syntactic Dense Embedding with Correlation Graph for Automatic Readability Assessment

Deep learning models for automatic readability assessment generally disc...
research
05/30/2020

Linguistic Features for Readability Assessment

Readability assessment aims to automatically classify text by the level ...
research
12/03/2015

Predicting the top and bottom ranks of billboard songs using Machine Learning

The music industry is a 130 billion industry. Predicting whether a song ...

Please sign up or login with your details

Forgot password? Click here to reset