Lexical Bias In Essay Level Prediction

09/21/2018
by   Georgios Balikas, et al.
0

Automatically predicting the level of non-native English speakers given their written essays is an interesting machine learning problem. In this work I present the system "balikasg" that achieved the state-of-the-art performance in the CAp 2018 data science challenge among 14 systems. I detail the feature extraction, feature engineering and model selection steps and I evaluate how these decisions impact the system's performance. The paper concludes with remarks for future work.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
05/24/2018

Native Language Cognate Effects on Second Language Lexical Choice

We present a computational analysis of cognate effects on the spontaneou...
research
12/29/2020

Detection of Lexical Stress Errors in Non-native (L2) English with Data Augmentation and Attention

This paper describes two novel complementary techniques that improve the...
research
12/19/2016

Photo-Quality Evaluation based on Computational Aesthetics: Review of Feature Extraction Techniques

Researchers try to model the aesthetic quality of photographs into low a...
research
01/27/2021

A phonetic model of non-native spoken word processing

Non-native speakers show difficulties with spoken word processing. Many ...
research
08/31/2018

Speaker Fluency Level Classification Using Machine Learning Techniques

Level assessment for foreign language students is necessary for putting ...
research
06/29/2023

Statistically Enhanced Learning: a feature engineering framework to boost (any) learning algorithms

Feature engineering is of critical importance in the field of Data Scien...
research
04/24/2017

Detecting English Writing Styles For Non Native Speakers

This paper presents the first attempt, up to our knowledge, to classify ...

Please sign up or login with your details

Forgot password? Click here to reset