Automated assessment of non-native learner essays: Investigating the role of linguistic features

12/02/2016
by   Sowmya Vajjala, et al.
0

Automatic essay scoring (AES) refers to the process of scoring free text responses to given prompts, considering human grader scores as the gold standard. Writing such essays is an essential component of many language and aptitude exams. Hence, AES became an active and established area of research, and there are many proprietary systems used in real life applications today. However, not much is known about which specific linguistic features are useful for prediction and how much of this is consistent across datasets. This article addresses that by exploring the role of various linguistic features in automatic essay scoring using two publicly available datasets of non-native English essays written in test taking scenarios. The linguistic properties are modeled by encoding lexical, syntactic, discourse and error types of learner language in the feature set. Predictive models are then developed using these features on both datasets and the most predictive features are compared. While the results show that the feature set used results in good predictive models with both datasets, the question "what are the most predictive features?" has a different answer for each dataset.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
06/06/2023

Exploring Linguistic Features for Turkish Text Readability

This paper presents the first comprehensive study on automatic readabili...
research
08/23/2018

Role of Intonation in Scoring Spoken English

In this paper, we have introduced and evaluated intonation based feature...
research
08/26/2020

Machine learning approach of Japanese composition scoring and writing aided system's design

Automatic scoring system is extremely complex for any language. Because ...
research
09/30/2019

Lexical Features Are More Vulnerable, Syntactic Features Have More Predictive Power

Understanding the vulnerability of linguistic features extracted from no...
research
07/13/2017

Is writing style predictive of scientific fraud?

The problem of detecting scientific fraud using machine learning was rec...
research
11/30/2021

Automated Speech Scoring System Under The Lens: Evaluating and interpreting the linguistic cues for language proficiency

English proficiency assessments have become a necessary metric for filte...
research
07/17/2017

Detecting Off-topic Responses to Visual Prompts

Automated methods for essay scoring have made great progress in recent y...

Please sign up or login with your details

Forgot password? Click here to reset