The effects of data size on Automated Essay Scoring engines

08/30/2021
by Christopher Ormerod, et al.

We study the effects of data size and quality on the performance of Automated Essay Scoring (AES) engines designed in accordance with three different paradigms: a frequency and hand-crafted feature-based model, a recurrent neural network model, and a pretrained transformer-based language model that is fine-tuned for classification. We expect that each type of model benefits from the size and the quality of the training data in very different ways. Standard practices for developing training data for AES engines were established with feature-based methods in mind. However, since neural networks are increasingly being considered in a production setting, this work seeks to inform how to establish better training data for neural networks that will be used in production.
