Short-answer scoring with ensembles of pretrained language models

02/23/2022
by   Christopher Ormerod, et al.
0

We investigate the effectiveness of ensembles of pretrained transformer-based language models on short answer questions using the Kaggle Automated Short Answer Scoring dataset. We fine-tune a collection of popular small, base, and large pretrained transformer-based language models, and train one feature-base model on the dataset with the aim of testing ensembles of these models. We used an early stopping mechanism and hyperparameter optimization in training. We observe that generally that the larger models perform slightly better, however, they still fall short of state-of-the-art results one their own. Once we consider ensembles of models, there are ensembles of a number of large networks that do produce state-of-the-art results, however, these ensembles are too large to realistically be put in a production environment.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
02/25/2021

Automated essay scoring using efficient transformer-based language models

Automated Essay Scoring (AES) is a cross-disciplinary effort involving E...
research
11/25/2021

Transformer-based Korean Pretrained Language Models: A Survey on Three Years of Progress

With the advent of Transformer, which was used in translation models in ...
research
04/12/2023

Boosted Prompt Ensembles for Large Language Models

Methods such as chain-of-thought prompting and self-consistency have pus...
research
08/30/2021

The effects of data size on Automated Essay Scoring engines

We study the effects of data size and quality on the performance on Auto...
research
09/25/2021

Finetuning Transformer Models to Build ASAG System

Research towards creating systems for automatic grading of student answe...
research
09/06/2020

Duluth at SemEval-2020 Task 7: Using Surprise as a Key to Unlock Humorous Headlines

We use pretrained transformer-based language models in SemEval-2020 Task...
research
09/09/2022

Automatic Readability Assessment of German Sentences with Transformer Ensembles

Reliable methods for automatic readability assessment have the potential...

Please sign up or login with your details

Forgot password? Click here to reset