Short Answer Grading Using One-shot Prompting and Text Similarity Scoring Model

05/29/2023
by Su-Youn Yoon, et al.

In this study, we developed an automated short answer grading (ASAG) model that provides both analytic scores and final holistic scores. Short answer items typically consist of multiple sub-questions, and providing an analytic score and the text span relevant to each sub-question can increase the interpretability of the automated scores. Furthermore, the analytic scores and text spans can be used to generate actionable feedback for students. Despite these advantages, most studies have focused on predicting only holistic scores because of the difficulty of constructing datasets with manual annotations. To address this difficulty, we used large language model (LLM)-based one-shot prompting and a text similarity scoring model with domain adaptation using a small manually annotated dataset. The accuracy and quadratic weighted kappa of our model were 0.67 and 0.71, respectively, on a subset of the publicly available ASAG dataset. The model achieved a substantial improvement over the majority baseline.
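
For readers unfamiliar with the two metrics reported above, the following is a minimal Python sketch of accuracy and quadratic weighted kappa, plus a majority baseline. It is an illustration only and not the authors' evaluation code: the function names, the 0-2 score range, and the toy score lists are all assumptions made for this example.

```python
from collections import Counter


def accuracy(human, model):
    """Fraction of responses where the model score equals the human score."""
    return sum(h == m for h, m in zip(human, model)) / len(human)


def quadratic_weighted_kappa(human, model, min_score, max_score):
    """Quadratic weighted kappa between two lists of integer scores.

    Disagreements are penalized by the squared distance between scores,
    normalized by the largest possible squared distance.
    """
    n_cat = max_score - min_score + 1
    if n_cat < 2:
        raise ValueError("need at least two score categories")
    n = len(human)

    # Observed confusion matrix: rows are human scores, columns model scores.
    observed = [[0] * n_cat for _ in range(n_cat)]
    for h, m in zip(human, model):
        observed[h - min_score][m - min_score] += 1

    # Marginal score histograms for each rater.
    hist_h = [sum(row) for row in observed]
    hist_m = [sum(observed[i][j] for i in range(n_cat)) for j in range(n_cat)]

    num = 0.0  # weighted observed disagreement
    den = 0.0  # weighted disagreement expected by chance
    for i in range(n_cat):
        for j in range(n_cat):
            weight = (i - j) ** 2 / (n_cat - 1) ** 2
            expected = hist_h[i] * hist_m[j] / n
            num += weight * observed[i][j]
            den += weight * expected

    if den == 0:
        raise ValueError("kappa is undefined when both raters use one score")
    return 1.0 - num / den


def majority_baseline(human):
    """Predict the most frequent human score for every response."""
    most_common_score = Counter(human).most_common(1)[0][0]
    return [most_common_score] * len(human)


# Hypothetical toy scores on an assumed 0-2 scale (not data from the paper).
human_scores = [0, 1, 2, 2, 2, 1]
baseline_scores = majority_baseline(human_scores)
baseline_accuracy = accuracy(human_scores, baseline_scores)
baseline_qwk = quadratic_weighted_kappa(human_scores, baseline_scores, 0, 2)
```

A majority baseline can reach a non-trivial accuracy when one score dominates, but its quadratic weighted kappa is zero because its predictions carry no information beyond chance agreement. This is one reason the two metrics are usually reported together.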


research
06/01/2023

Modeling and Analyzing Scorer Preferences in Short-Answer Math Questions

Automated scoring of student responses to open-ended questions, includin...
research
04/24/2018

IRT scoring and the principle of consistent order

IRT models are being increasingly used worldwide for test construction a...
research
10/25/2021

"So You Think You're Funny?": Rating the Humour Quotient in Standup Comedy

Computational Humour (CH) has attracted the interest of Natural Language...
research
05/26/2023

Prompt- and Trait Relation-aware Cross-prompt Essay Trait Scoring

Automated essay scoring (AES) aims to score essays written for a given p...
research
05/22/2023

Distilling ChatGPT for Explainable Automated Student Answer Assessment

Assessing student answers and providing valuable feedback is crucial for...
research
11/17/2022

ProtSi: Prototypical Siamese Network with Data Augmentation for Few-Shot Subjective Answer Evaluation

Subjective answer evaluation is a time-consuming and tedious task, and t...
research
06/14/2023

Combining piano performance dimensions for score difficulty classification

Predicting the difficulty of playing a musical score is essential for st...
