IRT scoring and the principle of consistent order

04/24/2018
by   Nancy Lacourly, et al.
0

IRT models are being increasingly used worldwide for test construction and scoring. The study examines the practical implications of estimating individual scores in a paper-and-pencil high-stakes test using 2PL and 3PL models, specifically whether the principle of consistent order holds when scoring with IRT. The principle states that student A, who answers the same (or a larger) number of items of greater difficulty than student B, should outscore B. Results of analyses conducted using actual scores from the Chilean national admission test in mathematics indicate the principle does not hold when scoring with 2PL or 3PL models. Students who answer more items and of greater difficulty may be assigned lower scores. The findings can be explained by examining the mathematical models, since estimated ability scores are an increasing function of the accumulated estimated discriminations for the correct items, not their difficulty. For high stakes tests the decision to use complex model should therefore be a matter of serious deliberation for policy makers and test experts, since fairness and transparency may be compromised.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
05/29/2023

Short Answer Grading Using One-shot Prompting and Text Similarity Scoring Model

In this study, we developed an automated short answer grading (ASAG) mod...
research
06/22/2021

Face Identification Proficiency Test Designed Using Item Response Theory

Measures of face identification proficiency are essential to ensure accu...
research
08/21/2023

Using language models in the implicit automated assessment of mathematical short answer items

We propose a new way to assess certain short constructed responses to ma...
research
02/15/2021

Best vs. All: Equity and Accuracy of Standardized Test Score Reporting

We study a game theoretic model of standardized testing for college admi...
research
10/27/2021

You Are the Best Reviewer of Your Own Papers: An Owner-Assisted Scoring Mechanism

I consider the setting where reviewers offer very noisy scores for a num...
research
12/30/2020

Test Score Algorithms for Budgeted Stochastic Utility Maximization

Motivated by recent developments in designing algorithms based on indivi...
research
07/06/2018

Improving information quality of Wikipedia articles with cooperative principle

Purpose: The purpose of this paper is to investigate the impact of coope...

Please sign up or login with your details

Forgot password? Click here to reset