Addressing Selection Bias in Computerized Adaptive Testing: A User-Wise Aggregate Influence Function Approach

08/23/2023
by   Soonwoo Kwon, et al.
0

Computerized Adaptive Testing (CAT) is a widely used, efficient test mode that adapts to the examinee's proficiency level in the test domain. CAT requires pre-trained item profiles, for CAT iteratively assesses the student real-time based on the registered items' profiles, and selects the next item to administer using candidate items' profiles. However, obtaining such item profiles is a costly process that involves gathering a large, dense item-response data, then training a diagnostic model on the collected data. In this paper, we explore the possibility of leveraging response data collected in the CAT service. We first show that this poses a unique challenge due to the inherent selection bias introduced by CAT, i.e., more proficient students will receive harder questions. Indeed, when naively training the diagnostic model using CAT response data, we observe that item profiles deviate significantly from the ground-truth. To tackle the selection bias issue, we propose the user-wise aggregate influence function method. Our intuition is to filter out users whose response data is heavily biased in an aggregate manner, as judged by how much perturbation the added data will introduce during parameter estimation. This way, we may enhance the performance of CAT while introducing minimal bias to the item profiles. We provide extensive experiments to demonstrate the superiority of our proposed method based on the three public datasets and one dataset that contains real-world CAT response data.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
08/17/2021

BOBCAT: Bilevel Optimization-Based Computerized Adaptive Testing

Computerized adaptive testing (CAT) refers to a form of tests that are p...
research
07/19/2023

Amortised Design Optimization for Item Response Theory

Item Response Theory (IRT) is a well known method for assessing response...
research
06/27/2016

Content-Based Top-N Recommendation using Heterogeneous Relations

Top-N recommender systems have been extensively studied. However, the sp...
research
07/17/2020

A network approach to item response data: Development and applications of latent space item response models

We propose a novel network approach to item response data with advantage...
research
10/09/2022

A Spectral Approach to Item Response Theory

The Rasch model is one of the most fundamental models in item response t...
research
04/18/2022

MP2: A Momentum Contrast Approach for Recommendation with Pointwise and Pairwise Learning

Binary pointwise labels (aka implicit feedback) are heavily leveraged by...
research
07/19/2023

UniMatch: A Unified User-Item Matching Framework for the Multi-purpose Merchant Marketing

When doing private domain marketing with cloud services, the merchants u...

Please sign up or login with your details

Forgot password? Click here to reset