Exploring Language Patterns in a Medical Licensure Exam Item Bank

11/20/2021
by Swati Padhee, et al.

This study examines the use of natural language processing (NLP) models to evaluate whether language patterns used by item writers in a medical licensure exam might contain evidence of biased or stereotypical language. Such bias in item language can be particularly consequential in a medical licensure assessment, as it could threaten content validity and the defensibility of the validity evidence for test scores. To the best of our knowledge, this is the first attempt to use machine learning (ML) and NLP to explore language bias in a large item bank. Using a prediction algorithm trained on clusters of similar item stems, we demonstrate that our approach can be used to review large item banks for potentially biased language or stereotypical patient characteristics in clinical science vignettes. The findings may guide the development of methods to address stereotypical language patterns found in test items and enable efficient updating of those items, if needed, to reflect contemporary norms, thereby strengthening the evidence supporting the validity of the test scores.

research
06/04/2018

Neural Network-based exploration of construct validity for Russian version of the 10-item Big Five Inventory

This study aims to present a new method of exploring construct validity ...
research
08/23/2019

Training Optimus Prime, M.D.: Generating Medical Certification Items by Fine-Tuning OpenAI's gpt2 Transformer Model

This article describes new results of an application using transformer-b...
research
11/11/2020

Situated Data, Situated Systems: A Methodology to Engage with Power Relations in Natural Language Processing Research

We propose a bias-aware methodology to engage with power relations in na...
research
04/16/2023

Mini-VLAT: A Short and Effective Measure of Visualization Literacy

The visualization community regards visualization literacy as a necessar...
research
01/26/2019

The CATS Hackathon: Creating and Refining Test Items for Cybersecurity Concept Inventories

For two days in February 2018, 17 cybersecurity educators and profession...
research
04/23/2021

Heterogeneous item populations across individuals: Consequences for the factor model, item inter-correlations, and scale validity

The paper is devoted to the consequences of blind random selection of it...
research
03/29/2020

Improving Emergency Department ESI Acuity Assignment Using Machine Learning and Clinical Natural Language Processing

Effective triage is critical to mitigating the effect of increased volum...
