Creating Large Language Model Resistant Exams: Guidelines and Strategies

04/18/2023
by Simon Kaare Larsen, et al.

The proliferation of Large Language Models (LLMs), such as ChatGPT, has raised concerns about their potential impact on academic integrity, prompting the need for LLM-resistant exam designs. This article investigates the performance of LLMs on exams and their implications for assessment, focusing on ChatGPT's abilities and limitations. We propose guidelines for creating LLM-resistant exams, including content moderation, deliberate inaccuracies, real-world scenarios beyond the model's knowledge base, effective distractor options, evaluating soft skills, and incorporating non-textual information. The article also highlights the significance of adapting assessments to modern tools and promoting essential skills development in students. By adopting these strategies, educators can maintain academic integrity while ensuring that assessments accurately reflect contemporary professional settings and address the challenges and opportunities posed by artificial intelligence in education.
