Generating High-Precision Feedback for Programming Syntax Errors using Large Language Models

01/24/2023
by Tung Phung, et al.

Large language models (LLMs), such as Codex, hold great promise for enhancing programming education by automatically generating feedback for students. We investigate using LLMs to generate feedback for fixing syntax errors in Python programs, a key scenario in introductory programming. More concretely, given a student's buggy program, our goal is to generate feedback comprising a fixed program along with a natural language explanation describing the errors and fixes, inspired by how a human tutor would give feedback. While using LLMs is promising, the critical challenge is to ensure high precision in the generated feedback, which is imperative before deploying such technology in classrooms. The main research question we study is: Can we develop LLM-based feedback generation techniques with a tunable precision parameter, giving educators quality control over the feedback that students receive? To this end, we introduce PyFiXV, our technique to generate high-precision feedback powered by Codex. The key idea behind PyFiXV is to use a novel run-time validation mechanism to decide whether the generated feedback is suitable for sharing with the student; notably, this validation mechanism also provides a precision knob to educators. We perform an extensive evaluation using two real-world datasets of Python programs with syntax errors and show the efficacy of PyFiXV in generating high-precision feedback.
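To make the idea of gating feedback behind a validation step more concrete, here is a minimal Python sketch. It is not PyFiXV's actual mechanism, which the abstract does not specify. The names `Feedback`, `syntax_error`, `gate_feedback`, `score_fn`, and `threshold` are all hypothetical, and the numeric threshold is only one assumed way a precision knob could be exposed to educators.

```python
import ast
from dataclasses import dataclass
from typing import Callable, Optional


@dataclass
class Feedback:
    """Hypothetical container: a fixed program plus a natural language explanation."""
    fixed_program: str
    explanation: str


def syntax_error(program: str) -> Optional[str]:
    """Return a short description of the first syntax error, or None if the program parses."""
    try:
        ast.parse(program)
    except SyntaxError as err:  # IndentationError is a subclass of SyntaxError
        return f"line {err.lineno}: {err.msg}"
    return None


def gate_feedback(
    buggy_program: str,
    feedback: Feedback,
    score_fn: Callable[[str, Feedback], float],
    threshold: float,
) -> Optional[Feedback]:
    """Return the feedback only if it passes validation; otherwise return None.

    score_fn is an assumed, caller-supplied validator that returns a score in [0, 1];
    threshold plays the role of an illustrative precision knob.
    """
    # A "fix" that still fails to parse is never worth showing to a student.
    if syntax_error(feedback.fixed_program) is not None:
        return None
    # Withhold feedback whose validation score falls below the chosen threshold.
    if score_fn(buggy_program, feedback) < threshold:
        return None
    return feedback
```

In this sketch, raising `threshold` withholds more feedback in exchange for higher confidence in what is shown, which is the kind of trade-off a tunable precision parameter is meant to give educators.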

research
03/19/2016

Automated Correction for Syntax Errors in Programming Assignments using Recurrent Neural Networks

We present a method for automatically generating repair feedback for syn...
research
07/11/2023

What do LLMs need to Synthesize Correct Router Configurations?

We investigate whether Large Language Models (e.g., GPT-4) can synthesiz...
research
11/20/2020

Effects of Human vs. Automatic Feedback on Students' Understanding of AI Concepts and Programming Style

The use of automatic grading tools has become nearly ubiquitous in large...
research
06/30/2021

Verifix: Verified Repair of Programming Assignments

Automated feedback generation for introductory programming assignments i...
research
06/02/2021

Automatic Assessment of the Design Quality of Python Programs with Personalized Feedback

The assessment of program functionality can generally be accomplished wi...
research
08/07/2023

Exploring Automated Distractor and Feedback Generation for Math Multiple-choice Questions via In-context Learning

Multiple-choice questions (MCQs) are ubiquitous in almost all levels of ...
research
03/22/2021

ast2vec: Utilizing Recursive Neural Encodings of Python Programs

Educational datamining involves the application of datamining techniques...
