Novice Type Error Diagnosis with Natural Language Models

10/07/2022
by   Chuqin Geng, et al.
0

Strong static type systems help programmers eliminate many errors without much burden of supplying type annotations. However, this flexibility makes it highly non-trivial to diagnose ill-typed programs, especially for novice programmers. Compared to classic constraint solving and optimization-based approaches, the data-driven approach has shown great promise in identifying the root causes of type errors with higher accuracy. Instead of relying on hand-engineered features, this work explores natural language models for type error localization, which can be trained in an end-to-end fashion without requiring any features. We demonstrate that, for novice type error diagnosis, the language model-based approach significantly outperforms the previous state-of-the-art data-driven approach. Specifically, our model could predict type errors correctly 62 Nate's data-driven model by 11 Furthermore, we also apply structural probes to explain the performance difference between different language models.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
08/25/2017

Learning to Blame: Localizing Novice Type Errors with Data-Driven Diagnosis

Localizing type errors is challenging in languages with global type infe...
research
03/26/2021

Correcting Automated and Manual Speech Transcription Errors using Warped Language Models

Masked language models have revolutionized natural language processing s...
research
05/25/2022

Detecting Label Errors using Pre-Trained Language Models

We show that large pre-trained language models are extremely capable of ...
research
04/04/2023

Using Language Models For Knowledge Acquisition in Natural Language Reasoning Problems

For a natural language problem that requires some non-trivial reasoning ...
research
06/11/2019

Calibration, Entropy Rates, and Memory in Language Models

Building accurate language models that capture meaningful long-term depe...
research
06/22/2018

Evaluating language models of tonal harmony

This study borrows and extends probabilistic language models from natura...
research
12/22/2020

Uncertainty and Surprisal Jointly Deliver the Punchline: Exploiting Incongruity-Based Features for Humor Recognition

Humor recognition has been widely studied as a text classification probl...

Please sign up or login with your details

Forgot password? Click here to reset