Epicure: Distilling Sequence Model Predictions into Patterns

08/16/2023
by   Miltiadis Allamanis, et al.
0

Most machine learning models predict a probability distribution over concrete outputs and struggle to accurately predict names over high entropy sequence distributions. Here, we explore finding abstract, high-precision patterns intrinsic to these predictions in order to make abstract predictions that usefully capture rare sequences. In this short paper, we present Epicure, a method that distils the predictions of a sequence model, such as the output of beam search, into simple patterns. Epicure maps a model's predictions into a lattice that represents increasingly more general patterns that subsume the concrete model predictions. On the tasks of predicting a descriptive name of a function given the source code of its body and detecting anomalous names given a function, we show that Epicure yields accurate naming patterns that match the ground truth more often compared to just the highest probability model prediction. For a false alarm rate of 10 compared to the best model prediction, making Epicure well-suited for scenarios that require high precision.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
07/23/2019

Analyzing the Variety Loss in the Context of Probabilistic Trajectory Prediction

Trajectory or behavior prediction of traffic agents is an important comp...
research
06/18/2012

Learning the Experts for Online Sequence Prediction

Online sequence prediction is the problem of predicting the next element...
research
06/20/2018

Accurate and Diverse Sampling of Sequences based on a "Best of Many" Sample Objective

For autonomous agents to successfully operate in the real world, anticip...
research
01/10/2022

Predictions of Reynolds and Nusselt numbers in turbulent convection using machine-learning models

In this paper, we develop a multivariate regression model and a neural n...
research
02/09/2021

Demystifying Code Summarization Models

The last decade has witnessed a rapid advance in machine learning models...
research
09/22/2020

DeepIaC: Deep Learning-Based Linguistic Anti-pattern Detection in IaC

Linguistic anti-patterns are recurring poor practices concerning inconsi...
research
11/17/2016

Nothing Else Matters: Model-Agnostic Explanations By Identifying Prediction Invariance

At the core of interpretable machine learning is the question of whether...

Please sign up or login with your details

Forgot password? Click here to reset