MP-CodeCheck: Evolving Logical Expression Code Anomaly Learning with Iterative Self-Supervision

04/14/2022
by   Urs C. Muff, et al.
0

Machine programming (MP) is concerned with automating software development. According to studies, software engineers spend upwards of 50 development time debugging software. To help accelerate debugging, we present MP-CodeCheck (MPCC). MPCC is an MP system that attempts to identify anomalous code patterns within logical program expressions. In designing MPCC, we developed two novel programming language representations, the formations of which are critical in its ability to exhaustively and efficiently process the billions of lines of code that are used in its self-supervised training. To quantify MPCC's performance, we compare it against ControlFlag, a state-of-the-art self-supervised code anomaly detection system; we find that MPCC is more spatially and temporally efficient. We demonstrate MPCC's anomalous code detection capabilities by exercising it on a variety of open-source GitHub repositories and one proprietary code base. We also provide a brief qualitative study on some of the different classes of code anomalies that MPCC can detect to provide an abbreviated insight into its capabilities.

READ FULL TEXT
research
05/10/2022

Self-Supervised Anomaly Detection: A Survey and Outlook

Over the past few years, anomaly detection, a subfield of machine learni...
research
11/06/2020

ControlFlag: A Self-supervised Idiosyncratic Pattern Detection System for Software Control Structures

Software debugging has been shown to utilize upwards of 50 time. Machine...
research
04/03/2020

Using Large-Scale Anomaly Detection on Code to Improve Kotlin Compiler

In this work, we apply anomaly detection to source code and bytecode to ...
research
09/21/2017

AutoPerf: A Generalized Zero-Positive Learning System to Detect Software Performance Anomalies

In this paper, we present AutoPerf, a generalized software performance a...
research
04/08/2021

CutPaste: Self-Supervised Learning for Anomaly Detection and Localization

We aim at constructing a high performance model for defect detection tha...
research
01/11/2023

Anomalies, Representations, and Self-Supervision

We develop a self-supervised method for density-based anomaly detection ...
research
09/21/2021

Self-supervised Representation Learning for Reliable Robotic Monitoring of Fruit Anomalies

Data augmentation can be a simple yet powerful tool for autonomous robot...

Please sign up or login with your details

Forgot password? Click here to reset