Exploring the Verifiability of Code Generated by GitHub Copilot

09/05/2022
by Dakota Wong, et al.

GitHub's Copilot generates code quickly. We investigate whether it generates good code. Our approach is to identify a set of problems, ask Copilot to generate solutions, and attempt to formally verify these solutions with Dafny against hand-crafted specifications. We carried out this process on 6 problems and succeeded in formally verifying 4 of the generated solutions. We found evidence that corroborates the current consensus in the literature: Copilot is a powerful tool; however, it should not be "flying the plane" by itself.
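To make the idea of checking a solution against a hand-crafted specification concrete, here is a minimal sketch in Python. It is not Dafny and it proves nothing: Dafny establishes pre- and postconditions statically for all inputs, whereas this sketch only evaluates them at runtime on a few sample inputs. The problem (maximum of a list), the function names, and the sample inputs are all hypothetical and are not taken from the paper.

```python
from typing import Callable, List


def requires(xs: List[int]) -> bool:
    """Precondition (hypothetical spec): the input list is non-empty."""
    return len(xs) > 0


def ensures(xs: List[int], result: int) -> bool:
    """Postcondition (hypothetical spec): result is in xs and no element exceeds it."""
    return result in xs and all(x <= result for x in xs)


def candidate_max(xs: List[int]) -> int:
    """Hypothetical stand-in for a generated solution."""
    best = xs[0]
    for x in xs[1:]:
        if x > best:
            best = x
    return best


def satisfies_spec(
    solution: Callable[[List[int]], int], inputs: List[List[int]]
) -> bool:
    """Check the postcondition on every sample input that meets the precondition."""
    return all(ensures(xs, solution(xs)) for xs in inputs if requires(xs))


# Sample inputs only; the empty list is skipped because it violates the precondition.
samples = [[3], [1, 5, 2], [-4, -9], [7, 7, 7], []]
print(satisfies_spec(candidate_max, samples))
```

A solution that passes such sample checks may still be wrong on inputs that were not tried, which is the gap a formal verifier such as Dafny is meant to close.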


