Using Paragraph Vectors to improve our existing code review assisting tool-CRUSO

04/29/2021
by   Ritu Kapur, et al.
0

Code reviews are one of the effective methods to estimate defectiveness in source code. However, the existing methods are dependent on experts or inefficient. In this paper, we improve the performance (in terms of speed and memory usage) of our existing code review assisting tool–CRUSO. The central idea of the approach is to estimate the defectiveness for an input source code by using the defectiveness score of similar code fragments present in various StackOverflow (SO) posts. The significant contributions of our paper are i) SOpostsDB: a dataset containing the PVA vectors and the SO posts information, ii) CRUSO-P: a code review assisting system based on PVA models trained on SOpostsDB. For a given input source code, CRUSO-P labels it as Likely to be defective, Unlikely to be defective, Unpredictable. To develop CRUSO-P, we processed >3 million SO posts and 188200+ GitHub source files. CRUSO-P is designed to work with source code written in the popular programming languages C, C#, Java, JavaScript, and Python. CRUSO-P outperforms CRUSO with an improvement of 97.82 a storage reduction of 99.15 of 99.6 improvement of 5.6

READ FULL TEXT

page 1

page 2

page 3

page 4

research
03/15/2018

Using StackOverflow content to assist in code review

An important goal for programmers is to minimize cost of identifying and...
research
09/21/2018

SCC: Automatic Classification of Code Snippets

Determining the programming language of a source code file has been cons...
research
03/21/2018

Estimating defectiveness of source code: A predictive model using GitHub content

Two key contributions presented in this paper are: i) A method for build...
research
02/06/2019

A Comparison of Information Retrieval Techniques for Detecting Source Code Plagiarism

Plagiarism is a commonly encountered problem in the academia. While ther...
research
03/22/2021

psc2code: Denoising Code Extraction from Programming Screencasts

In this paper, we propose an approach named psc2code to denoise the proc...
research
09/02/2023

Towards Code Watermarking with Dual-Channel Transformations

The expansion of the open source community and the rise of large languag...
research
04/01/2019

STYLE-ANALYZER: fixing code style inconsistencies with interpretable unsupervised algorithms

Source code reviews are manual, time-consuming, and expensive. Human inv...

Please sign up or login with your details

Forgot password? Click here to reset