Academic Source Code Plagiarism Detection by Measuring Program Behavioural Similarity

02/08/2021
by Hayden Cheers, et al.

Source code plagiarism is a long-standing issue in tertiary computer science education. Many tools have been proposed to aid in its detection. However, existing tools are not robust to pervasive plagiarism-hiding transformations, and as a result can be inaccurate in detecting plagiarised source code. This article presents BPlag, a behavioural approach to source code plagiarism detection. BPlag is designed to be both robust to pervasive plagiarism-hiding transformations and accurate in the detection of plagiarised source code. Greater robustness and accuracy are afforded by analysing the behaviour of a program, as behaviour is perceived to be the aspect of a program least susceptible to plagiarism-hiding transformations. BPlag applies symbolic execution to analyse execution behaviour and represent a program in a novel graph-based format. Plagiarism is then detected by comparing these graphs and evaluating similarity scores. BPlag is evaluated for robustness, accuracy and efficiency against 5 commonly used source code plagiarism detection tools. The evaluation shows that BPlag is more robust to plagiarism-hiding transformations and more accurate in the detection of plagiarised source code, but is less efficient than the compared tools.
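
The idea of comparing graph representations and scoring their similarity can be illustrated with a minimal sketch. This is not BPlag's graph format or its similarity measure, which the abstract does not specify. It assumes a hypothetical representation in which each program is a set of directed edges between operation labels, and it swaps in Jaccard similarity over those edge sets as a stand-in metric. The names `Edge`, `jaccard_similarity`, and the example labels are all illustrative.

```python
from typing import Set, Tuple

# Hypothetical behaviour graph: a set of directed edges between
# operation labels. This is an assumption for illustration only,
# not BPlag's actual graph format.
Edge = Tuple[str, str]


def jaccard_similarity(a: Set[Edge], b: Set[Edge]) -> float:
    """Return |a ∩ b| / |a ∪ b|, or 1.0 when both graphs are empty."""
    if not a and not b:
        return 1.0
    return len(a & b) / len(a | b)


# Nodes are labelled by operation rather than by identifier name, so a
# copy that only renames variables produces the same edge set.
original = {
    ("read_input", "multiply"),
    ("multiply", "compare"),
    ("compare", "print"),
}
renamed = set(original)
unrelated = {("read_input", "concat"), ("concat", "print")}

score_same = jaccard_similarity(original, renamed)
score_diff = jaccard_similarity(original, unrelated)
```

Because the labels in this sketch describe what the program does rather than how it is written, a purely cosmetic transformation leaves the score unchanged, which is the intuition behind comparing behaviour instead of source text.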

