Analyzing Similarity in Mathematical Content To Enhance the Detection of Academic Plagiarism

01/25/2018
by Maurice-Roman Isele, et al.

Despite the effort put into detecting academic plagiarism, it remains a ubiquitous problem spanning all disciplines. Various tools have been developed to assist human inspectors by automatically identifying suspicious documents. However, to our knowledge, none of these tools currently uses mathematical content in its analysis. This is problematic because mathematical content potentially represents a significant share of the scientific contribution in academic documents. Hence, ignoring mathematical content considerably limits the detection of plagiarism, especially in disciplines that use mathematics frequently. This paper aims to help close this gap by providing an overview of existing approaches in mathematical information retrieval and an analysis of their applicability to different possible cases of mathematical plagiarism. I find that whereas syntax-based approaches perform particularly well in detecting undisguised plagiarism, structure-based and hybrid approaches promise to also detect forms of disguised mathematical plagiarism, such as plagiarism with renamed identifiers. However, more research in this area is needed to enable the detection of more complex mathematical plagiarism: the scope of current approaches is restricted to the formula level, and an extension to the section level is needed. Additionally, the general detection of equivalence transformations is currently not feasible. Despite these remaining problems, I conclude that the presented approaches could already be used for a basic automated detection system targeting mathematical plagiarism and could therefore enhance current plagiarism detection systems.
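
To make the distinction between syntax-based and structure-based comparison concrete, the following is a minimal sketch and not the method of any system surveyed in the paper. It assumes formulas are already available as small expression trees (nested tuples), and the helper names `tokens` and `normalize_identifiers` are hypothetical. An exact token comparison misses a copy with renamed identifiers, whereas replacing identifiers with placeholders numbered by first appearance exposes the shared structure.

```python
from typing import Dict, List, Optional, Union

# Assumed toy representation: a leaf is a string token, an inner node is
# a tuple (operator, child, child, ...). Real systems would parse LaTeX
# or MathML into such a tree first; that step is out of scope here.
Expr = Union[str, tuple]


def tokens(expr: Expr) -> List[str]:
    """Flatten an expression tree into a prefix-order token list."""
    if isinstance(expr, str):
        return [expr]
    op, *children = expr
    out = [op]
    for child in children:
        out.extend(tokens(child))
    return out


def normalize_identifiers(expr: Expr,
                          mapping: Optional[Dict[str, str]] = None) -> Expr:
    """Replace alphabetic leaves with placeholders numbered by first appearance."""
    if mapping is None:
        mapping = {}
    if isinstance(expr, str):
        if expr.isalpha():  # treat alphabetic leaves as identifiers
            if expr not in mapping:
                mapping[expr] = f"id{len(mapping)}"
            return mapping[expr]
        return expr  # numbers and other symbols stay unchanged
    op, *children = expr
    return (op, *(normalize_identifiers(child, mapping) for child in children))


# a^2 + b^2 = c^2 and the same formula with renamed identifiers
original = ("=", ("+", ("^", "a", "2"), ("^", "b", "2")), ("^", "c", "2"))
renamed = ("=", ("+", ("^", "x", "2"), ("^", "y", "2")), ("^", "z", "2"))
# x^2 + x^2 = z^2 has a different identifier pattern and should not match
different = ("=", ("+", ("^", "x", "2"), ("^", "x", "2")), ("^", "z", "2"))

syntax_match = tokens(original) == tokens(renamed)
structure_match = normalize_identifiers(original) == normalize_identifiers(renamed)
false_match = normalize_identifiers(original) == normalize_identifiers(different)

print(syntax_match)     # False
print(structure_match)  # True
print(false_match)      # False
```

This sketch deliberately handles only renaming. Reordered operands or algebraically equivalent rewrites would still go undetected, which is consistent with the limitation on equivalence transformations noted above.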
