Identifying Algorithm Names in Code Comments

07/10/2019
by   Jakapong Klainongsuang, et al.
0

For recent machine-learning-based tasks like API sequence generation, comment generation, and document generation, large amount of data is needed. When software developers implement algorithms in code, we find that they often mention algorithm names in code comments. Code annotated with such algorithm names can be valuable data sources. In this paper, we propose an automatic method of algorithm name identification. The key idea is extracting important N-gram words containing the word `algorithm' in the last. We also consider part of speech patterns to derive rules for appropriate algorithm name identification. The result of our rule evaluation produced high precision and recall values (more than 0.70). We apply our rules to extract algorithm names in a large amount of comments from active FLOSS projects written in seven programming languages, C, C++, Java, JavaScript, Python, PHP, and Ruby, and report commonly mentioned algorithm names in code comments.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
05/06/2019

Analyzing Code Comments to Boost Program Comprehension

We are trying to find source code comments that help programmers underst...
research
08/25/2021

RepliComment: Identifying Clones in Code Comments

Code comments are the primary means to document implementation and facil...
research
03/19/2021

API2Com: On the Improvement of Automatically Generated Code Comments Using API Documentations

Code comments can help in program comprehension and are considered as im...
research
10/31/2019

Implementation of an Index Optimize Technology for Highly Specialized Terms based on the Phonetic Algorithm Metaphone

When compiling databases, for example to meet the needs of healthcare es...
research
04/16/2020

Deep Generation of Coq Lemma Names Using Elaborated Terms

Coding conventions for naming, spacing, and other essentially stylistic ...
research
10/07/2022

Removing Qualified Names in Modular Languages

Although the notion of qualified names is popular in module systems, it ...
research
03/01/2021

Roosterize: Suggesting Lemma Names for Coq Verification Projects Using Deep Learning

Naming conventions are an important concern in large verification projec...

Please sign up or login with your details

Forgot password? Click here to reset