Performance Comparison of Binary Machine Learning Classifiers in Identifying Code Comment Types: An Exploratory Study

03/02/2023
by   Amila Indika, et al.
0

Code comments are vital to source code as they help developers with program comprehension tasks. Written in natural language (usually English), code comments convey a variety of different information, which are grouped into specific categories. In this study, we construct 19 binary machine learning classifiers for code comment categories that belong to three different programming languages. We present a comparison of performance scores for different types of machine learning classifiers and show that the Linear SVC classifier has the highest average F1 score of 0.5474.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
05/06/2019

Analyzing Code Comments to Boost Program Comprehension

We are trying to find source code comments that help programmers underst...
research
02/25/2023

STACC: Code Comment Classification using SentenceTransformers

Code comments are a key resource for information about software artefact...
research
08/20/2021

Readability and Understandability Scores for Snippet Assessment: an Exploratory Study

Code search engines usually use readability feature to rank code snippet...
research
07/06/2018

Recommending Insightful Comments for Source Code using Crowdsourced Knowledge

Recently, automatic code comment generation is proposed to facilitate pr...
research
01/21/2021

Soft Genetic Programming Binary Classifiers

The study of the classifier's design and it's usage is one of the most i...
research
03/23/2021

Characterising the Knowledge about Primitive Variables in Java Code Comments

Primitive types are fundamental components available in any programming ...
research
11/18/2017

Prediction Scores as a Window into Classifier Behavior

Most multi-class classifiers make their prediction for a test sample by ...

Please sign up or login with your details

Forgot password? Click here to reset