STACC: Code Comment Classification using SentenceTransformers

02/25/2023
by   Ali Al-Kaswan, et al.
0

Code comments are a key resource for information about software artefacts. Depending on the use case, only some types of comments are useful. Thus, automatic approaches to classify these comments have been proposed. In this work, we address this need by proposing, STACC, a set of SentenceTransformers-based binary classifiers. These lightweight classifiers are trained and tested on the NLBSE Code Comment Classification tool competition dataset, and surpass the baseline by a significant margin, achieving an average F1 score of 0.74 against the baseline of 0.31, which is an improvement of 139 are publicly available.

READ FULL TEXT
research
03/02/2023

Performance Comparison of Binary Machine Learning Classifiers in Identifying Code Comment Types: An Exploratory Study

Code comments are vital to source code as they help developers with prog...
research
03/10/2021

Identifying bot activity in GitHub pull request and issue comments

Development bots are used on Github to automate repetitive activities. S...
research
06/07/2021

Predicting Different Types of Subtle Toxicity in Unhealthy Online Conversations

This paper investigates the use of machine learning models for the class...
research
06/10/2023

Bootstrapping Code-Text Pretrained Language Model to Detect Inconsistency Between Code and Comment

Comments on source code serve as critical documentation for enabling dev...
research
05/19/2023

Searching by Code: a New SearchBySnippet Dataset and SnippeR Retrieval Model for Searching by Code Snippets

Code search is an important task that has seen many developments in rece...
research
10/07/2020

A ground-truth dataset and classification model for detecting bots in GitHub issue and PR comments

Bots are frequently used in Github repositories to automate repetitive a...
research
05/03/2019

Time-sync Video Tag Extraction Using Semantic Association Graph

Time-sync comments reveal a new way of extracting the online video tags....

Please sign up or login with your details

Forgot password? Click here to reset