Mining Software Repositories with a Collaborative Heuristic Repository

03/02/2021
by Hlib Babii, et al.

Many software engineering studies and tasks rely on categorizing software engineering (SE) artifacts. In practice, this is done either by defining simple but often imprecise heuristics, or by manually labelling the artifacts. Unfortunately, errors in these categorizations propagate to the tasks that rely on them. To improve the precision of these categorizations, we propose gathering heuristics in a collaborative heuristic repository, to which researchers can contribute a large number of diverse heuristics for a variety of tasks on a variety of SE artifacts. These heuristics are then leveraged by state-of-the-art weak supervision techniques to train high-quality classifiers, thus improving the categorizations. We present an initial version of the heuristic repository, which we applied to the concrete task of commit classification.
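To make the idea of combining many imprecise heuristics concrete, here is a minimal sketch for commit classification. It is not the authors' implementation: the label names, keyword lists, and function names are hypothetical, and a plain majority vote stands in for the weak supervision techniques the abstract mentions, which this summary does not name.

```python
import re
from collections import Counter

ABSTAIN = None  # a heuristic returns this when it has no opinion


def _words(message):
    """Lowercased alphabetic tokens of a commit message."""
    return set(re.findall(r"[a-z]+", message.lower()))


# Hypothetical keyword heuristics; each one is cheap and imprecise on its own.
def mentions_fix(message):
    return "bugfix" if _words(message) & {"fix", "bug", "crash"} else ABSTAIN


def mentions_feature(message):
    return "feature" if _words(message) & {"add", "implement", "support"} else ABSTAIN


def mentions_refactor(message):
    return "refactoring" if _words(message) & {"refactor", "rename", "cleanup"} else ABSTAIN


HEURISTICS = [mentions_fix, mentions_feature, mentions_refactor]


def weak_label(message, heuristics=HEURISTICS):
    """Combine heuristic votes by majority; abstain on no votes or on a tie."""
    votes = [h(message) for h in heuristics]
    votes = [v for v in votes if v is not ABSTAIN]
    if not votes:
        return ABSTAIN
    ranked = Counter(votes).most_common()
    if len(ranked) > 1 and ranked[0][1] == ranked[1][1]:
        return ABSTAIN  # conflicting heuristics, no clear winner
    return ranked[0][0]
```

In a weak supervision pipeline, labels produced this way would serve as noisy training data for a classifier, and messages on which the heuristics abstain or conflict would simply be left unlabeled.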


