Conclusion Stability for Natural Language Based Mining of Design Discussions

06/17/2021
by   Alvi Mahadi, et al.
0

Developer discussions range from in-person hallway chats to comment chains on bug reports. Being able to identify discussions that touch on software design would be helpful in documentation and refactoring software. Design mining is the application of machine learning techniques to correctly label a given discussion artifact, such as a pull request, as pertaining (or not) to design. In this paper we demonstrate a simple example of how design mining works. We then show how conclusion stability is poor on different artifact types and different projects. We show two techniques – augmentation and context specificity – that greatly improve the conclusion stability and cross-project relevance of design mining. Our new approach achieves AUC of 0.88 on within dataset classification and 0.80 on the cross-dataset classification task.

READ FULL TEXT

page 17

page 30

research
01/06/2020

Cross-Dataset Design Discussion Mining

Being able to identify software discussions that are primarily about des...
research
10/04/2021

Identifying non-natural language artifacts in bug reports

Bug reports are a popular target for natural language processing (NLP). ...
research
05/31/2020

An Exploratory Characterization of Bugs in COVID-19 Software Projects

Context: The dire consequences of the COVID-19 pandemic has influenced d...
research
10/08/2021

Learning to Describe Solutions for Bug Reports Based on Developer Discussions

When a software bug is reported, developers engage in a discussion to co...
research
03/28/2021

Extractive Summarization of Related Bug-fixing Comments in Support of Bug Repair

When developers investigate a new bug report, they search for similar pr...
research
05/09/2021

Improving Patent Mining and Relevance Classification using Transformers

Patent analysis and mining are time-consuming and costly processes for c...

Please sign up or login with your details

Forgot password? Click here to reset