Dev2vec: Representing Domain Expertise of Developers in an Embedding Space

07/11/2022
by   Arghavan Moradi Dakhel, et al.
0

Accurate assessment of the domain expertise of developers is important for assigning the proper candidate to contribute to a project or to attend a job role. Since the potential candidate can come from a large pool, the automated assessment of this domain expertise is a desirable goal. While previous methods have had some success within a single software project, the assessment of a developer's domain expertise from contributions across multiple projects is more challenging. In this paper, we employ doc2vec to represent the domain expertise of developers as embedding vectors. These vectors are derived from different sources that contain evidence of developers' expertise, such as the description of repositories that they contributed, their issue resolving history, and API calls in their commits. We name it dev2vec and demonstrate its effectiveness in representing the technical specialization of developers. Our results indicate that encoding the expertise of developers in an embedding vector outperforms state-of-the-art methods and improves the F1-score up to 21 developers is the most informative source of information to represent the domain expertise of developers in embedding spaces.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
05/20/2020

Representation of Developer Expertise in Open Source Software

With tens of millions of projects and developers, the OSS ecosystem is b...
research
12/16/2021

Developing a Suitability Assessment Criteria for Software Developers: Behavioral Assessment Using Psychometric Test

Developing a Suitability Assessment Criteria for Software Developers: Be...
research
07/08/2021

GitQ- Towards Using Badges as Visual Cues for GitHub Projects

GitHub hosts millions of software repositories, facilitating developers ...
research
06/21/2018

Whom Are You Going to Call?: Determinants of @-Mentions in GitHub Discussions

Open Source Software (OSS) project success relies on crowd contributions...
research
10/06/2021

RevASIDE: Assignment of Suitable Reviewer Sets for Publications from Fixed Candidate Pools

Scientific publishing heavily relies on the assessment of quality of sub...
research
09/20/2018

Should I Bug You? Identifying Domain Experts in Software Projects Using Code Complexity Metrics

In any sufficiently complex software system there are experts, having a ...
research
09/30/2022

Towards effective assessment of steady state performance in Java software: Are we there yet?

Microbenchmarking is a widely used form of performance testing in Java s...

Please sign up or login with your details

Forgot password? Click here to reset