Bot Detection in GitHub Repositories

03/31/2022
by   Natarajan Chidambaram, et al.
0

Contemporary social coding platforms like GitHub promote collaborative development. Many open-source software repositories hosted in these platforms use machine accounts (bots) to automate and facilitate a wide range of effort-intensive and repetitive activities. Determining if an account corresponds to a bot or a human contributor is important for socio-technical development analytics, for example, to understand how humans collaborate and interact in the presence of bots, to assess the positive and negative impact of using bots, to identify the top project contributors, to identify potential bus factors, and so on. Our project aims to include the trained machine learning (ML) classifier from the BoDeGHa bot detection tool as a plugin to the GrimoireLab software development analytics platform. In this work, we present the procedure to form a pipeline for retrieving contribution and contributor data using Perceval, distinguishing bots from humans using BoDeGHa, and visualising the results using Kibana.

READ FULL TEXT

page 1

page 2

page 3

research
03/31/2022

Leveraging Predictions from Multiple Repositories to Improve Bot Detection

Contemporary social coding platforms such as GitHub facilitate collabora...
research
05/08/2023

The GitHub Development Workflow Automation Ecosystems

Large-scale software development has become a highly collaborative and g...
research
07/25/2023

BotHawk: An Approach for Bots Detection in Open Source Software Projects

Social coding platforms have revolutionized collaboration in software de...
research
09/06/2022

Understanding Skills for OSS Communities on GitHub

The development of open source software (OSS) is a broad field which req...
research
03/13/2023

Automatic Identification and Extraction of Assumptions on GitHub

In software development, due to the lack of knowledge or information, ti...
research
03/05/2021

Bots Don't Mind Waiting, Do They? Comparing the Interaction With Automatically and Manually Created Pull Requests

As a maintainer of an open source software project, you are usually happ...
research
11/16/2020

Forking Without Clicking: on How to Identify Software Repository Forks

The notion of software ”fork” has been shifting over time from the (nega...

Please sign up or login with your details

Forgot password? Click here to reset