Mining Treatment-Outcome Constructs from Sequential Software Engineering Data

01/17/2019
by   Maleknaz Nayebi, et al.
0

Many investigations in empirical software engineering look at sequences of data resulting from development or management processes. In this paper, we propose an analytical approach called the Gandhi-Washington Method (GWM) to investigate the impact of recurring events in software projects. GWM takes an encoding of events and activities provided by a software analyst as input. It uses regular expressions to automatically condense and summarize information and infer treatments. Relating the treatments to the outcome through statistical tests, treatment-outcome constructs are automatically mined from the data. The output of GWM is a set of treatment-outcome constructs. Each treatment in the set of mined constructs is significantly different from the other treatments considering the impact on the outcome and/or is structurally different from other treatments considering the sequence of events. We describe GWM and classes of problems to which GWM can be applied. We demonstrate the applicability of this method for empirical studies on sequences of file editing, code ownership, and release cycle time.

READ FULL TEXT

page 14

page 20

research
09/09/2022

Joint Non-parametric Point Process model for Treatments and Outcomes: Counterfactual Time-series Prediction Under Policy Interventions

Policy makers need to predict the progression of an outcome before adopt...
research
11/21/2019

Analysing Time-Stamped Co-Editing Networks in Software Development Teams using git2net

Data from software repositories have become an important foundation for ...
research
09/27/2021

Assessing Outcome-to-Outcome Interference in Sibling Fixed Effects Models

Sibling fixed effects (FE) models are useful for estimating causal treat...
research
09/29/2020

GraphITE: Estimating Individual Effects of Graph-structured Treatments

Outcome estimation of treatments for target individuals is an important ...
research
02/28/2020

Estimating the impact of treatment compliance over time on smoking cessation using data from ecological momentary assessments (EMA)

The Wisconsin Smoker's Health Study (WSHS2) was a longitudinal trial con...
research
07/18/2018

Moving Beyond the Mean: Analyzing Variance in Software Engineering Experiments

Software Engineering (SE) experiments are traditionally analyzed with st...
research
09/06/2016

Automatically extracting, ranking and visually summarizing the treatments for a disease

Clinicians are expected to have up-to-date and broad knowledge of diseas...

Please sign up or login with your details

Forgot password? Click here to reset