Navigating the challenges in creating complex data systems: a development philosophy

10/21/2022
by   Sören Dittmer, et al.
0

In this perspective, we argue that despite the democratization of powerful tools for data science and machine learning over the last decade, developing the code for a trustworthy and effective data science system (DSS) is getting harder. Perverse incentives and a lack of widespread software engineering (SE) skills are among many root causes we identify that naturally give rise to the current systemic crisis in reproducibility of DSSs. We analyze why SE and building large complex systems is, in general, hard. Based on these insights, we identify how SE addresses those difficulties and how we can apply and generalize SE methods to construct DSSs that are fit for purpose. We advocate two key development philosophies, namely that one should incrementally grow – not biphasically plan and build – DSSs, and one should always employ two types of feedback loops during development: one which tests the code's correctness and another that evaluates the code's efficacy.

READ FULL TEXT
research
12/14/2020

A Software Engineering Perspective on Engineering Machine Learning Systems: State of the Art and Challenges

Context: Advancements in machine learning (ML) lead to a shift from the ...
research
07/27/2022

Software Engineering for Serverless Computing

Serverless computing is an emerging cloud computing paradigm that has be...
research
03/14/2023

MeROS: SysML-based Metamodel for ROS-based Systems

The complexity of today's robot control systems implies difficulty in de...
research
03/16/2021

Understanding and Modeling AI-Intensive System Development

Developers of AI-Intensive Systems–i.e., systems that involve both "trad...
research
09/20/2020

A Benchmark Study of the Contemporary Toxicity Detectors on Software Engineering Interactions

Automated filtering of toxic conversations may help an Open-source softw...
research
08/03/2021

The application of artificial intelligence in software engineering: a review challenging conventional wisdom

The field of artificial intelligence (AI) is witnessing a recent upsurge...
research
03/15/2020

How to Improve AI Tools (by Adding in SE Knowledge): Experiments with the TimeLIME Defect Reduction Tool

AI algorithms are being used with increased frequency in SE research and...

Please sign up or login with your details

Forgot password? Click here to reset