Building Domain-Specific Machine Learning Workflows: A Conceptual Framework for the State-of-the-Practice

03/16/2022
by   Bentley James Oakes, et al.
0

Domain experts are increasingly employing machine learning to solve their domain-specific problems. This article presents six key challenges that a domain expert faces in transforming their problem into a computational workflow, and then into an executable implementation. These challenges arise out of our conceptual framework which presents the "route" of options that a domain expert may choose to take while developing their solution. To ground our conceptual framework in the state-of-the-practice, this article discusses a selection of available textual and graphical workflow systems and their support for these six challenges. Case studies from the literature in various domains are also examined to highlight the tools used by the domain experts as well as a classification of the domain-specificity and machine learning usage of their problem, workflow, and implementation. The state-of-the-practice informs our discussion of the six key challenges, where we identify which challenges are not sufficiently addressed by available tools. We also suggest possible research directions for software engineering researchers to increase the automation of these tools and disseminate best-practice techniques between software engineering and various scientific domains.

READ FULL TEXT
research
11/10/2022

Evaluation of tools for describing, reproducing and reusing scientific workflows

In the field of computational science and engineering, workflows often e...
research
08/25/2022

Continuous Deep Learning: A Workflow to Bring Models into Production

Researchers have been highly active to investigate the classical machine...
research
09/13/2023

Reusability Challenges of Scientific Workflows: A Case Study for Galaxy

Scientific workflow has become essential in software engineering because...
research
10/21/2020

A Level-wise Taxonomic Perspective on Automated Machine Learning to Date and Beyond: Challenges and Opportunities

Automated machine learning (AutoML) is essentially automating the proces...
research
10/08/2019

Multilevel Modelling and Domain-Specific Languages

Modern software engineering deals with demanding problems that yield lar...
research
02/18/2021

Using Jupyter for reproducible scientific workflows

Literate computing has emerged as an important tool for computational st...
research
09/12/2019

A Survey of DevOps Concepts and Challenges

DevOps is a collaborative and multidisciplinary organizational effort to...

Please sign up or login with your details

Forgot password? Click here to reset