Towards a Prediction of Machine Learning Training Time to Support Continuous Learning Systems Development

09/20/2023
by   Francesca Marzi, et al.
0

The problem of predicting the training time of machine learning (ML) models has become extremely relevant in the scientific community. Being able to predict a priori the training time of an ML model would enable the automatic selection of the best model both in terms of energy efficiency and in terms of performance in the context of, for instance, MLOps architectures. In this paper, we present the work we are conducting towards this direction. In particular, we present an extensive empirical study of the Full Parameter Time Complexity (FPTC) approach by Zheng et al., which is, to the best of our knowledge, the only approach formalizing the training time of ML models as a function of both dataset's and model's parameters. We study the formulations proposed for the Logistic Regression and Random Forest classifiers, and we highlight the main strengths and weaknesses of the approach. Finally, we observe how, from the conducted study, the prediction of training time is strictly related to the context (i.e., the involved dataset) and how the FPTC approach is not generalizable.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
09/04/2019

Regression-clustering for Improved Accuracy and Training Cost with Molecular-Orbital-Based Machine Learning

Machine learning (ML) in the representation of molecular-orbital-based (...
research
03/16/2018

Snap Machine Learning

We describe an efficient, scalable machine learning library that enables...
research
09/04/2019

Regression-clustering for Improved Accuracy and Training Cost with Molecular-Orbital-BasedMachine Learning

Machine learning (ML) in the representation of molecular-orbital-based (...
research
06/19/2021

Prediction of the facial growth direction with Machine Learning methods

First attempts of prediction of the facial growth (FG) direction were ma...
research
04/12/2022

A Machine Learning Approach to Determine the Semantic Versioning Type of npm Packages Releases

Semantic versioning policy is widely used to indicate the level of chang...
research
07/28/2023

Empirical Study of Straggler Problem in Parameter Server on Iterative Convergent Distributed Machine Learning

The purpose of this study is to test the effectiveness of current stragg...
research
12/02/2022

Matching DNN Compression and Cooperative Training with Resources and Data Availability

To make machine learning (ML) sustainable and apt to run on the diverse ...

Please sign up or login with your details

Forgot password? Click here to reset