Simulating Performance of ML Systems with Offline Profiling

02/17/2020
by   Hongming Huang, et al.
0

We advocate that simulation based on offline profiling is a promising approach to better understand and improve the complex ML systems. Our approach uses operation-level profiling and dataflow based simulation to ensure it offers a unified and automated solution for all frameworks and ML models, and is also accurate by considering the various parallelization strategies in a real system.

READ FULL TEXT

page 1

page 2

page 3

research
02/16/2023

Experimental Demonstration of ML-Based DWDM System Margin Estimation

SNR margins between partially and fully loaded DWDM systems are estimate...
research
09/20/2021

Description of Corner Cases in Automated Driving: Goals and Challenges

Scaling the distribution of automated vehicles requires handling various...
research
06/04/2023

Proteus: Simulating the Performance of Distributed DNN Training

DNN models are becoming increasingly larger to achieve unprecedented acc...
research
03/22/2023

Challenges and opportunities for machine learning in multiscale computational modeling

Many mechanical engineering applications call for multiscale computation...
research
05/18/2022

The games we play: critical complexity improves machine learning

When mathematical modelling is applied to capture a complex system, mult...
research
10/26/2022

Desiderata for next generation of ML model serving

Inference is a significant part of ML software infrastructure. Despite t...
research
06/22/2023

In Situ Framework for Coupling Simulation and Machine Learning with Application to CFD

Recent years have seen many successful applications of machine learning ...

Please sign up or login with your details

Forgot password? Click here to reset