HOLMES: Health OnLine Model Ensemble Serving for Deep Learning Models in Intensive Care Units

08/10/2020
by   Shenda Hong, et al.
0

Deep learning models have achieved expert-level performance in healthcare with an exclusive focus on training accurate models. However, in many clinical environments such as intensive care unit (ICU), real-time model serving is equally if not more important than accuracy, because in ICU patient care is simultaneously more urgent and more expensive. Clinical decisions and their timeliness, therefore, directly affect both the patient outcome and the cost of care. To make timely decisions, we argue the underlying serving system must be latency-aware. To compound the challenge, health analytic applications often require a combination of models instead of a single model, to better specialize individual models for different targets, multi-modal data, different prediction windows, and potentially personalized predictions. To address these challenges, we propose HOLMES-an online model ensemble serving framework for healthcare applications. HOLMES dynamically identifies the best performing set of models to ensemble for highest accuracy, while also satisfying sub-second latency constraints on end-to-end prediction. We demonstrate that HOLMES is able to navigate the accuracy/latency tradeoff efficiently, compose the ensemble, and serve the model ensemble pipeline, scaling to simultaneously streaming data from 100 patients, each producing waveform data at 250 Hz. HOLMES outperforms the conventional offline batch-processed inference for the same clinical task in terms of accuracy and latency (by order of magnitude). HOLMES is tested on risk prediction task on pediatric cardio ICU data with above 95 accuracy and sub-second latency on 64-bed simulation.

READ FULL TEXT
POST COMMENT

Comments

There are no comments yet.

Authors

page 3

06/01/2018

Visualizing Patient Timelines in the Intensive Care Unit

Electronic Health Records (EHRs) contain a large volume of heterogeneous...
12/17/2021

Optimal discharge of patients from intensive care via a data-driven policy learning framework

Clinical decision support tools rooted in machine learning and optimizat...
06/09/2021

Cocktail: Leveraging Ensemble Learning for Optimized Model Serving in Public Cloud

With a growing demand for adopting ML models for a varietyof application...
05/12/2021

Early prediction of respiratory failure in the intensive care unit

The development of respiratory failure is common among patients in inten...
04/02/2019

BARISTA: Efficient and Scalable Serverless Serving System for Deep Learning Prediction Services

Pre-trained deep learning models are increasingly being used to offer a ...
01/14/2019

A Self-Correcting Deep Learning Approach to Predict Acute Conditions in Critical Care

In critical care, intensivists are required to continuously monitor high...
This week in AI

Get the week's most popular data science and artificial intelligence research sent straight to your inbox every Saturday.