Personalized Online Machine Learning

09/21/2021
by Ivana Malenica, et al.

In this work, we introduce the Personalized Online Super Learner (POSL), an online ensembling algorithm for streaming data whose optimization procedure accommodates varying degrees of personalization. Namely, POSL optimizes predictions with respect to baseline covariates, so personalization can range from completely individualized (i.e., optimization with respect to the subject ID as a baseline covariate) to pooled across many individuals (i.e., optimization with respect to common baseline covariates). As an online algorithm, POSL learns in real time. POSL can leverage a diverse set of candidate algorithms, including online algorithms with different training and update times, fixed algorithms that are never updated during the procedure, pooled algorithms that learn from many individuals' time series, and individualized algorithms that learn from within a single time series. How POSL ensembles this hybrid of base learning strategies depends on the amount of data collected, the stationarity of the time series, and the shared characteristics of a group of time series. In essence, POSL decides whether to learn across samples, through time, or both, based on the underlying (unknown) structure in the data. Across a wide range of simulations that reflect realistic forecasting scenarios, and in a medical data application, we examine the performance of POSL relative to other current ensembling and online learning methods. We show that POSL provides reliable predictions for time-series data and adjusts to changing data-generating environments. We further improve POSL's practicality by extending it to settings where time series enter and exit dynamically over chronological time.
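
The abstract describes an ensemble that chooses among candidate learners as data stream in. The sketch below is not the authors' POSL procedure; it is a minimal, generic illustration of online selection, assuming two hypothetical candidate forecasters (`last_value`, `running_mean`), squared-error loss, and a rule that picks whichever candidate has the lowest cumulative loss on past observations. All names in it are illustrative assumptions.

```python
from typing import Callable, Dict, List, Sequence, Tuple

# A forecaster maps the history observed so far to a one-step-ahead prediction.
Forecaster = Callable[[Sequence[float]], float]


def last_value(history: Sequence[float]) -> float:
    """Hypothetical candidate: repeat the most recent observation."""
    return history[-1] if history else 0.0


def running_mean(history: Sequence[float]) -> float:
    """Hypothetical candidate: average of all past observations."""
    return sum(history) / len(history) if history else 0.0


def online_select(
    series: Sequence[float], candidates: Dict[str, Forecaster]
) -> Tuple[List[float], List[str], Dict[str, float]]:
    """Predict each point with the candidate that has the lowest cumulative
    squared error so far, then score every candidate on the new observation."""
    cum_loss = {name: 0.0 for name in candidates}
    history: List[float] = []
    preds: List[float] = []
    chosen: List[str] = []
    for y in series:
        # Every candidate forecasts before the outcome is revealed.
        forecasts = {name: f(history) for name, f in candidates.items()}
        # Select using past losses only (ties go to the first candidate).
        best = min(cum_loss, key=cum_loss.get)
        chosen.append(best)
        preds.append(forecasts[best])
        # Reveal the outcome and update each candidate's cumulative loss.
        for name, y_hat in forecasts.items():
            cum_loss[name] += (y - y_hat) ** 2
        history.append(y)
    return preds, chosen, cum_loss


# A series with a level shift, so the two candidates behave differently.
series = [1.0, 1.0, 1.0, 5.0, 5.0, 5.0, 5.0]
preds, chosen, cum_loss = online_select(
    series, {"last_value": last_value, "running_mean": running_mean}
)
```

In this sketch, one possible reading of "degrees of personalization" is to keep a separate `cum_loss` table per subject ID (fully individualized) or a single table shared across subjects (pooled); that mapping is an assumption made for illustration, not a description of the paper's method.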
