Coupling Online-Offline Learning for Multi-distributional Data Streams

02/12/2022
by   Zhilin Zhao, et al.
0

The distributions of real-life data streams are usually nonstationary, where one exciting setting is that a stream can be decomposed into several offline intervals with a fixed time horizon but different distributions and an out-of-distribution online interval. We call such data multi-distributional data streams, on which learning an on-the-fly expert for unseen samples with a desirable generalization is demanding yet highly challenging owing to the multi-distributional streaming nature, particularly when initially limited data is available for the online interval. To address these challenges, this work introduces a novel optimization method named coupling online-offline learning (CO_2) with theoretical guarantees about the knowledge transfer, the regret, and the generalization error. CO_2 extracts knowledge by training an offline expert for each offline interval and update an online expert by an off-the-shelf online optimization method in the online interval. CO_2 outputs a hypothesis for each sample by adaptively coupling both the offline experts and the underlying online expert through an expert-tracking strategy to adapt to the dynamic environment. To study the generalization performance of the output hypothesis, we propose a general theory to analyze its excess risk bound related to the loss function properties, the hypothesis class, the data distribution, and the regret.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
06/14/2012

On Local Regret

Online learning aims to perform nearly as well as the best hypothesis in...
research
06/26/2021

Contextual Inverse Optimization: Offline and Online Learning

We study the problems of offline and online contextual optimization with...
research
12/30/2022

Detecting Change Intervals with Isolation Distributional Kernel

Detecting abrupt changes in data distribution is one of the most signifi...
research
11/27/2021

Offline Neural Contextual Bandits: Pessimism, Optimization and Generalization

Offline policy learning (OPL) leverages existing data collected a priori...
research
08/07/2022

Optimal Tracking in Prediction with Expert Advice

We study the prediction with expert advice setting, where the aim is to ...
research
05/02/2018

Online and Offline Analysis of Streaming Data

Online and offline analytics have been traditionally treated separately ...
research
09/10/2019

The Outer Limits of Contention Resolution on Matroids and Connections to the Secretary Problem

Contention resolution schemes have proven to be a useful and unifying ab...

Please sign up or login with your details

Forgot password? Click here to reset