Online Learning in Dynamically Changing Environments

01/31/2023
by   Changlong Wu, et al.
0

We study the problem of online learning and online regret minimization when samples are drawn from a general unknown non-stationary process. We introduce the concept of a dynamic changing process with cost K, where the conditional marginals of the process can vary arbitrarily, but that the number of different conditional marginals is bounded by K over T rounds. For such processes we prove a tight (upto √(log T) factor) bound O(√(KT·𝖵𝖢(ℋ)log T)) for the expected worst case regret of any finite VC-dimensional class ℋ under absolute loss (i.e., the expected miss-classification loss). We then improve this bound for general mixable losses, by establishing a tight (up to log^3 T factor) regret bound O(K·𝖵𝖢(ℋ)log^3 T). We extend these results to general smooth adversary processes with unknown reference measure by showing a sub-linear regret bound for 1-dimensional threshold functions under a general bounded convex loss. Our results can be viewed as a first step towards regret analysis with non-stationary samples in the distribution blind (universal) regime. This also brings a new viewpoint that shifts the study of complexity of the hypothesis classes to the study of the complexity of processes generating data.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
05/19/2023

Non-stationary Projection-free Online Learning with Dynamic and Adaptive Regret Guarantees

Projection-free online learning has drawn increasing interest due to its...
research
09/09/2022

Expected Worst Case Regret via Stochastic Sequential Covering

We study the problem of sequential prediction and online minimax regret ...
research
02/07/2021

Non-stationary Online Learning with Memory and Non-stochastic Control

We study the problem of Online Convex Optimization (OCO) with memory, wh...
research
09/16/2023

Efficient Methods for Non-stationary Online Learning

Non-stationary online learning has drawn much attention in recent years....
research
03/01/2013

Second-Order Non-Stationary Online Learning for Regression

The goal of a learner, in standard online learning, is to have the cumul...
research
12/16/2019

Does the Multisecretary Problem Always Have Bounded Regret?

Arlotto and Gurvich (2019) showed that the regret in the multisecretary ...
research
10/09/2019

Derivative-Free Order-Robust Optimisation

In this paper, we formalise order-robust optimisation as an instance of ...

Please sign up or login with your details

Forgot password? Click here to reset