Online Self-Concordant and Relatively Smooth Minimization, With Applications to Online Portfolio Selection and Learning Quantum States

10/03/2022
by   Chung-En Tsai, et al.
0

Consider an online convex optimization problem where the loss functions are self-concordant barriers, smooth relative to a convex function h, and possibly non-Lipschitz. We analyze the regret of online mirror descent with h. Then, based on the result, we prove the following in a unified manner. Denote by T the time horizon and d the parameter dimension. 1. For online portfolio selection, the regret of EG, a variant of exponentiated gradient due to Helmbold et al., is Õ ( T^2/3 d^1/3 ) when T > 4 d / log d. This improves on the original Õ ( T^3/4 d^1/2 ) regret bound for EG. 2. For online portfolio selection, the regret of online mirror descent with the logarithmic barrier is Õ(√(T d)). The regret bound is the same as that of Soft-Bayes due to Orseau et al. up to logarithmic terms. 3. For online learning quantum states with the logarithmic loss, the regret of online mirror descent with the log-determinant function is also Õ ( √(T d) ). Its per-iteration time is shorter than all existing algorithms we know.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
05/23/2023

Data-Dependent Bounds for Online Portfolio Selection Without Lipschitzness and Smoothness

This work introduces the first small-loss and gradual-variation regret b...
research
02/15/2022

Damped Online Newton Step for Portfolio Selection

We revisit the classic online portfolio selection problem, where at each...
research
02/06/2022

Pushing the Efficiency-Regret Pareto Frontier for Online Learning of Portfolios and Quantum States

We revisit the classical online portfolio selection problem. It is widel...
research
01/08/2019

Soft-Bayes: Prod for Mixtures of Experts with Log-Loss

We consider prediction with expert advice under the log-loss with the go...
research
05/18/2018

Efficient Online Portfolio with Logarithmic Regret

We study the decades-old problem of online portfolio management and prop...
research
08/24/2020

Online Convex Optimization Perspective for Learning from Dynamically Revealed Preferences

We study the problem of online learning (OL) from revealed preferences: ...
research
09/28/2021

Near-Linear Time Algorithm with Near-Logarithmic Regret Per Switch for Mixable/Exp-Concave Losses

We investigate the problem of online learning, which has gained signific...

Please sign up or login with your details

Forgot password? Click here to reset