Open Problem: Tight Online Confidence Intervals for RKHS Elements

10/28/2021
by   Sattar Vakili, et al.
0

Confidence intervals are a crucial building block in the analysis of various online learning problems. The analysis of kernel based bandit and reinforcement learning problems utilize confidence intervals applicable to the elements of a reproducing kernel Hilbert space (RKHS). However, the existing confidence bounds do not appear to be tight, resulting in suboptimal regret bounds. In fact, the existing regret bounds for several kernelized bandit algorithms (e.g., GP-UCB, GP-TS, and their variants) may fail to even be sublinear. It is unclear whether the suboptimal regret bound is a fundamental shortcoming of these algorithms or an artifact of the proof, and the main challenge seems to stem from the online (sequential) nature of the observation points. We formalize the question of online confidence intervals in the RKHS setting and overview the existing results.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
08/20/2021

Optimal Order Simple Regret for Gaussian Process Bandits

Consider the sequential optimization of a continuous, possibly non-conve...
research
07/14/2023

On the Sublinear Regret of GP-UCB

In the kernelized bandit problem, a learner aims to sequentially compute...
research
08/03/2023

Multitask Learning with No Regret: from Improved Confidence Bounds to Active Learning

Multitask learning is a powerful framework that enables one to simultane...
research
01/28/2020

Bandit optimisation of functions in the Matérn kernel RKHS

We consider the problem of optimising functions in the Reproducing kerne...
research
06/19/2023

Accelerated, physics-inspired inference of skeletal muscle microstructure from diffusion-weighted MRI

Muscle health is a critical component of overall health and quality of l...
research
02/08/2022

Improved Convergence Rates for Sparse Approximation Methods in Kernel-Based Learning

Kernel-based models such as kernel ridge regression and Gaussian process...
research
06/21/2019

Guaranteed Validity for Empirical Approaches to Adaptive Data Analysis

We design a general framework for answering adaptive statistical queries...

Please sign up or login with your details

Forgot password? Click here to reset