Online Non-Additive Path Learning under Full and Partial Information

04/18/2018
by   Corinna Cortes, et al.
0

We consider the online path learning problem in a graph with non-additive gains/losses. Various settings of full information, semi-bandit, and full bandit are explored. We give an efficient implementation of EXP3 algorithm for the full bandit setting with any (non-additive) gain. Then, focusing on the large family of non-additive count-based gains, we construct an intermediate graph which has equivalent gains that are additive. By operating on this intermediate graph, we are able to use algorithms like Component Hedge and ComBand for the first time for non-additive gains. Finally, we apply our methods to the important application of ensemble structured prediction.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
07/23/2021

The geometry of non-additive stabiliser codes

We present a geometric framework for constructing additive and non-addit...
research
06/13/2023

Additive Causal Bandits with Unknown Graph

We explore algorithms to select actions in the causal bandit setting whe...
research
08/08/2023

Multiclass Online Learnability under Bandit Feedback

We study online multiclass classification under bandit feedback. We exte...
research
04/18/2019

Semi-bandit Optimization in the Dispersed Setting

In this work, we study the problem of online optimization of piecewise L...
research
11/25/2019

Minimax Optimal Algorithms for Adversarial Bandit Problem with Multiple Plays

We investigate the adversarial bandit problem with multiple plays under ...
research
02/15/2021

Learning Accurate Decision Trees with Bandit Feedback via Quantized Gradient Descent

Decision trees provide a rich family of highly non-linear but efficient ...
research
01/24/2022

Valid belief updates for prequentially additive loss functions arising in Semi-Modular Inference

Model-based Bayesian evidence combination leads to models with multiple ...

Please sign up or login with your details

Forgot password? Click here to reset