-
Achieving Near Instance-Optimality and Minimax-Optimality in Stochastic and Adversarial Linear Bandits Simultaneously
In this work, we develop linear bandit algorithms that automatically ada...
read it
-
Last-iterate Convergence of Decentralized Optimistic Gradient Descent/Ascent in Infinite-horizon Competitive Markov Games
We study infinite-horizon discounted two-player zero-sum Markov games, a...
read it
-
Linear Last-iterate Convergence for Matrix Games and Stochastic Games
Optimistic Gradient Descent Ascent (OGDA) algorithm for saddle-point opt...
read it
-
Bias no more: high-probability data-dependent regret bounds for adversarial bandits and MDPs
We develop a new approach to obtaining high probability regret bounds fo...
read it
-
A Closer Look at Small-loss Bounds for Bandits with Graph Feedback
We study small-loss bounds for the adversarial multi-armed bandits probl...
read it
-
A New Algorithm for Non-stationary Contextual Bandits: Efficient, Optimal, and Parameter-free
We propose the first contextual bandit algorithm that is parameter-free,...
read it
-
Multi-Label Zero-Shot Learning with Structured Knowledge Graphs
In this paper, we propose a novel deep learning architecture for multi-l...
read it

Chung-Wei Lee
is this you? claim profile