
-
ES-ENAS: Combining Evolution Strategies with Neural Architecture Search at No Extra Cost for Reinforcement Learning
We introduce ES-ENAS, a simple neural architecture search (NAS) algorith...
read it
-
Fairness with Continuous Optimal Transport
Whilst optimal transport (OT) is increasingly being recognized as a powe...
read it
-
Regret Bound Balancing and Elimination for Model Selection in Bandits and RL
We propose a simple model selection approach for algorithms in stochasti...
read it
-
Online Model Selection for Reinforcement Learning with Function Approximation
Deep reinforcement learning has achieved impressive successes yet often ...
read it
-
Ridge Rider: Finding Diverse Solutions by Following Eigenvectors of the Hessian
Over the last decade, a single algorithm has changed many facets of our ...
read it
-
Accelerated Message Passing for Entropy-Regularized MAP Inference
Maximum a posteriori (MAP) inference in discrete-valued Markov random fi...
read it
-
On Optimism in Model-Based Reinforcement Learning
The principle of optimism in the face of uncertainty is prevalent throug...
read it
-
Stochastic Bandits with Linear Constraints
We study a constrained contextual linear bandit setting, where the goal ...
read it
-
Regret Balancing for Bandit and RL Model Selection
We consider model selection in stochastic bandit and reinforcement learn...
read it
-
Learning the Truth From Only One Side of the Story
Learning under one-sided feedback (i.e., where examples arrive in an onl...
read it
-
Stochastic Flows and Geometric Optimization on the Orthogonal Group
We present a new class of stochastic, geometrically-driven optimization ...
read it
-
Robustness Guarantees for Mode Estimation with an Application to Bandits
Mode estimation is a classical problem in statistics with a wide range o...
read it
-
Model Selection in Contextual Stochastic Bandit Problems
We study model selection in stochastic bandit problems. Our approach rel...
read it
-
On Thompson Sampling with Langevin Algorithms
Thompson sampling is a methodology for multi-armed bandit problems that ...
read it
-
Ready Policy One: World Building Through Active Learning
Model-Based Reinforcement Learning (MBRL) offers a promising direction f...
read it
-
Effective Diversity in Population-Based Reinforcement Learning
Maintaining a population of solutions has been shown to increase explora...
read it
-
ES-MAML: Simple Hessian-Free Meta Learning
We introduce ES-MAML, a new framework for solving the model agnostic met...
read it
-
Wasserstein Fair Classification
We propose an approach to fair classification that enforces independence...
read it
-
Reinforcement Learning with Chromatic Networks
We present a new algorithm for finding compact neural networks encoding ...
read it
-
Approximate Sherali-Adams Relaxations for MAP Inference via Entropy Regularization
Maximum a posteriori (MAP) inference is a fundamental computational para...
read it
-
Wasserstein Reinforcement Learning
We propose behavior-driven optimization via Wasserstein distances (WDs) ...
read it
-
Structured Monte Carlo Sampling for Nonisotropic Distributions via Determinantal Point Processes
We propose a new class of structured methods for Monte Carlo (MC) sampli...
read it
-
Adaptive Sample-Efficient Blackbox Optimization via ES-active Subspaces
We present a new algorithm ASEBO for conducting optimization of high-dim...
read it
-
When random search is not enough: Sample-Efficient and Noise-Robust Blackbox Optimization of RL Policies
Interest in derivative-free optimization (DFO) and "evolutionary strateg...
read it
-
Gen-Oja: A Simple and Efficient Algorithm for Streaming Generalized Eigenvector Computation
In this paper, we study the problems of principal Generalized Eigenvecto...
read it
-
Online learning with kernel losses
We present a generalization of the adversarial linear bandits framework,...
read it
-
A note on reinforcement learning with Wasserstein distance regularisation, with applications to multipolicy learning
In this note we describe an application of Wasserstein distance to Reinf...
read it