
Learning Causal State Representations of Partially Observable Environments
Intelligent agents can cope with sensoryrich environments by learning t...
read it

Stochastic Linear Bandits with Hidden Low Rank Structure
Highdimensional representations often have a lower dimensional underlyi...
read it

Regret Minimization in Partially Observable Linear Quadratic Control
We study the problem of regret minimization in partially observable line...
read it

SampleEfficient Deep RL with Generative Adversarial Tree Search
We propose Generative Adversarial Tree Search (GATS), a sampleefficient...
read it

Experimental results : Reinforcement Learning of POMDPs using Spectral Methods
We propose a new reinforcement learning algorithm for partially observab...
read it

Reinforcement Learning in RichObservation MDPs using Spectral Methods
Designing effective explorationexploitation algorithms in Markov decisi...
read it

Reinforcement Learning of POMDPs using Spectral Methods
We propose a new reinforcement learning algorithm for partially observab...
read it

Efficient Exploration through Bayesian Deep QNetworks
We propose Bayesian Deep QNetwork (BDQN), a practical Thompson sampling...
read it

Stochastic Activation Pruning for Robust Adversarial Defense
Neural networks are known to be vulnerable to adversarial examples. Care...
read it

signSGD: compressed optimisation for nonconvex problems
Training large neural networks requires distributing learning across mul...
read it

signSGD with Majority Vote is Communication Efficient And Byzantine Fault Tolerant
Training neural networks on large datasets can be accelerated by distrib...
read it

Trust Region Policy Optimization of POMDPs
We propose Generalized Trust Region Policy Optimization (GTRPO), a Reinf...
read it

Neural Lander: Stable Drone Landing Control using Learned Dynamics
Precise trajectory control near ground is difficult for multirotor dron...
read it

Regularized Learning for Domain Adaptation under Label Shifts
We propose Regularized Learning under Label shifts (RLLS), a principled ...
read it

Directivity Modes of Earthquake Populations with Unsupervised Learning
We present a novel approach for resolving modes of rupture directivity i...
read it
Kamyar Azizzadenesheli
verfied profile