
Efficient Local Planning with Linear Function Approximation
We study query and computationally efficient planning algorithms with li...
Improved Regret Bound and Experience Replay in Regularized Policy Iteration
In this work, we study algorithms for learning in infinite-horizon undis...
Optimization Issues in KL-Constrained Approximate Policy Iteration
Many reinforcement learning algorithms can be seen as versions of approx...
Neural Rate Control for Video Encoding using Imitation Learning
In modern video encoders, rate control is a critical component and has b...
A maximum-entropy approach to off-policy evaluation in average-reward MDPs
This work focuses on off-policy evaluation (OPE) with function approxima...
Robotic Table Tennis with Model-Free Reinforcement Learning
We propose a model-free algorithm for learning efficient policies capabl...
Provably Efficient Adaptive Approximate Policy Iteration
Model-free reinforcement learning algorithms combined with value functio...
Exploration-Enhanced POLITEX
We study algorithms for average-cost reinforcement learning problems wit...
Hierarchical Policy Design for Sample-Efficient Learning of Robot Table Tennis Through Self-Play
Training robots with physical bodies requires developing new methods and...
Online Linear Quadratic Control
We study the problem of controlling linear time-invariant systems with k...
Regret Bounds for Model-Free Linear Quadratic Control
Model-free approaches for reinforcement learning (RL) and continuous con...
Context-Dependent Fine-Grained Entity Type Tagging
Entity type tagging is the task of assigning category labels to each men...
Nevena Lazic
