
-
Robust Constrained Reinforcement Learning for Continuous Control with Model Misspecification
Many real-world physical control systems are required to satisfy constra...
read it
-
Balancing Constraints and Rewards with Meta-Gradient D4PG
Deploying Reinforcement Learning (RL) agents to solve real-world applica...
read it
-
An empirical investigation of the challenges of real-world reinforcement learning
Reinforcement learning (RL) has proven its worth in a series of artifici...
read it
-
Robust Reinforcement Learning for Continuous Control with Model Misspecification
We provide a framework for incorporating robustness -- to perturbations ...
read it
-
Action Assembly: Sparse Imitation Learning for Text Based Games with Combinatorial Action Spaces
We propose a computationally efficient algorithm that combines compresse...
read it
-
Learn What Not to Learn: Action Elimination with Deep Reinforcement Learning
Learning how to act when there are many available actions in each state ...
read it
-
Reward Constrained Policy Optimization
Teaching agents to perform tasks using Reinforcement Learning is no easy...
read it
-
Soft-Robust Actor-Critic Policy-Gradient
Robust Reinforcement Learning aims to derive an optimal behavior that ac...
read it
-
Unicorn: Continual Learning with a Universal, Off-policy Agent
Some real-world domains are best characterized as a single task, but for...
read it
-
Learning Robust Options
Robust reinforcement learning aims to produce policies that have strong ...
read it
-
Situationally Aware Options
Hierarchical abstractions, also known as options -- a type of temporally...
read it
-
Shallow Updates for Deep Reinforcement Learning
Deep reinforcement learning (DRL) methods such as the Deep Q-Network (DQ...
read it
-
Situational Awareness by Risk-Conscious Skills
Hierarchical Reinforcement Learning has been previously shown to speed u...
read it
-
Adaptive Skills, Adaptive Partitions (ASAP)
We introduce the Adaptive Skills, Adaptive Partitions (ASAP) framework t...
read it
-
Iterative Hierarchical Optimization for Misspecified Problems (IHOMP)
For complex, high-dimensional Markov Decision Processes (MDPs), it may b...
read it
-
CFORB: Circular FREAK-ORB Visual Odometry
We present a novel Visual Odometry algorithm entitled Circular FREAK-ORB...
read it
-
Bootstrapping Skills
The monolithic approach to policy representation in Markov Decision Proc...
read it