
PixL2R: Guiding Reinforcement Learning Using Natural Language by Mapping Pixels to Rewards
Reinforcement learning (RL), particularly in sparse reward settings, oft...
Bayesian Robust Optimization for Imitation Learning
One of the main challenges in imitation learning is determining what act...
Efficiently Guiding Imitation Learning Algorithms with Human Gaze
Human gaze is known to be an intention-revealing signal in human demonst...
Safe Imitation Learning via Fast Bayesian Reward Inference from Preferences
Bayesian reward learning from demonstrations enables rigorous safety and...
Local Nonparametric Meta-Learning
A central goal of meta-learning is to find a learning rule that enables ...
Deep Bayesian Reward Learning from Preferences
Bayesian inverse reinforcement learning (IRL) methods are ideal for safe...
Learning Hybrid Object Kinematics for Efficient Hierarchical Planning Under Uncertainty
Sudden changes in the dynamics of robotic tasks, such as contact with an...
Understanding Teacher Gaze Patterns for Robot Learning
Human gaze is known to be a strong indicator of underlying human intenti...
Ranking-Based Reward Extrapolation without Rankings
The performance of imitation learning is typically upper-bounded by the ...
A Review of Robot Learning for Manipulation: Challenges, Representations, and Algorithms
A key challenge in intelligent robotics is creating robots that are capa...
Hypothesis-Driven Skill Discovery for Hierarchical Deep Reinforcement Learning
Deep reinforcement learning encompasses many versatile tools for designi...
Uncertainty-Aware Data Aggregation for Deep Imitation Learning
Estimating statistical uncertainties allows autonomous agents to communi...
Extrapolating Beyond Suboptimal Demonstrations via Inverse Reinforcement Learning from Observations
A critical flaw of existing inverse reinforcement learning (IRL) methods...
Using Natural Language for Reward Shaping in Reinforcement Learning
Recent reinforcement learning (RL) approaches have shown strong performa...
Risk-Aware Active Inverse Reinforcement Learning
Active learning from demonstration allows a robot to query a human for s...
LAAIR: A Layered Architecture for Autonomous Interactive Robots
When developing general purpose robots, the overarching software archite...
Towards Online Learning from Corrective Demonstrations
Robots operating in real-world human environments will likely encounter ...
Learning Multi-Step Robotic Tasks from Observation
Due to burdensome data requirements, learning from demonstration often f...
Importance Sampling Policy Evaluation with an Estimated Behavior Policy
In reinforcement learning, off-policy evaluation is the task of using da...
Machine Teaching for Inverse Reinforcement Learning: Algorithms and Applications
Inverse reinforcement learning (IRL) infers a reward function from demon...
Efficient Hierarchical Robot Motion Planning Under Uncertainty and Hybrid Dynamics
Noisy observations coupled with nonlinear dynamics pose one of the bigge...
Leveraging Task Knowledge for Robot Motion Planning Under Uncertainty
Noisy observations coupled with nonlinear dynamics pose one of the bigge...
Safe Reinforcement Learning via Shielding
Reinforcement learning algorithms discover policies that maximize reward...
Efficient Probabilistic Performance Bounds for Inverse Reinforcement Learning
In the field of reinforcement learning there has been recent progress to...
Data-Efficient Policy Evaluation Through Behavior Policy Search
We consider the task of evaluating a policy for a Markov decision proces...
Bootstrapping with Models: Confidence Intervals for Off-Policy Evaluation
For an autonomous agent, executing a poor policy may be costly or even d...
