An Incremental Inverse Reinforcement Learning Approach for Motion Planning with Human Preferences

01/25/2023
by   Armin Avaei, et al.
0

Humans often demonstrate diverse behaviors due to their personal preferences, for instance related to their individual execution style or personal margin for safety. In this paper, we consider the problem of integrating such preferences into trajectory planning for robotic manipulators. We first learn reward functions that represent the user path and motion preferences from kinesthetic demonstration. We then use a discrete-time trajectory optimization scheme to produce trajectories that adhere to both task requirements and user preferences. We go beyond the state of art by designing a feature set that captures the fundamental preferences in a manipulation task, such as timing of the motion. We further demonstrate that our method is capable of generalizing such preferences to new scenarios. We implement our algorithm on a Franka Emika 7-DoF robot arm, and validate the functionality and flexibility of our approach in a user study. The results show that non-expert users are able to teach the robot their preferences with just a few iterations of feedback.

READ FULL TEXT

page 1

page 6

page 7

research
01/05/2016

Learning Preferences for Manipulation Tasks from Online Coactive Feedback

We consider the problem of learning preferences over trajectories for mo...
research
09/03/2019

Learning User Preferences for Trajectories from Brain Signals

Robot motions in the presence of humans should not only be feasible and ...
research
06/10/2014

PlanIt: A Crowdsourcing Approach for Learning to Plan Paths from Large Scale Preference Feedback

We consider the problem of learning user preferences over robot trajecto...
research
09/01/2018

Cost Functions for Robot Motion Style

We focus on autonomously generating robot motion for day to day physical...
research
05/09/2023

TidyBot: Personalized Robot Assistance with Large Language Models

For a robot to personalize physical assistance effectively, it must lear...
research
05/21/2018

Planning to Give Information in Partially Observed Domains with a Learned Weighted Entropy Model

In many real-world robotic applications, an autonomous agent must act wi...
research
07/17/2023

SkiVis: Visual Exploration and Route Planning in Ski Resorts

Optimal ski route selection is a challenge based on a multitude of facto...

Please sign up or login with your details

Forgot password? Click here to reset