i-Sim2Real: Reinforcement Learning of Robotic Policies in Tight Human-Robot Interaction Loops

by   Saminda Abeyruwan, et al.

Sim-to-real transfer is a powerful paradigm for robotic reinforcement learning. The ability to train policies in simulation enables safe exploration and large-scale data collection quickly at low cost. However, prior works in sim-to-real transfer of robotic policies typically do not involve any human-robot interaction because accurately simulating human behavior is an open problem. In this work, our goal is to leverage the power of simulation to train robotic policies that are proficient at interacting with humans upon deployment. But there is a chicken and egg problem – how do we gather examples of a human interacting with a physical robot so as to model human behavior in simulation without already having a robot that is able to interact with a human? Our proposed method, Iterative-Sim-to-Real (i-S2R), attempts to address this. i-S2R bootstraps from a simple model of human behavior and alternates between training in simulation and deploying in the real world. In each iteration, both the human behavior model and the policy are refined. We evaluate our method on a real world robotic table tennis setting, where the objective for the robot is to play cooperatively with a human player for as long as possible. Table tennis is a high-speed, dynamic task that requires the two players to react quickly to each other's moves, making a challenging test bed for research on human-robot interaction. We present results on an industrial robotic arm that is able to cooperatively play table tennis with human players, achieving rallies of 22 successive hits on average and 150 at best. Further, for 80 compared to the sim-to-real (S2R) baseline. For videos of our system in action, please see https://sites.google.com/view/is2r.


page 2

page 22

page 23

page 24

page 25

page 26

page 28

page 29


Human-Robot Collaboration via Deep Reinforcement Learning of Real-World Interactions

We present a robotic setup for real-world testing and evaluation of huma...

Robotic Table Tennis: A Case Study into a High Speed Learning System

We present a deep-dive into a real-world robotic learning system that, i...

RCareWorld: A Human-centric Simulation World for Caregiving Robots

We present RCareWorld, a human-centric simulation world for physical and...

GoalsEye: Learning High Speed Precision Table Tennis on a Physical Robot

Learning goal conditioned control in the real world is a challenging ope...

Training Robots to Evaluate Robots: Example-Based Interactive Reward Functions for Policy Learning

Physical interactions can often help reveal information that is not read...

Real-World Human-Robot Collaborative Reinforcement Learning

The intuitive collaboration of humans and intelligent robots (embodied A...

One Policy to Dress Them All: Learning to Dress People with Diverse Poses and Garments

Robot-assisted dressing could benefit the lives of many people such as o...

Please sign up or login with your details

Forgot password? Click here to reset