Deep Reactive Planning in Dynamic Environments

by   Kei Ota, et al.

The main novelty of the proposed approach is that it allows a robot to learn an end-to-end policy which can adapt to changes in the environment during execution. While goal conditioning of policies has been studied in the RL literature, such approaches are not easily extended to cases where the robot's goal can change during execution. This is something that humans are naturally able to do. However, it is difficult for robots to learn such reflexes (i.e., to naturally respond to dynamic environments), especially when the goal location is not explicitly provided to the robot, and instead needs to be perceived through a vision sensor. In the current work, we present a method that can achieve such behavior by combining traditional kinematic planning, deep learning, and deep reinforcement learning in a synergistic fashion to generalize to arbitrary environments. We demonstrate the proposed approach for several reaching and pick-and-place tasks in simulation, as well as on a real system of a 6-DoF industrial manipulator. A video describing our work could be found <>.


page 1

page 3

page 6


Affordance Learning from Play for Sample-Efficient Policy Learning

Robots operating in human-centered environments should have the ability ...

Human-Following and -guiding in Crowded Environments using Semantic Deep-Reinforcement-Learning for Mobile Service Robots

Assistance robots have gained widespread attention in various industries...

Learning Navigation Behaviors End to End

A longstanding goal of behavior-based robotics is to solve high-level na...

Towards continuous control of flippers for a multi-terrain robot using deep reinforcement learning

In this paper we focus on developing a control algorithm for multi-terra...

Map-based Multi-Policy Reinforcement Learning: Enhancing Adaptability of Robots by Deep Reinforcement Learning

In order for robots to perform mission-critical tasks, it is essential t...

Vision-based deep execution monitoring

Execution monitor of high-level robot actions can be effectively improve...

LBGP: Learning Based Goal Planning for Autonomous Following in Front

This paper investigates a hybrid solution which combines deep reinforcem...