Demonstration-Guided Reinforcement Learning with Efficient Exploration for Task Automation of Surgical Robot

02/20/2023
by Tao Huang, et al.

Task automation of surgical robots has the potential to improve surgical efficiency. Recent reinforcement learning (RL) based approaches provide scalable solutions to surgical automation, but typically require extensive data collection to solve a task if no prior knowledge is given. This issue is known as the exploration challenge, and it can be alleviated by providing expert demonstrations to an RL agent. Yet how to make effective use of demonstration data to improve exploration efficiency remains an open challenge. In this work, we introduce Demonstration-guided EXploration (DEX), an efficient reinforcement learning algorithm that aims to overcome the exploration problem with expert demonstrations for surgical automation. To effectively exploit demonstrations, our method estimates expert-like behaviors with higher values to facilitate productive interactions, and adopts non-parametric regression to enable such guidance at states unobserved in demonstration data. Extensive experiments on 10 surgical manipulation tasks from SurRoL, a comprehensive surgical simulation platform, demonstrate significant improvements in the exploration efficiency and task success rates of our method. We also deploy the learned policies to the da Vinci Research Kit (dVRK) platform to show their effectiveness on a real robot. Code is available at https://github.com/med-air/DEX.
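The two ideas in the abstract (valuing expert-like behavior more highly, and using non-parametric regression to extend that guidance to states absent from the demonstrations) can be illustrated with a small sketch. This is not the authors' implementation: the abstract does not specify the regressor or how values are shaped, so the uniform k-nearest-neighbor average, the squared-distance penalty, and the names `knn_expert_action`, `guided_reward`, `k`, and `alpha` are all assumptions made for illustration.

```python
import math


def knn_expert_action(state, demo_states, demo_actions, k=3):
    """Estimate an expert-like action at `state` (assumed k-NN regressor).

    Averages the demonstrated actions of the k demonstration states
    closest to `state` in Euclidean distance, so the estimate is also
    defined at states that never appear in the demonstrations.
    """
    dists = [math.dist(state, s) for s in demo_states]
    nearest = sorted(range(len(demo_states)), key=lambda i: dists[i])[:k]
    dim = len(demo_actions[0])
    return [
        sum(demo_actions[i][d] for i in nearest) / len(nearest)
        for d in range(dim)
    ]


def guided_reward(reward, action, expert_action, alpha=0.5):
    """Shape the reward so expert-like actions score higher (assumed form).

    Subtracts alpha times the squared distance between the agent's
    action and the estimated expert action.
    """
    return reward - alpha * math.dist(action, expert_action) ** 2


# Toy 1-D demonstrations in which the expert action equals the state.
demo_states = [[0.0], [1.0], [2.0], [10.0]]
demo_actions = [[0.0], [1.0], [2.0], [10.0]]

# Average over the three nearest demonstration states (1.0, 0.0, 2.0).
expert = knn_expert_action([1.0], demo_states, demo_actions, k=3)
shaped = guided_reward(0.0, [2.0], expert, alpha=0.5)
```

In this toy setup, an action that matches the regressed expert action keeps its original reward, while one that deviates is penalized in proportion to the squared deviation. An actual actor-critic method would apply such a term inside its value or policy updates rather than to a single reward.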


research
07/31/2023

Value-Informed Skill Chaining for Policy Learning of Long-Horizon Tasks with Surgical Robot

Reinforcement learning is still struggling with solving long-horizon sur...
research
10/15/2021

Toward Learning Context-Dependent Tasks from Demonstration for Tendon-Driven Surgical Robots

Tendon-driven robots, a type of continuum robot, have the potential to r...
research
01/01/2023

Human-in-the-loop Embodied Intelligence with Interactive Simulation Environment for Surgical Robot Learning

Surgical robot automation has attracted increasing research interest ove...
research
05/31/2019

Extending Deep Model Predictive Control with Safety Augmented Value Estimation from Demonstrations

Reinforcement learning (RL) for robotics is challenging due to the diffi...
research
10/13/2021

Safe Driving via Expert Guided Policy Optimization

When learning common skills like driving, beginners usually have domain ...
research
05/25/2023

Decomposing the Enigma: Subgoal-based Demonstration Learning for Formal Theorem Proving

Large language models (LLMs) present an intriguing avenue of exploration...
research
11/16/2019

Reinforcement Learning from Imperfect Demonstrations under Soft Expert Guidance

In this paper, we study Reinforcement Learning from Demonstrations (RLfD...
