Improved Reinforcement Learning Coordinated Control of a Mobile Manipulator using Joint Clamping

10/05/2021
by Denis Hadjivelichkov, et al.

Many robotic path planning problems are continuous, stochastic, and high-dimensional. Coordinating the base and manipulator of a mobile manipulator to control its whole body online is particularly challenging when both self-collision and environment-collision avoidance are required. Reinforcement learning techniques have the potential to solve such problems through their ability to generalise over environments. We study the joint penalties and joint limits of a state-of-the-art mobile manipulator whole-body controller that uses LIDAR sensing for obstacle collision avoidance, and we propose directions to improve the reinforcement learning method. Our agent achieves significantly higher success rates than the baseline in a goal-reaching environment, and it can solve environments requiring coordinated whole-body control on which the baseline fails.

research
02/25/2020

Whole-Body Control of a Mobile Manipulator using End-to-End Reinforcement Learning

Mobile manipulation is usually achieved by sequentially executing base a...
research
08/17/2020

Model-Reference Reinforcement Learning for Collision-Free Tracking Control of Autonomous Surface Vehicles

This paper presents a novel model-reference reinforcement learning algor...
research
09/23/2022

Safe Real-World Reinforcement Learning for Mobile Agent Obstacle Avoidance

Collision avoidance is key for mobile robots and agents to operate safel...
research
11/12/2018

Navigating Assistance System for Quadcopter with Deep Reinforcement Learning

In this paper, we present a deep reinforcement learning method for quadc...
research
07/16/2020

Collision Avoidance Robotics Via Meta-Learning (CARML)

This paper presents an approach to exploring a multi-objective reinforce...
research
03/17/2023

An Adaptive Fuzzy Reinforcement Learning Cooperative Approach for the Autonomous Control of Flock Systems

The flock-guidance problem enjoys a challenging structure where multiple...
research
11/22/2019

Fleet Control using Coregionalized Gaussian Process Policy Iteration

In many settings, as for example wind farms, multiple machines are insta...
