Robot Playing Kendama with Model-Based and Model-Free Reinforcement Learning

03/15/2020
by Shidi Li, et al.

Several model-based and model-free methods have been proposed for robot trajectory learning. Each approach has its benefits and drawbacks, and the two can often complement each other. Many works integrate model-based and model-free methods into a single algorithm, and these perform well in simulators or on quasi-static robot tasks. Difficulties remain, however, when such algorithms are applied to particular trajectory learning tasks. In this paper, we propose a robot trajectory learning framework for precise, high-speed tasks with discontinuous dynamics. Trajectories learned from human demonstration are optimized successively by DDP (differential dynamic programming) and PoWER (Policy learning by Weighting Exploration with the Returns). The framework is tested on the Kendama manipulation task, which is difficult even for humans. The results show that our approach can plan trajectories that successfully complete the task.
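
To make the model-free refinement stage more concrete, the following is a minimal sketch of a PoWER-style reward-weighted parameter update on a toy problem. It is an illustration under stated assumptions, not the authors' implementation: the return function `episodic_return`, the `target` vector, the starting parameters `theta0` (standing in for a trajectory already obtained from demonstration and model-based optimization), and all hyperparameters are hypothetical. It also simplifies PoWER by drawing fresh rollouts each iteration instead of reusing the best past rollouts through importance sampling.

```python
import numpy as np


def episodic_return(theta, target):
    """Hypothetical stand-in for a robot rollout.

    Returns a value in (0, 1] that is highest when theta equals target.
    """
    return float(np.exp(-np.sum((theta - target) ** 2)))


def power_style_update(theta, target, rng, n_rollouts=20, n_best=5, sigma=0.2):
    """One reward-weighted update over the best perturbed rollouts."""
    # Explore in parameter space with Gaussian perturbations.
    eps = rng.normal(0.0, sigma, size=(n_rollouts, theta.size))
    returns = np.array([episodic_return(theta + e, target) for e in eps])

    # Keep only the n_best rollouts and weight their perturbations by return.
    best = np.argsort(returns)[-n_best:]
    weights = returns[best]
    return theta + weights @ eps[best] / weights.sum()


def optimize(theta0, target, n_iters=50, seed=0):
    """Repeat the update from an initial parameter vector."""
    rng = np.random.default_rng(seed)
    theta = np.array(theta0, dtype=float)
    for _ in range(n_iters):
        theta = power_style_update(theta, target, rng)
    return theta


# Assumed toy setup: theta0 plays the role of the pre-optimized trajectory.
target = np.array([0.5, -0.3, 0.8, 0.1, -0.6])
theta0 = np.zeros(5)

theta = optimize(theta0, target)
initial_return = episodic_return(theta0, target)
final_return = episodic_return(theta, target)
print(f"return before: {initial_return:.3f}, after: {final_return:.3f}")
```

Because the update is a return-weighted average of the explored perturbations, it needs no gradient of the return and no dynamics model, which is why a method of this kind can follow a model-based stage when the dynamics are discontinuous.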

research
12/08/2019

Value-of-Information based Arbitration between Model-based and Model-free Control

There have been numerous attempts in explaining the general learning beh...
research
02/28/2023

Model-Free and Learning-Free Proprioceptive Humanoid Movement Control

This paper presents a novel model-free method for humanoid-robot quasi-s...
research
07/05/2018

Optimizing Execution of Dynamic Goal-Directed Robot Movements with Learning Control

Highly dynamic tasks that require large accelerations and precise tracki...
research
11/03/2020

Goal recognition via model-based and model-free techniques

Goal recognition aims at predicting human intentions from a trace of obs...
research
07/14/2021

Model-free Reinforcement Learning for Robust Locomotion Using Trajectory Optimization for Exploration

In this work we present a general, two-stage reinforcement learning appr...
research
07/08/2019

Data Efficient Reinforcement Learning for Legged Robots

We present a model-based framework for robot locomotion that achieves wa...
research
02/25/2022

Behaviorally Grounded Model-Based and Model Free Cost Reduction in a Simulated Multi-Echelon Supply Chain

Amplification and phase shift in ordering signals, commonly referred to ...
