Unsupervised Reward Shaping for a Robotic Sequential Picking Task from Visual Observations in a Logistics Scenario

09/25/2022
by Vittorio Giammarino, et al.

We focus on an unloading problem, typical of the logistics sector, modeled as a sequential pick-and-place task. In this type of task, modern machine learning techniques have been shown to work better than classic systems, since they adapt more readily to stochasticity and cope better with large uncertainties. More specifically, supervised and imitation learning have achieved outstanding results in this regard, with the shortcoming of requiring some form of supervision that is not always obtainable in every setting. Reinforcement learning (RL), on the other hand, requires a much milder form of supervision but remains impractical due to its inefficiency. In this paper, we propose and theoretically motivate a novel Unsupervised Reward Shaping algorithm from expert's observations, which relaxes the level of supervision required by the agent and improves RL performance in our task.


