Machine Teaching in Hierarchical Genetic Reinforcement Learning: Curriculum Design of Reward Functions for Swarm Shepherding

01/04/2019
by   Nicholas R. Clayton, et al.
0

The design of reward functions in reinforcement learning is a human skill that comes with experience. Unfortunately, there is not any methodology in the literature that could guide a human to design the reward function or to allow a human to transfer the skills developed in designing reward functions to another human and in a systematic manner. In this paper, we use Systematic Instructional Design, an approach in human education, to engineer a machine education methodology to design reward functions for reinforcement learning. We demonstrate the methodology in designing a hierarchical genetic reinforcement learner that adopts a neural network representation to evolve a swarm controller for an agent shepherding a boids-based swarm. The results reveal that the methodology is able to guide the design of hierarchical reinforcement learners, with each model in the hierarchy learning incrementally through a multi-part reward function. The hierarchy acts as a decision fusion function that combines the individual behaviours and skills learnt by each instruction to create a smart shepherd to control the swarm.

READ FULL TEXT
research
04/24/2020

Continuous Deep Hierarchical Reinforcement Learning for Ground-Air Swarm Shepherding

The control and guidance of multi-robots (swarm) is a non-trivial proble...
research
02/16/2018

Diversity is All You Need: Learning Skills without a Reward Function

Intelligent creatures can explore their environments and learn useful sk...
research
06/30/2023

Design of Induction Machines using Reinforcement Learning

The design of induction machine is a challenging task due to different e...
research
11/19/2018

Scalable agent alignment via reward modeling: a research direction

One obstacle to applying reinforcement learning algorithms to real-world...
research
01/04/2019

Transparent Machine Education of Neural Networks for Swarm Shepherding Using Curriculum Design

Swarm control is a difficult problem due to the need to guide a large nu...
research
07/15/2019

Mutual Reinforcement Learning

Recently, collaborative robots have begun to train humans to achieve com...
research
02/07/2020

Machine Education: Designing semantically ordered and ontologically guided modular neural networks

The literature on machine teaching, machine education, and curriculum de...

Please sign up or login with your details

Forgot password? Click here to reset