Tournament selection in zeroth-level classifier systems based on average reward reinforcement learning

04/26/2016
by Zhaoxiang Zang, et al.

As a genetics-based machine learning technique, the zeroth-level classifier system (ZCS) relies on a discounted-reward reinforcement learning algorithm, the bucket-brigade algorithm. This algorithm optimizes the discounted total reward received by an agent, but it is not suitable for all multi-step problems, especially large ones. Undiscounted reinforcement learning methods such as R-learning instead optimize the average reward per time step. In this paper, R-learning replaces the discounted-reward reinforcement learning approach in ZCS, and tournament selection replaces roulette wheel selection. The resulting classifier systems can support long action chains and are thus able to solve large multi-step problems.
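The two ingredients named in the abstract can be illustrated with a minimal sketch. It is not the paper's implementation: the abstract does not give the update rules, so the code below assumes the textbook tabular form of R-learning and a plain fixed-size tournament. The names `tournament_select`, `r_learning_update`, `strengths`, `k`, `beta`, and `alpha` are all hypothetical, and how ZCS maps these updates onto classifier strengths is left out.

```python
import random
from collections import defaultdict


def tournament_select(strengths, k, rng):
    """Return the index of the strongest of k uniformly sampled candidates.

    Assumption: a fixed tournament size k; the paper may size it differently.
    """
    contestants = rng.sample(range(len(strengths)), k)
    return max(contestants, key=lambda i: strengths[i])


def r_learning_update(R, rho, s, a, r, s_next, actions, beta=0.1, alpha=0.01):
    """Apply one tabular R-learning step and return the new average-reward estimate.

    R maps (state, action) to a relative value; rho estimates the average
    reward per time step. No discount factor appears anywhere.
    """
    best_next = max(R[(s_next, b)] for b in actions)
    best_here = max(R[(s, b)] for b in actions)
    was_greedy = R[(s, a)] == best_here
    # Relative value update: reward is measured against the average reward rho.
    R[(s, a)] += beta * (r - rho + best_next - R[(s, a)])
    # The average-reward estimate is adjusted only after greedy actions.
    if was_greedy:
        rho += alpha * (r - rho + best_next - best_here)
    return rho


if __name__ == "__main__":
    rng = random.Random(0)
    strengths = [1.0, 5.0, 2.0, 0.5]
    print("selected parent:", tournament_select(strengths, k=2, rng=rng))

    R = defaultdict(float)
    rho = r_learning_update(R, 0.0, s=0, a=0, r=1.0, s_next=1, actions=[0, 1])
    print("rho:", rho, "R[(0, 0)]:", R[(0, 0)])
```

Unlike roulette wheel selection, the tournament depends only on the ranking of strengths, not on their absolute scale, which is one common reason for preferring it when values are expressed relative to an average reward.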

research
02/27/2023

Reinforcement Learning with Depreciating Assets

A basic assumption of traditional reinforcement learning is that the val...
research
11/24/2020

Learning Principle of Least Action with Reinforcement Learning

Nature provides a way to understand physics with reinforcement learning ...
research
02/25/2020

G-Learner and GIRL: Goal Based Wealth Management with Reinforcement Learning

We present a reinforcement learning approach to goal based wealth manage...
research
06/17/2019

LPaintB: Learning to Paint from Self-Supervision

We present a novel reinforcement learning-based natural media painting a...
research
03/13/2017

Reinforcement Learning for Transition-Based Mention Detection

This paper describes an application of reinforcement learning to the men...
research
09/25/2019

"Good Robot!": Efficient Reinforcement Learning for Multi-Step Visual Tasks via Reward Shaping

In order to learn effectively, robots must be able to extract the intang...
research
06/01/2022

RLSS: A Deep Reinforcement Learning Algorithm for Sequential Scene Generation

We present RLSS: a reinforcement learning algorithm for sequential scene...
