Criticality-Based Varying Step-Number Algorithm for Reinforcement Learning

01/13/2022
by   Yitzhak Spielberg, et al.
13

In the context of reinforcement learning we introduce the concept of criticality of a state, which indicates the extent to which the choice of action in that particular state influences the expected return. That is, a state in which the choice of action is more likely to influence the final outcome is considered as more critical than a state in which it is less likely to influence the final outcome. We formulate a criticality-based varying step number algorithm (CVS) - a flexible step number algorithm that utilizes the criticality function provided by a human, or learned directly from the environment. We test it in three different domains including the Atari Pong environment, Road-Tree environment, and Shooter environment. We demonstrate that CVS is able to outperform popular learning algorithms such as Deep Q-Learning and Monte Carlo.

READ FULL TEXT

page 9

page 13

page 14

research
10/16/2018

The Concept of Criticality in Reinforcement Learning

Reinforcement learning methods carry a well known bias-variance trade-of...
research
01/22/2019

Understanding Multi-Step Deep Reinforcement Learning: A Systematic Study of the DQN Target

Multi-step methods such as Retrace(λ) and n-step Q-learning have become ...
research
12/08/2020

Combining Reinforcement Learning with Lin-Kernighan-Helsgaun Algorithm for the Traveling Salesman Problem

We address the Traveling Salesman Problem (TSP), a famous NP-hard combin...
research
11/16/2022

Addressing the issue of stochastic environments and local decision-making in multi-objective reinforcement learning

Multi-objective reinforcement learning (MORL) is a relatively new field ...
research
10/07/2022

Advice Conformance Verification by Reinforcement Learning agents for Human-in-the-Loop

Human-in-the-loop (HiL) reinforcement learning is gaining traction in do...
research
08/09/2023

Variations on the Reinforcement Learning performance of Blackjack

Blackjack or "21" is a popular card-based game of chance and skill. The ...
research
08/01/2021

A survey of Monte Carlo methods for noisy and costly densities with application to reinforcement learning

This survey gives an overview of Monte Carlo methodologies using surroga...

Please sign up or login with your details

Forgot password? Click here to reset