Data-Driven Evaluation of Training Action Space for Reinforcement Learning

04/08/2022
by   Rajat Ghosh, et al.
0

Training action space selection for reinforcement learning (RL) is conflict-prone due to complex state-action relationships. To address this challenge, this paper proposes a Shapley-inspired methodology for training action space categorization and ranking. To reduce exponential-time shapley computations, the methodology includes a Monte Carlo simulation to avoid unnecessary explorations. The effectiveness of the methodology is illustrated using a cloud infrastructure resource tuning case study. It reduces the search space by 80% and categorizes the training action sets into dispensable and indispensable groups. Additionally, it ranks different training actions to facilitate high-performance yet cost-efficient RL model design. The proposed data-driven methodology is extensible to different domains, use cases, and reinforcement learning algorithms.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
11/20/2022

Safe Reinforcement Learning using Data-Driven Predictive Control

Reinforcement learning (RL) algorithms can achieve state-of-the-art perf...
research
04/26/2018

Action Categorization for Computationally Improved Task Learning and Planning

This paper explores the problem of task learning and planning, contribut...
research
03/13/2019

Task-oriented Design through Deep Reinforcement Learning

We propose a new low-cost machine-learning-based methodology which assis...
research
04/02/2020

Action Space Shaping in Deep Reinforcement Learning

Reinforcement learning (RL) has been successful in training agents in va...
research
05/11/2022

Characterizing the Action-Generalization Gap in Deep Q-Learning

We study the action generalization ability of deep Q-learning in discret...
research
05/11/2023

Towards Theoretical Understanding of Data-Driven Policy Refinement

This paper presents an approach for data-driven policy refinement in rei...
research
08/24/2022

Efficient Data-Driven Network Functions

Cloud environments require dynamic and adaptive networking policies. It ...

Please sign up or login with your details

Forgot password? Click here to reset