Cascade Attribute Network: Decomposing Reinforcement Learning Control Policies using Hierarchical Neural Networks

05/07/2020
by   Haonan Chang, et al.
0

Reinforcement learning methods have been developed to achieve great success in training control policies in various automation tasks. However, a main challenge of the wider application of reinforcement learning in practical automation is that the training process is hard and the pretrained policy networks are hardly reusable in other similar cases. To address this problem, we propose the cascade attribute network (CAN), which utilizes its hierarchical structure to decompose a complicated control policy in terms of the requirement constraints, which we call attributes, encoded in the control tasks. We validated the effectiveness of our proposed method on two robot control scenarios with various add-on attributes. For some control tasks with more than one add-on attribute attribute, by directly assembling the attribute modules in cascade, the CAN can provide ideal control policies in a zero-shot manner.

READ FULL TEXT

page 3

page 5

research
11/24/2017

Cascade Attribute Learning Network

We propose the cascade attribute learning network (CALNet), which can le...
research
04/05/2022

Configuration Path Control

Reinforcement learning methods often produce brittle policies – policies...
research
02/26/2020

Policy Evaluation Networks

Many reinforcement learning algorithms use value functions to guide the ...
research
04/13/2021

Reinforcement learning for Admission Control in 5G Wireless Networks

The key challenge in admission control in wireless networks is to strike...
research
09/15/2023

How Transferable are Attribute Controllers on Pretrained Multilingual Translation Models?

Customizing machine translation models to comply with fine-grained attri...
research
03/02/2023

Co-learning Planning and Control Policies Using Differentiable Formal Task Constraints

This paper presents a hierarchical reinforcement learning algorithm cons...
research
08/30/2021

Trustworthy AI for Process Automation on a Chylla-Haase Polymerization Reactor

In this paper, genetic programming reinforcement learning (GPRL) is util...

Please sign up or login with your details

Forgot password? Click here to reset