Metareasoning in Modular Software Systems: On-the-Fly Configuration using Reinforcement Learning with Rich Contextual Representations

05/12/2019
by Aditya Modi, et al.

Assemblies of modular subsystems are being pressed into service to perform sensing, reasoning, and decision making in high-stakes, time-critical tasks in areas such as transportation, healthcare, and industrial automation. We address the opportunity to maximize the utility of an overall computing system by employing reinforcement learning to guide the configuration of the set of interacting modules that comprise the system. System-wide optimization is a combinatorial problem. Local attempts to boost the performance of a specific module by modifying its configuration often lead to losses in the overall utility of the system, because the distribution of inputs to downstream modules changes drastically. We present metareasoning techniques that consider a rich representation of the input, monitor the state of the entire pipeline, and adjust the configuration of modules on the fly so as to maximize the utility of the system's operation. We show significant improvement in both real-world and synthetic pipelines across a variety of reinforcement learning techniques.
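
The idea of choosing a joint module configuration per input can be illustrated with a minimal sketch. This is not the authors' method: it swaps in a tabular epsilon-greedy contextual bandit, and the two-module pipeline, the "easy"/"hard" context labels, the `utility` values, and all names below are hypothetical, invented purely for illustration. The toy utility is set up so that upgrading only one module on hard inputs lowers system-wide utility, mirroring the local-versus-global tension described above.

```python
import itertools
import random

# Hypothetical toy pipeline: two modules, each with two settings.
# A joint configuration is a pair (module_1_setting, module_2_setting).
CONFIGS = list(itertools.product(["fast", "accurate"], repeat=2))


def utility(context, config):
    """Invented system-wide utility (assumption, not from the paper)."""
    n_accurate = config.count("accurate")
    if context == "easy":
        # Easy inputs: extra accuracy only adds latency cost.
        return {0: 1.0, 1: 0.6, 2: 0.4}[n_accurate]
    # Hard inputs: upgrading a single module hurts (mismatched pipeline),
    # upgrading both gives the best utility.
    return {0: 0.5, 1: 0.3, 2: 1.0}[n_accurate]


class EpsilonGreedyConfigurator:
    """Tabular epsilon-greedy contextual bandit over joint configurations."""

    def __init__(self, configs, epsilon=0.1, seed=0):
        self.configs = configs
        self.epsilon = epsilon
        self.rng = random.Random(seed)
        self.counts = {}  # (context, config) -> number of pulls
        self.means = {}   # (context, config) -> running mean utility

    def greedy(self, context):
        # Untried configurations score +inf, so each is tried at least once.
        return max(
            self.configs,
            key=lambda c: self.means.get((context, c), float("inf")),
        )

    def choose(self, context):
        # Explore with probability epsilon, otherwise exploit.
        if self.rng.random() < self.epsilon:
            return self.rng.choice(self.configs)
        return self.greedy(context)

    def update(self, context, config, reward):
        # Incremental running-mean update for the observed utility.
        key = (context, config)
        n = self.counts.get(key, 0) + 1
        self.counts[key] = n
        mean = self.means.get(key, 0.0)
        self.means[key] = mean + (reward - mean) / n


def train(steps=2000, seed=0):
    """Run the configurator on a stream of randomly drawn contexts."""
    rng = random.Random(seed)
    agent = EpsilonGreedyConfigurator(CONFIGS, epsilon=0.1, seed=seed)
    for _ in range(steps):
        context = rng.choice(["easy", "hard"])
        config = agent.choose(context)
        agent.update(context, config, utility(context, config))
    return agent
```

Calling `train()` and then `agent.greedy("hard")` returns the joint configuration the agent currently believes is best for that context. A realistic system would replace the discrete context label with a rich feature representation of the input and pipeline state, and the table with a learned value or policy model.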


research
11/08/2018

Modular Architecture for StarCraft II with Deep Reinforcement Learning

We present a novel modular architecture for StarCraft II AI. The archite...
research
07/25/2022

Post-processing Networks: Method for Optimizing Pipeline Task-oriented Dialogue Systems using Reinforcement Learning

Many studies have proposed methods for optimizing the dialogue performan...
research
10/03/2020

Automated Performance Tuning for Highly-Configurable Software Systems

Performance is an important non-functional aspect of the software requir...
research
06/01/2023

Augmented Modular Reinforcement Learning based on Heterogeneous Knowledge

In order to mitigate some of the inefficiencies of Reinforcement Learnin...
research
08/05/2020

Optimizing AD Pruning of Sponsored Search with Reinforcement Learning

Industrial sponsored search system (SSS) can be logically divided into t...
research
05/08/2022

Introduction to Soar

This paper is the recommended initial reading for a functional overview ...
research
05/08/2020

Using Taint Analysis and Reinforcement Learning (TARL) to Repair Autonomous Robot Software

It is important to be able to establish formal performance bounds for au...
