Low Precision Policy Distillation with Application to Low-Power, Real-time Sensation-Cognition-Action Loop with Neuromorphic Computing

09/25/2018
by   Jeffrey L. McKinstry, et al.
0

Low precision networks in the reinforcement learning (RL) setting are relatively unexplored because of the limitations of binary activations for function approximation. Here, in the discrete action ATARI domain, we demonstrate, for the first time, that low precision policy distillation from a high precision network provides a principled, practical way to train an RL agent. As an application, on 10 different ATARI games, we demonstrate real-time end-to-end game playing on low-power neuromorphic hardware by converting a sequence of game frames into discrete actions.

READ FULL TEXT
research
12/29/2019

Real-time Policy Distillation in Deep Reinforcement Learning

Policy distillation in deep reinforcement learning provides an effective...
research
02/26/2021

Low-Precision Reinforcement Learning

Low-precision training has become a popular approach to reduce computati...
research
12/23/2019

Discrete and Continuous Action Representation for Practical RL in Video Games

While most current research in Reinforcement Learning (RL) focuses on im...
research
10/22/2019

A low-power end-to-end hybrid neuromorphic framework for surveillance applications

With the success of deep learning, object recognition systems that can b...
research
11/29/2020

Hybrid Imitation Learning for Real-Time Service Restoration in Resilient Distribution Systems

Self-healing capability is one of the most critical factors for a resili...
research
11/03/2022

Synthesis of separation processes with reinforcement learning

This paper shows the implementation of reinforcement learning (RL) in co...
research
09/24/2021

A Graph Policy Network Approach for Volt-Var Control in Power Distribution Systems

Volt-var control (VVC) is the problem of operating power distribution sy...

Please sign up or login with your details

Forgot password? Click here to reset