Model-based actor-critic: GAN + DRL (actor-critic) => AGI

04/04/2020
by Aras Dargazany, et al.

Our effort is toward unifying GAN and DRL algorithms into a single AI model (AGI, i.e., general-purpose AI or artificial general intelligence) with general-purpose applications to: (A) offline learning (of stored data), like GAN in an (un/semi-/fully-)supervised learning (SL) setting, such as big data analytics (mining) and visualization; and (B) online learning (of real or simulated devices), like DRL in an RL setting (with or without environment reward), such as (real or simulated) robotics and control. Our core proposal is adding a (generative/predictive) environment model to the (model-free) actor-critic architecture, which results in a model-based actor-critic architecture with temporal-difference (TD) error and an episodic memory. The proposed AI model is similar to (model-free) DDPG and is therefore called model-based DDPG. To evaluate it, we compare it with (model-free) DDPG by applying both to a wide range of independent simulated robotic and control task environments in OpenAI Gym and Unity Agents. Our initial, limited experiments show that combining DRL and GAN in a model-based actor-critic results in the incremental goal-driven intelligence required to solve each task, with performance similar to (model-free) DDPG. Our future focus is to investigate the proposed AI model's potential to: (A) unify the DRL field inside AI by producing competitive performance compared to the best model-based (PlaNet) and model-free (D4PG) approaches; and (B) bridge the gap between the AI and robotics communities by solving the important problem of reward engineering through learning the reward function by demonstration.
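The three ingredients named in the abstract (an episodic memory, a TD error for the critic, and a predictive environment model) can be illustrated with a minimal sketch. The abstract does not give the paper's actual losses or data structures, so everything below is an assumption for illustration: the names `EpisodicMemory`, `td_error`, and `model_loss` are hypothetical, the TD error is the standard one-step form, and the model loss is a plain squared error on the predicted next state and reward.

```python
import random
from collections import deque


class EpisodicMemory:
    """Hypothetical fixed-size buffer of (state, action, reward, next_state, done)."""

    def __init__(self, capacity):
        # Oldest transitions are dropped once capacity is reached.
        self.buffer = deque(maxlen=capacity)

    def add(self, state, action, reward, next_state, done):
        self.buffer.append((state, action, reward, next_state, done))

    def sample(self, batch_size):
        # Uniform sampling without replacement, capped at the buffer size.
        return random.sample(list(self.buffer), min(batch_size, len(self.buffer)))

    def __len__(self):
        return len(self.buffer)


def td_error(reward, q_value, next_q_value, done, gamma=0.99):
    """Standard one-step TD error: (r + gamma * Q(s', a')) - Q(s, a)."""
    # No bootstrapping from the next state at the end of an episode.
    target = reward + (0.0 if done else gamma * next_q_value)
    return target - q_value


def model_loss(predicted_next_state, predicted_reward, next_state, reward):
    """Assumed environment-model loss: mean squared state error plus squared reward error."""
    state_error = sum(
        (p - t) ** 2 for p, t in zip(predicted_next_state, next_state)
    ) / len(next_state)
    return state_error + (predicted_reward - reward) ** 2


# Usage: store a transition, then score a critic estimate and a model prediction.
memory = EpisodicMemory(capacity=1000)
memory.add(state=[0.0, 1.0], action=[0.5], reward=1.0, next_state=[0.1, 0.9], done=False)
state, action, reward, next_state, done = memory.sample(1)[0]
delta = td_error(reward, q_value=0.5, next_q_value=2.0, done=done, gamma=0.5)
loss = model_loss([0.1, 0.9], 1.0, next_state, reward)
```

In a DDPG-style setup the TD error would drive the critic update and the critic's gradient would drive the actor; in a model-based variant, the environment model's predictions could supply the learning signal when no environment reward is available. How the paper actually combines these signals is not stated in the abstract.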


research
10/04/2020

FORK: A Forward-Looking Actor For Model-Free Reinforcement Learning

In this paper, we propose a new type of Actor, named forward-looking Act...
research
01/22/2022

Actor-Critic-Based Learning for Zero-touch Joint Resource and Energy Control in Network Slicing

To harness the full potential of beyond 5G (B5G) communication systems, ...
research
04/28/2020

Sample-Efficient Model-based Actor-Critic for an Interactive Dialogue Task

Human-computer interactive systems that rely on machine learning are bec...
research
01/22/2022

A Collaborative Statistical Actor-Critic Learning Approach for 6G Network Slicing Control

Artificial intelligence (AI)-driven zero-touch massive network slicing i...
research
06/04/2021

A Learning-based Optimal Market Bidding Strategy for Price-Maker Energy Storage

Load serving entities with storage units reach sizes and performances th...
research
06/28/2017

An Actor-Critic Contextual Bandit Algorithm for Personalized Mobile Health Interventions

Increasing technological sophistication and widespread use of smartphone...
research
06/06/2018

Model-free, Model-based, and General Intelligence

During the 60s and 70s, AI researchers explored intuitions about intelli...
