Chris Nota

Chat Image Generator Video Music Voice Chat Photo Editor

Featured Co-authors

Philip S. Thomas
36 publications
Yash Chandak
21 publications
Georgios Theocharous
17 publications
Scott M. Jordan
6 publications
James Kostas
4 publications
Francisco M. Garcia
3 publications

research

∙ 12/28/2022

On the Convergence of Discounted Policy Gradient Methods

Many popular policy gradient methods for reinforcement learning follow a...

0 Chris Nota, et al. ∙

research

∙ 01/06/2020

Learning Reusable Options for Multi-Task Reinforcement Learning

Reinforcement learning (RL) has become an increasingly active area of re...

27 Francisco M. Garcia, et al. ∙

research

∙ 06/17/2019

Is the Policy Gradient a Gradient?

The policy gradient theorem describes the gradient of the expected disco...

0 Chris Nota, et al. ∙

research

∙ 06/06/2019

Classical Policy Gradient: Preserving Bellman's Principle of Optimality

We propose a new objective function for finite-horizon episodic Markov d...

0 Philip S. Thomas, et al. ∙

research

∙ 06/05/2019

Lifelong Learning with a Changing Action Set

In many real-world sequential decision making problems, the number of av...

0 Yash Chandak, et al. ∙

research

∙ 02/15/2019

Asynchronous Coagent Networks: Stochastic Networks for Reinforcement Learning without Backpropagation or a Clock

In this paper we introduce a reinforcement learning (RL) approach for tr...

0 James Kostas, et al. ∙

research

∙ 02/15/2019

Reinforcement Learning Without Backpropagation or a Clock

In this paper we introduce a reinforcement learning (RL) approach for tr...

0 James Kostas, et al. ∙

Success!

An error occurred

Chris Nota

Featured Co-authors

On the Convergence of Discounted Policy Gradient Methods

Learning Reusable Options for Multi-Task Reinforcement Learning

Is the Policy Gradient a Gradient?

Classical Policy Gradient: Preserving Bellman's Principle of Optimality

Lifelong Learning with a Changing Action Set

Asynchronous Coagent Networks: Stochastic Networks for Reinforcement Learning without Backpropagation or a Clock

Reinforcement Learning Without Backpropagation or a Clock

Sign in with Google

Consider DeepAI Pro