Craig Sherstan


Featured Co-authors

Peter Stone
126 publications
Martha White
65 publications
Richard S. Sutton
47 publications
Matthew E. Taylor
39 publications
Adam White
37 publications
Patrick M. Pilarski
34 publications
Kory W. Mathewson
22 publications
Marlos C. Machado
20 publications
Pablo Hernandez-Leal
13 publications
Kenny Young
10 publications
Bilal Kartal
10 publications

research

∙ 06/24/2022

Value Function Decomposition for Iterative Design of Reinforcement Learning Agents

Designing reinforcement learning (RL) agents is typically a difficult pr...

James MacGlashan, et al.

research

∙ 04/01/2020

Work in Progress: Temporally Extended Auxiliary Tasks

Predictive auxiliary tasks have been shown to improve performance in num...

Craig Sherstan, et al.

research

∙ 11/18/2019

Gamma-Nets: Generalizing Value Estimation over Timescale

We present Γ-nets, a method for generalizing value function estimation o...

Craig Sherstan, et al.

research

∙ 03/23/2018

Accelerating Learning in Constructive Predictive Frameworks with the Successor Representation

Here we propose using the successor representation (SR) to accelerate le...

Craig Sherstan, et al.

research

∙ 01/25/2018

Directly Estimating the Variance of the λ-Return Using Temporal-Difference Methods

This paper investigates estimating the variance of a temporal-difference...

Craig Sherstan, et al.

research

∙ 11/10/2017

Communicative Capital for Prosthetic Agents

This work presents an overarching perspective on the role that machine i...

Patrick M. Pilarski, et al.

research

∙ 06/17/2016

Introspective Agents: Confidence Measures for General Value Functions

Agents of general intelligence deployed in real-world scenarios must ada...

Craig Sherstan, et al.

