-
Work in Progress: Temporally Extended Auxiliary Tasks
Predictive auxiliary tasks have been shown to improve performance in num...
read it
-
Gamma-Nets: Generalizing Value Estimation over Timescale
We present Γ-nets, a method for generalizing value function estimation o...
read it
-
Accelerating Learning in Constructive Predictive Frameworks with the Successor Representation
Here we propose using the successor representation (SR) to accelerate le...
read it
-
Directly Estimating the Variance of the λ-Return Using Temporal-Difference Methods
This paper investigates estimating the variance of a temporal-difference...
read it
-
Communicative Capital for Prosthetic Agents
This work presents an overarching perspective on the role that machine i...
read it
-
Introspective Agents: Confidence Measures for General Value Functions
Agents of general intelligence deployed in real-world scenarios must ada...
read it

Craig Sherstan
is this you? claim profile