- Avoiding Side Effects in Complex Environments
Reward function specification can be difficult, even in simple environme...
- Optimal Farsighted Agents Tend to Seek Power
Some researchers have speculated that capable reinforcement learning (RL...
- Conservative Agency via Attainable Utility Preservation
Reward functions are often misspecified. An agent optimizing an incorrec...

Alexander Matt Turner