One of the most surprising puzzles in neural network generalisation is
g...
One of the gnarliest challenges in reinforcement learning (RL) is explor...
Recent work has shown that asking language models to generate reasoning ...
The field of AI alignment is concerned with AI systems that pursue unint...
Causal models of agents have been used to analyse the safety aspects of
...
Agents should avoid unsafe behaviour during both training and deployment...
Formal Methods for the Informal Engineer (FMIE) was a workshop held at t...
How can we design agents that pursue a given objective when all feedback...
This paper describes REALab, a platform for embedded agency research in
...
Proposals for safe AGI systems are typically made at the level of framew...
We implement a automated tactical prover TacticToe on top of the HOL4
in...
Many potentially non-terminating functions cannot be directly defined in...