Reinforcement learning from human feedback (RLHF) is a technique for tra...
We study objective robustness failures, a type of out-of-distribution ro...
Interpretability methods for image classification assess model trustwort...
In high-stakes applications of machine learning models, interpretability...