Micah Carroll

research

∙ 07/27/2023

Open Problems and Fundamental Limitations of Reinforcement Learning from Human Feedback

Reinforcement learning from human feedback (RLHF) is a technique for tra...

0 Stephen Casper, et al. ∙

research

∙ 06/15/2023

Who Needs to Know? Minimal Knowledge for Optimal Coordination

To optimally coordinate with others in cooperative games, it is often cr...

0 Niklas Lauffer, et al. ∙

research

∙ 05/26/2023

Twitter's Algorithm: Amplifying Anger, Animosity, and Affective Polarization

As social media continues to have a significant influence on public opin...

0 Smitha Milli, et al. ∙

research

∙ 03/16/2023

Characterizing Manipulation from AI Systems

Manipulation is a common concern in many domains, such as social media, ...

0 Micah Carroll, et al. ∙

research

∙ 02/20/2023

Harms from Increasingly Agentic Algorithmic Systems

Research in Fairness, Accountability, Transparency, and Ethics (FATE) ha...

0 Alan Chan, et al. ∙

research

∙ 11/30/2022

Time-Efficient Reward Learning via Visually Assisted Cluster Ranking

One of the most successful paradigms for reward learning uses human feed...

0 David Zhang, et al. ∙

research

∙ 11/20/2022

UniMASK: Unified Inference in Sequential Decision Problems

Randomly masking and predicting word tokens has been a successful approa...

0 Micah Carroll, et al. ∙

research

∙ 11/03/2022

Optimal Behavior Prior: Data-Efficient Human Models for Improved Human-AI Collaboration

AI agents designed to collaborate with people benefit from models that e...

0 Mesut Yang, et al. ∙

research

∙ 04/28/2022

Towards Flexible Inference in Sequential Decision Problems via Bidirectional Transformers

Randomly masking and predicting word tokens has been a successful approa...

2 Micah Carroll, et al. ∙

research

∙ 04/25/2022

Estimating and Penalizing Induced Preference Shifts in Recommender Systems

The content that a recommender system (RS) shows to users influences the...

4 Micah Carroll, et al. ∙

research

∙ 01/14/2021

Evaluating the Robustness of Collaborative Agents

In order for agents trained by deep reinforcement learning to work along...

6 Paul Knott, et al. ∙

research

∙ 10/13/2019

On the Utility of Learning about Humans for Human-AI Coordination

While we would like agents that can coordinate with humans, current algo...

8 Micah Carroll, et al. ∙
