Reinforcement learning from human feedback (RLHF) is a technique for tra...
To optimally coordinate with others in cooperative games, it is often cr...
As social media continues to have a significant influence on public opin...
Manipulation is a common concern in many domains, such as social media,
...
Research in Fairness, Accountability, Transparency, and Ethics (FATE) ha...
One of the most successful paradigms for reward learning uses human feed...
Randomly masking and predicting word tokens has been a successful approa...
AI agents designed to collaborate with people benefit from models that e...
Randomly masking and predicting word tokens has been a successful approa...
The content that a recommender system (RS) shows to users influences the...
In order for agents trained by deep reinforcement learning to work along...
While we would like agents that can coordinate with humans, current
algo...