Block-based programming environments are increasingly used to introduce
...
We aim to understand how people assess human likeness in navigation prod...
Trust Region Policy Optimization (TRPO) is an iterative method that
simu...
Many contrastive and meta-learning approaches learn representations by
i...
Diffusion models have emerged as powerful generative models in the
text-...
Randomly masking and predicting word tokens has been a successful approa...
Randomly masking and predicting word tokens has been a successful approa...
We present a new monotonic improvement guarantee for optimizing decentra...
Proximal Policy Optimization (PPO) methods learn a policy by iteratively...
Sample efficiency is crucial for imitation learning methods to be applic...
A key challenge on the path to developing agents that learn complex
huma...
Although deep reinforcement learning has led to breakthroughs in many
di...
In order for agents trained by deep reinforcement learning to work along...
Agents that interact with other agents often do not know a priori what t...
Policy gradient methods have become one of the most popular classes of
a...
Game agents such as opponents, non-player characters, and teammates are
...
Variational inference (VI) plays an essential role in approximate Bayesi...
We address the problem of learning reusable state representations from
s...
Throughout scientific history, overarching theoretical frameworks have
a...
Game-playing Evolutionary Algorithms, specifically Rolling Horizon
Evolu...
The ability for policies to generalize to new environments is key to the...
Real-world congestion problems (e.g. traffic congestion) are typically v...
Learning in multi-agent scenarios is a fruitful research direction, but
...
In 2016 and 2017 at the IEEE Conference on Computational Intelligence in...
Esports has emerged as a popular genre for players as well as spectators...