Understanding reinforcement learned crowds

09/19/2022
by   Ariel Kwiatkowski, et al.
38

Simulating trajectories of virtual crowds is a commonly encountered task in Computer Graphics. Several recent works have applied Reinforcement Learning methods to animate virtual agents, however they often make different design choices when it comes to the fundamental simulation setup. Each of these choices comes with a reasonable justification for its use, so it is not obvious what is their real impact, and how they affect the results. In this work, we analyze some of these arbitrary choices in terms of their impact on the learning performance, as well as the quality of the resulting simulation measured in terms of the energy efficiency. We perform a theoretical analysis of the properties of the reward function design, and empirically evaluate the impact of using certain observation and action spaces on a variety of scenarios, with the reward function and energy usage as metrics. We show that directly using the neighboring agents' information as observation generally outperforms the more widely used raycasting. Similarly, using nonholonomic controls with egocentric observations tends to produce more efficient behaviors than holonomic controls with absolute observations. Each of these choices has a significant, and potentially nontrivial impact on the results, and so researchers should be mindful about choosing and reporting them in their work.

READ FULL TEXT

page 13

page 14

page 17

page 19

page 20

page 22

page 23

page 24

research
06/02/2018

Internal Model from Observations for Reward Shaping

Reinforcement learning methods require careful design involving a reward...
research
11/01/2021

On the Expressivity of Markov Reward

Reward is the driving force for reinforcement-learning agents. This pape...
research
12/14/2021

Programmatic Reward Design by Example

Reward design is a fundamental problem in reinforcement learning (RL). A...
research
10/10/2022

Optimal wireless rate and power control in the presence of jammers using reinforcement learning

Future wireless networks require high throughput and energy efficiency. ...
research
10/08/2020

Information-Driven Adaptive Sensing Based on Deep Reinforcement Learning

In order to make better use of deep reinforcement learning in the creati...
research
11/02/2020

Observation Space Matters: Benchmark and Optimization Algorithm

Recent advances in deep reinforcement learning (deep RL) enable research...

Please sign up or login with your details

Forgot password? Click here to reset