FormulaZero: Distributionally Robust Online Adaptation via Offline Population Synthesis

03/09/2020
by   Aman Sinha, et al.
4

Balancing performance and safety is crucial to deploying autonomous vehicles in multi-agent environments. In particular, autonomous racing is a domain that penalizes safe but conservative policies, highlighting the need for robust, adaptive strategies. Current approaches either make simplifying assumptions about other agents or lack robust mechanisms for online adaptation. This work makes algorithmic contributions to both challenges. First, to generate a realistic, diverse set of opponents, we develop a novel method for self-play based on replica-exchange Markov chain Monte Carlo. Second, we propose a distributionally robust bandit optimization procedure that adaptively adjusts risk aversion relative to uncertainty in beliefs about opponents' behaviors. We rigorously quantify the tradeoffs in performance and robustness when approximating these computations in real-time motion-planning, and we demonstrate our methods experimentally on autonomous vehicles that achieve scaled speeds comparable to Formula One racecars.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
04/04/2019

Online Risk-Bounded Motion Planning for Autonomous Vehicles in Dynamic Environments

A crucial challenge to efficient and robust motion planning for autonomo...
research
09/16/2022

Game-theoretic Objective Space Planning

Autonomous Racing awards agents that react to opponents' behaviors with ...
research
09/18/2023

Multi-Agent Deep Reinforcement Learning for Cooperative and Competitive Autonomous Vehicles using AutoDRIVE Ecosystem

This work presents a modular and parallelizable multi-agent deep reinfor...
research
01/28/2020

Towards Learning Multi-agent Negotiations via Self-Play

Making sophisticated, robust, and safe sequential decisions is at the he...
research
11/11/2019

UW-MARL: Multi-Agent Reinforcement Learning for Underwater Adaptive Sampling using Autonomous Vehicles

Near-real-time water-quality monitoring in uncertain environments such a...
research
06/26/2020

Can Autonomous Vehicles Identify, Recover From, and Adapt to Distribution Shifts?

Out-of-training-distribution (OOD) scenarios are a common challenge of l...
research
12/16/2022

An Energy-aware, Fault-tolerant, and Robust Deep Reinforcement Learning based approach for Multi-agent Patrolling Problems

Autonomous vehicles are suited for continuous area patrolling problems. ...

Please sign up or login with your details

Forgot password? Click here to reset