Transfer Learning for Efficient Iterative Safety Validation

12/09/2020
by   Anthony Corso, et al.
0

Safety validation is important during the development of safety-critical autonomous systems but can require significant computational effort. Existing algorithms often start from scratch each time the system under test changes. We apply transfer learning to improve the efficiency of reinforcement learning based safety validation algorithms when applied to related systems. Knowledge from previous safety validation tasks is encoded through the action value function and transferred to future tasks with a learned set of attention weights. Including a learned state and action value transformation for each source task can improve performance even when systems have substantially different failure modes. We conduct experiments on safety validation tasks in gridworld and autonomous driving scenarios. We show that transfer learning can improve the initial and final performance of validation algorithms and reduce the number of training steps.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
05/06/2020

A Survey of Algorithms for Black-Box Safety Validation

Autonomous and semi-autonomous systems for safety-critical applications ...
research
02/07/2023

Adaptive Aggregation for Safety-Critical Control

Safety has been recognized as the central obstacle to preventing the use...
research
10/29/2022

Self-Improving Safety Performance of Reinforcement Learning Based Driving with Black-Box Verification Algorithms

In this work, we propose a self-improving artificial intelligence system...
research
04/24/2020

Explicit Domain Adaptation with Loosely Coupled Samples

Transfer learning is an important field of machine learning in general, ...
research
01/22/2019

On the validation of complex systems operating in open contexts

In the recent years, there has been a rush towards highly autonomous sys...
research
05/19/2023

Tune-Mode ConvBN Blocks For Efficient Transfer Learning

Convolution-BatchNorm (ConvBN) blocks are integral components in various...

Please sign up or login with your details

Forgot password? Click here to reset