Spatial-temporal recurrent reinforcement learning for autonomous ships

11/02/2022
by   Martin Waltz, et al.
0

The paper proposes a spatial-temporal recurrent neural network architecture for Deep Q-Networks to steer an autonomous ship. The network design allows handling an arbitrary number of surrounding target ships while offering robustness to partial observability. Further, a state-of-the-art collision risk metric is proposed to enable an easier assessment of different situations by the agent. The COLREG rules of maritime traffic are explicitly considered in the design of the reward function. The final policy is validated on a custom set of newly created single-ship encounters called "Around the Clock" problems and the commonly chosen Imazu (1987) problems, which include 18 multi-ship scenarios. Additionally, the framework shows robustness when deployed simultaneously in multi-agent scenarios. The proposed network architecture is compatible with other deep reinforcement learning algorithms, including actor-critic frameworks.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
10/05/2022

Spatial-Temporal-Aware Safe Multi-Agent Reinforcement Learning of Connected Autonomous Vehicles in Challenging Scenarios

Communication technologies enable coordination among connected and auton...
research
12/21/2018

Introducing Neuromodulation in Deep Neural Networks to Learn Adaptive Behaviours

In this paper, we propose a new deep neural network architecture, called...
research
04/14/2021

Decomposed Soft Actor-Critic Method for Cooperative Multi-Agent Reinforcement Learning

Deep reinforcement learning methods have shown great performance on many...
research
05/02/2021

Reducing Bus Bunching with Asynchronous Multi-Agent Reinforcement Learning

The bus system is a critical component of sustainable urban transportati...
research
11/29/2019

Distributed Soft Actor-Critic with Multivariate Reward Representation and Knowledge Distillation

In this paper, we describe NeurIPS 2019 Learning to Move - Walk Around c...
research
05/18/2017

Agent-Centric Risk Assessment: Accident Anticipation and Risky Region Localization

For survival, a living agent must have the ability to assess risk (1) by...

Please sign up or login with your details

Forgot password? Click here to reset