Pathfinding in Random Partially Observable Environments with Vision-Informed Deep Reinforcement Learning

09/11/2022
by   Anthony Dowling, et al.
0

Deep reinforcement learning is a technique for solving problems in a variety of environments, ranging from Atari video games to stock trading. This method leverages deep neural network models to make decisions based on observations of a given environment with the goal of maximizing a reward function that can incorporate cost and rewards for reaching goals. With the aim of pathfinding, reward conditions can include reaching a specified target area along with costs for movement. In this work, multiple Deep Q-Network (DQN) agents are trained to operate in a partially observable environment with the goal of reaching a target zone in minimal travel time. The agent operates based on a visual representation of its surroundings, and thus has a restricted capability to observe the environment. A comparison between DQN, DQN-GRU, and DQN-LSTM is performed to examine each models capabilities with two different types of input. Through this evaluation, it is been shown that with equivalent training and analogous model architectures, a DQN model is able to outperform its recurrent counterparts.

READ FULL TEXT
research
09/18/2016

Playing FPS Games with Deep Reinforcement Learning

Advances in deep reinforcement learning have allowed autonomous agents t...
research
11/28/2019

Algorithmic Improvements for Deep Reinforcement Learning applied to Interactive Fiction

Text-based games are a natural challenge domain for deep reinforcement l...
research
12/08/2020

Emergence of Different Modes of Tool Use in a Reaching and Dragging Task

Tool use is an important milestone in the evolution of intelligence. In ...
research
07/22/2018

Implementation of Q Learning and Deep Q Network For Controlling a Self Balancing Robot Model

In this paper, the implementation of two Reinforcement learnings namely,...
research
03/25/2022

Dealing with Sparse Rewards Using Graph Neural Networks

Deep reinforcement learning in partially observable environments is a di...
research
12/06/2017

A Novel Model for Arbitration between Planning and Habitual Control Systems

It is well established that humans decision making and instrumental cont...
research
02/22/2017

Theoretical and Experimental Analysis of the Canadian Traveler Problem

Devising an optimal strategy for navigation in a partially observable en...

Please sign up or login with your details

Forgot password? Click here to reset