Deep Q-Learning versus Proximal Policy Optimization: Performance Comparison in a Material Sorting Task

06/02/2023
by   Reuf Kozlica, et al.
0

This paper presents a comparison between two well-known deep Reinforcement Learning (RL) algorithms: Deep Q-Learning (DQN) and Proximal Policy Optimization (PPO) in a simulated production system. We utilize a Petri Net (PN)-based simulation environment, which was previously proposed in related work. The performance of the two algorithms is compared based on several evaluation metrics, including average percentage of correctly assembled and sorted products, average episode length, and percentage of successful episodes. The results show that PPO outperforms DQN in terms of all evaluation metrics. The study highlights the advantages of policy-based algorithms in problems with high-dimensional state and action spaces. The study contributes to the field of deep RL in context of production systems by providing insights into the effectiveness of different algorithms and their suitability for different tasks.

READ FULL TEXT
research
03/09/2020

Stable Policy Optimization via Off-Policy Divergence Regularization

Trust Region Policy Optimization (TRPO) and Proximal Policy Optimization...
research
08/03/2020

Proximal Deterministic Policy Gradient

This paper introduces two simple techniques to improve off-policy Reinfo...
research
12/08/2021

A Review for Deep Reinforcement Learning in Atari:Benchmarks, Challenges, and Solutions

The Arcade Learning Environment (ALE) is proposed as an evaluation platf...
research
06/04/2019

Off-Policy Evaluation via Off-Policy Classification

In this work, we consider the problem of model selection for deep reinfo...
research
01/27/2022

Quantile-Based Policy Optimization for Reinforcement Learning

Classical reinforcement learning (RL) aims to optimize the expected cumu...
research
01/10/2023

Imbalanced Classification In Faulty Turbine Data: New Proximal Policy Optimization

There is growing importance to detecting faults and implementing the bes...
research
07/06/2023

Volumetric Occupancy Detection: A Comparative Analysis of Mapping Algorithms

Despite the growing interest in innovative functionalities for collabora...

Please sign up or login with your details

Forgot password? Click here to reset