Online 3D Bin Packing Reinforcement Learning Solution with Buffer

08/15/2022
by   Aaron Valero Puche, et al.
0

The 3D Bin Packing Problem (3D-BPP) is one of the most demanded yet challenging problems in industry, where an agent must pack variable size items delivered in sequence into a finite bin with the aim to maximize the space utilization. It represents a strongly NP-Hard optimization problem such that no solution has been offered to date with high performance in space utilization. In this paper, we present a new reinforcement learning (RL) framework for a 3D-BPP solution for improving performance. First, a buffer is introduced to allow multi-item action selection. By increasing the degree of freedom in action selection, a more complex policy that results in better packing performance can be derived. Second, we propose an agnostic data augmentation strategy that exploits both bin item symmetries for improving sample efficiency. Third, we implement a model-based RL method adapted from the popular algorithm AlphaGo, which has shown superhuman performance in zero-sum games. Our adaptation is capable of working in single-player and score based environments. In spite of the fact that AlphaGo versions are known to be computationally heavy, we manage to train the proposed framework with a single thread and GPU, while obtaining a solution that outperforms the state-of-the-art results in space utilization.

READ FULL TEXT

page 1

page 3

research
07/01/2020

A Generalized Reinforcement Learning Algorithm for Online 3D Bin-Packing

We propose a Deep Reinforcement Learning (Deep RL) algorithm for solving...
research
12/05/2022

Learning Physically Realizable Skills for Online Packing of General 3D Shapes

We study the problem of learning online packing skills for irregular 3D ...
research
08/31/2021

Learning Practically Feasible Policies for Online 3D Bin Packing

We tackle the Online 3D Bin Packing Problem, a challenging yet practical...
research
05/26/2022

RACE: A Reinforcement Learning Framework for Improved Adaptive Control of NoC Channel Buffers

Network-on-chip (NoC) architectures rely on buffers to store flits to co...
research
04/17/2018

A Multi-task Selected Learning Approach for Solving New Type 3D Bin Packing Problem

This paper studies a new type of 3D bin packing problem (BPP), in which ...
research
08/20/2017

Solving a New 3D Bin Packing Problem with Deep Reinforcement Learning Method

In this paper, a new type of 3D bin packing problem (BPP) is proposed, i...
research
02/14/2017

Small Boxes Big Data: A Deep Learning Approach to Optimize Variable Sized Bin Packing

Bin Packing problems have been widely studied because of their broad app...

Please sign up or login with your details

Forgot password? Click here to reset