Online 3D Bin Packing with Constrained Deep Reinforcement Learning

06/26/2020
by   Hang Zhao, et al.
9

We solve a challenging yet practically useful variant of 3D Bin Packing Problem (3D-BPP). In our problem, the agent has limited information about the items to be packed into the bin, and an item must be packed immediately after its arrival without buffering or readjusting. The item's placement also subjects to the constraints of collision avoidance and physical stability. We formulate this online 3D-BPP as a constrained Markov decision process. To solve the problem, we propose an effective and easy-to-implement constrained deep reinforcement learning (DRL) method under the actor-critic framework. In particular, we introduce a feasibility predictor to predict the feasibility mask for the placement actions and use it to modulate the action probabilities output by the actor during training. Such supervisions and transformations to DRL facilitate the agent to learn feasible policies efficiently. Our method can also be generalized e.g., with the ability to handle lookahead or items with different orientations. We have conducted extensive evaluation showing that the learned policy significantly outperforms the state-of-the-art methods. A user study suggests that our method attains a human-level performance.

READ FULL TEXT

page 1

page 14

research
08/31/2021

Learning Practically Feasible Policies for Online 3D Bin Packing

We tackle the Online 3D Bin Packing Problem, a challenging yet practical...
research
03/30/2022

One-Step Two-Critic Deep Reinforcement Learning for Inverter-based Volt-Var Control in Active Distribution Networks

A one-step two-critic deep reinforcement learning (OSTC-DRL) approach fo...
research
04/29/2020

Actor-Critic Reinforcement Learning for Control with Stability Guarantee

Deep Reinforcement Learning (DRL) has achieved impressive performance in...
research
11/05/2018

Managing engineering systems with large state and action spaces through deep reinforcement learning

Decision-making for engineering systems can be efficiently formulated as...
research
03/31/2022

Mask Atari for Deep Reinforcement Learning as POMDP Benchmarks

We present Mask Atari, a new benchmark to help solve partially observabl...
research
07/08/2022

Product Segmentation Newsvendor Problems: A Robust Learning Approach

We propose and analyze a product segmentation newsvendor problem, which ...
research
08/20/2017

Solving a New 3D Bin Packing Problem with Deep Reinforcement Learning Method

In this paper, a new type of 3D bin packing problem (BPP) is proposed, i...

Please sign up or login with your details

Forgot password? Click here to reset