MVRackLay: Monocular Multi-View Layout Estimation for Warehouse Racks and Shelves

11/30/2022
by   Pranjali Pathre, et al.
1

In this paper, we propose and showcase, for the first time, monocular multi-view layout estimation for warehouse racks and shelves. Unlike typical layout estimation methods, MVRackLay estimates multi-layered layouts, wherein each layer corresponds to the layout of a shelf within a rack. Given a sequence of images of a warehouse scene, a dual-headed Convolutional-LSTM architecture outputs segmented racks, the front and the top view layout of each shelf within a rack. With minimal effort, such an output is transformed into a 3D rendering of all racks, shelves and objects on the shelves, giving an accurate 3D depiction of the entire warehouse scene in terms of racks, shelves and the number of objects on each shelf. MVRackLay generalizes to a diverse set of warehouse scenes with varying number of objects on each shelf, number of shelves and in the presence of other such racks in the background. Further, MVRackLay shows superior performance vis-a-vis its single view counterpart, RackLay, in layout accuracy, quantized in terms of the mean IoU and mAP metrics. We also showcase a multi-view stitching of the 3D layouts resulting in a representation of the warehouse scene with respect to a global reference frame akin to a rendering of the scene from a SLAM pipeline. To the best of our knowledge, this is the first such work to portray a 3D rendering of a warehouse scene in terms of its semantic components - Racks, Shelves and Objects - all from a single monocular camera.

READ FULL TEXT

page 1

page 6

page 7

research
03/16/2021

RackLay: Multi-Layer Layout Estimation for Warehouse Racks

Given a monocular colour image of a warehouse rack, we aim to predict th...
research
12/12/2021

MVLayoutNet:3D layout reconstruction with multi-view panoramas

We present MVLayoutNet, an end-to-end network for holistic 3D reconstruc...
research
03/22/2023

MAIR: Multi-view Attention Inverse Rendering with 3D Spatially-Varying Lighting Estimation

We propose a scene-level inverse rendering framework that uses multi-vie...
research
05/03/2022

Cross-View Cross-Scene Multi-View Crowd Counting

Multi-view crowd counting has been previously proposed to utilize multi-...
research
12/11/2019

Simultaneous Detection and Removal of Dynamic Objects in Multi-view Images

Consider a set of images of a scene consisting of moving objects capture...
research
10/24/2022

360-MLC: Multi-view Layout Consistency for Self-training and Hyper-parameter Tuning

We present 360-MLC, a self-training method based on multi-view layout co...
research
08/20/2021

AutoLay: Benchmarking amodal layout estimation for autonomous driving

Given an image or a video captured from a monocular camera, amodal layou...

Please sign up or login with your details

Forgot password? Click here to reset