Parcel3D: Shape Reconstruction from Single RGB Images for Applications in Transportation Logistics

04/18/2023
by Alexander Naumann, et al.

We focus on enabling damage and tampering detection in logistics and tackle the problem of 3D shape reconstruction of potentially damaged parcels. As input we use single RGB images, which corresponds to use cases where only simple handheld devices are available, e.g. for postmen during delivery or clients upon receipt. We present a novel synthetic dataset, named Parcel3D, that is based on the Google Scanned Objects (GSO) dataset and consists of more than 13,000 images of parcels with full 3D annotations. The dataset contains intact (i.e. cuboid-shaped) parcels and damaged parcels, which were generated in simulations. We work towards detecting mishandling of parcels by presenting a novel architecture called CubeRefine R-CNN, which combines 3D bounding box estimation with iterative mesh refinement. We benchmark our approach on Parcel3D and on an existing dataset of cuboid-shaped parcels in real-world scenarios. Our results show that, while training on Parcel3D enables transfer to the real world, reliable deployment in real-world scenarios remains challenging. CubeRefine R-CNN yields competitive performance in terms of Mesh AP and is the only model that directly enables deformation assessment by 3D mesh comparison and tampering detection by comparing viewpoint-invariant representations of parcel side surfaces. Dataset and code are available at https://a-nau.github.io/parcel3d.
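To make the idea of deformation assessment by 3D mesh comparison more concrete, the following is a minimal sketch and not the authors' implementation. It assumes the reconstructed mesh vertices and the estimated cuboid are already expressed in the same frame, with the cuboid axis-aligned and centered at the origin. The function names `box_surface_distance` and `deformation_score`, and the mean-distance metric itself, are hypothetical choices made for illustration.

```python
import math


def box_surface_distance(point, half_extents):
    """Unsigned distance from a 3D point to the surface of an
    origin-centered, axis-aligned box (assumed frame, see lead-in)."""
    # Per-axis offset relative to the box faces (negative means inside).
    q = [abs(p) - h for p, h in zip(point, half_extents)]
    # Distance contribution when the point lies outside the box.
    outside = math.sqrt(sum(max(c, 0.0) ** 2 for c in q))
    # Negative depth to the nearest face when the point lies inside.
    inside = min(max(q), 0.0)
    return abs(outside + inside)


def deformation_score(vertices, half_extents):
    """Hypothetical metric: mean vertex-to-cuboid distance.
    It is 0 when every vertex lies exactly on the cuboid surface."""
    total = sum(box_surface_distance(v, half_extents) for v in vertices)
    return total / len(vertices)


# Toy data: the eight corners of an intact 2 x 2 x 2 parcel ...
half_extents = (1.0, 1.0, 1.0)
intact = [(x, y, z) for x in (-1.0, 1.0) for y in (-1.0, 1.0) for z in (-1.0, 1.0)]
# ... and the same parcel with one corner pushed inward.
dented = [(0.5, 0.5, 0.5) if v == (1.0, 1.0, 1.0) else v for v in intact]

print(deformation_score(intact, half_extents))
print(deformation_score(dented, half_extents))
```

In this toy setup the intact parcel scores zero and the dented one scores above zero. A real pipeline would also need to handle pose alignment and would likely sample points on the mesh surface rather than use vertices only.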


research
03/22/2022

A Real World Dataset for Multi-view 3D Reconstruction

We present a dataset of 371 3D models of everyday tabletop objects along...
research
03/20/2019

Photometric Mesh Optimization for Video-Aligned 3D Object Reconstruction

In this paper, we address the problem of 3D object mesh reconstruction f...
research
09/10/2019

FreiHAND: A Dataset for Markerless Capture of Hand Pose and Shape from Single RGB Images

Estimating 3D hand pose from single RGB images is a highly ambiguous pro...
research
06/06/2019

Mesh R-CNN

Rapid advances in 2D perception have led to systems that accurately dete...
research
10/08/2022

Training Deep Learning Algorithms on Synthetic Forest Images for Tree Detection

Vision-based segmentation in forested environments is a key functionalit...
research
07/02/2021

A Novel Disaster Image Dataset and Characteristics Analysis using Attention Model

The advancement of deep learning technology has enabled us to develop sy...
research
09/16/2021

Habitat-Matterport 3D Dataset (HM3D): 1000 Large-scale 3D Environments for Embodied AI

We present the Habitat-Matterport 3D (HM3D) dataset. HM3D is a large-sca...
