PAg-NeRF: Towards fast and efficient end-to-end panoptic 3D representations for agricultural robotics

09/11/2023
by Claus Smitt, et al.

Precise scene understanding is key for most robot monitoring and intervention tasks in agriculture. In this work, we present PAg-NeRF, a novel NeRF-based system that enables 3D panoptic scene understanding. Our representation is trained using an image sequence with noisy robot odometry poses and automatic panoptic predictions with inconsistent IDs between frames. Despite this noisy input, our system is able to output scene geometry, photo-realistic renders, and 3D-consistent panoptic representations with consistent instance IDs. We evaluate this novel system in a very challenging horticultural scenario and, in doing so, demonstrate an end-to-end trainable system that can make use of noisy robot poses rather than precise poses that have to be pre-calculated. Compared to a baseline approach, the peak signal-to-noise ratio improves from 21.34 dB to 23.37 dB, while the panoptic quality improves from 56.65 [...] tuned to improve inference time by more than a factor of 2 while being memory efficient, with approximately 12 times fewer parameters.
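To put the reported PSNR figures in perspective, the following is a minimal sketch using the standard definition PSNR = 10 log10(MAX^2 / MSE). It is not taken from the paper: the function names are hypothetical, and it assumes both numbers use the same peak pixel value and can be treated as if derived from a single aggregate mean squared error (per-image averaging would make the result only indicative).

```python
import math


def psnr(mse: float, max_value: float = 1.0) -> float:
    """PSNR in dB for a given mean squared error and peak pixel value."""
    return 10.0 * math.log10(max_value ** 2 / mse)


def mse_ratio_from_psnr_gain(gain_db: float) -> float:
    """Ratio new_mse / old_mse implied by a PSNR gain in dB (same peak value)."""
    return 10.0 ** (-gain_db / 10.0)


# Values quoted in the abstract above.
baseline_db = 21.34
improved_db = 23.37

ratio = mse_ratio_from_psnr_gain(improved_db - baseline_db)
print(f"Implied MSE ratio (improved / baseline): {ratio:.3f}")
```

Under these assumptions, the 2.03 dB gain corresponds to an MSE ratio of roughly 0.63, that is, a reduction in mean squared error of a little over a third.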


