Language-enhanced RNR-Map: Querying Renderable Neural Radiance Field maps with natural language

08/17/2023
by   Francesco Taioli, et al.
0

We present Le-RNR-Map, a Language-enhanced Renderable Neural Radiance map for Visual Navigation with natural language query prompts. The recently proposed RNR-Map employs a grid structure comprising latent codes positioned at each pixel. These latent codes, which are derived from image observation, enable: i) image rendering given a camera pose, since they are converted to Neural Radiance Field; ii) image navigation and localization with astonishing accuracy. On top of this, we enhance RNR-Map with CLIP-based embedding latent codes, allowing natural language search without additional label data. We evaluate the effectiveness of this map in single and multi-object searches. We also investigate its compatibility with a Large Language Model as an "affordance query resolver". Code and videos are available at https://intelligolabs.github.io/Le-RNR-Map/

READ FULL TEXT

page 1

page 3

page 4

page 7

research
03/01/2023

Renderable Neural Radiance Map for Visual Navigation

We propose a novel type of map for visual navigation, a renderable neura...
research
10/11/2022

Visual Language Maps for Robot Navigation

Grounding language to the visual observations of a navigating agent can ...
research
10/23/2020

The RobotSlang Benchmark: Dialog-guided Robot Localization and Navigation

Autonomous robot systems for applications from search and rescue to assi...
research
02/22/2020

Emergent Communication with World Models

We introduce Language World Models, a class of language-conditional gene...
research
04/16/2021

BERT2Code: Can Pretrained Language Models be Leveraged for Code Search?

Millions of repetitive code snippets are submitted to code repositories ...
research
04/01/2022

LASER: LAtent SpacE Rendering for 2D Visual Localization

We present LASER, an image-based Monte Carlo Localization (MCL) framewor...
research
09/19/2023

Natural Language Dataset Generation Framework for Visualizations Powered by Large Language Models

We introduce a Large Language Model (LLM) framework that generates rich ...

Please sign up or login with your details

Forgot password? Click here to reset