UAV Visual Teach and Repeat Using Only Semantic Object Features

We demonstrate the use of semantic object detections as robust features for Visual Teach and Repeat (VTR). Recent CNN-based object detectors can reliably detect objects from tens or hundreds of categories in video at frame rate. We show that such detections are repeatable enough to serve as landmarks for VTR, without any low-level image features. Because object detections are highly invariant to changes in lighting and surface appearance, our VTR system can cope with global lighting changes and local movements of the landmark objects. In the teaching phase, we build a series of compact scene descriptors, each a list of detected object labels and their image-plane locations. In the repeating phase, we use SeqSLAM-like relocalization to identify the most similar learned scene, then use a motion control algorithm based on funnel lane theory to navigate the robot along the previously piloted trajectory. We evaluate the method on a commodity UAV, examining the robustness of the algorithm to new viewpoints, lighting conditions, and movements of landmark objects. The results suggest that semantic object features could be useful because, compared to low-level image features, they are invariant to superficial appearance changes.
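
To make the teach and repeat descriptors concrete, the following is a minimal Python sketch of a scene descriptor (a list of object labels with image-plane locations) and a sequence-based lookup of the most similar taught scene. It is an illustration under stated assumptions, not the paper's implementation: the names `Detection`, `scene_similarity`, and `relocalize`, the normalized coordinates, the greedy same-label matching, and the `max_dist` threshold are all hypothetical choices, and the fixed-window alignment is a much simplified stand-in for SeqSLAM-style sequence matching. Funnel lane control is not covered.

```python
from dataclasses import dataclass
from math import hypot


@dataclass(frozen=True)
class Detection:
    """One detected object: class label plus image-plane location.

    Coordinates are assumed to be normalized to [0, 1] (an assumption
    made for this sketch, not stated in the abstract).
    """
    label: str
    x: float
    y: float


def scene_similarity(a, b, max_dist=0.25):
    """Score two scene descriptors in [0, 1] by greedy same-label matching.

    Each detection in `a` is paired with the nearest unused detection in
    `b` that has the same label; closer pairs contribute more. `max_dist`
    is a hypothetical gating threshold.
    """
    if not a or not b:
        return 0.0
    unused = list(b)
    score = 0.0
    for det in a:
        candidates = [d for d in unused if d.label == det.label]
        if not candidates:
            continue
        nearest = min(candidates, key=lambda d: hypot(d.x - det.x, d.y - det.y))
        dist = hypot(nearest.x - det.x, nearest.y - det.y)
        if dist <= max_dist:
            score += 1.0 - dist / max_dist
            unused.remove(nearest)
    # Normalize by the larger scene so missing objects lower the score.
    return score / max(len(a), len(b))


def relocalize(taught, recent):
    """Return the index of the taught scene that best matches the latest frame.

    Instead of comparing single frames, the last len(recent) observed
    scenes are aligned against every equally long window of the taught
    sequence and the summed similarity is maximized. Returns -1 if the
    taught sequence is shorter than the recent window.
    """
    n = len(recent)
    best_idx, best_score = -1, -1.0
    for end in range(n - 1, len(taught)):
        start = end - n + 1
        total = sum(scene_similarity(taught[start + k], recent[k]) for k in range(n))
        if total > best_score:
            best_idx, best_score = end, total
    return best_idx


# Example: four taught scenes and a two-frame window observed while repeating.
taught = [
    [Detection("chair", 0.2, 0.5)],
    [Detection("chair", 0.4, 0.5), Detection("tv", 0.8, 0.3)],
    [Detection("tv", 0.5, 0.3)],
    [Detection("plant", 0.5, 0.5)],
]
recent = [
    [Detection("chair", 0.42, 0.5), Detection("tv", 0.79, 0.31)],
    [Detection("tv", 0.52, 0.3)],
]
matched_index = relocalize(taught, recent)
```

Because the descriptor stores only labels and positions, a global lighting change that leaves the detections intact does not affect the score in this sketch, which is the property the abstract attributes to semantic features.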


