Learning Active Camera for Multi-Object Navigation

10/14/2022
by   Peihao Chen, et al.
0

Getting robots to navigate to multiple objects autonomously is essential yet difficult in robot applications. One of the key challenges is how to explore environments efficiently with camera sensors only. Existing navigation methods mainly focus on fixed cameras and few attempts have been made to navigate with active cameras. As a result, the agent may take a very long time to perceive the environment due to limited camera scope. In contrast, humans typically gain a larger field of view by looking around for a better perception of the environment. How to make robots perceive the environment as efficiently as humans is a fundamental problem in robotics. In this paper, we consider navigating to multiple objects more efficiently with active cameras. Specifically, we cast moving camera to a Markov Decision Process and reformulate the active camera problem as a reinforcement learning problem. However, we have to address two new challenges: 1) how to learn a good camera policy in complex environments and 2) how to coordinate it with the navigation policy. To address these, we carefully design a reward function to encourage the agent to explore more areas by moving camera actively. Moreover, we exploit human experience to infer a rule-based camera action to guide the learning process. Last, to better coordinate two kinds of policies, the camera policy takes navigation actions into account when making camera moving decisions. Experimental results show our camera policy consistently improves the performance of multi-object navigation over four baselines on two datasets.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
07/30/2018

Active Object Perceiver: Recognition-guided Policy Learning for Object Searching on Mobile Robots

We study the problem of learning a navigation policy for a robot to acti...
research
07/29/2018

Sidekick Policy Learning for Active Visual Exploration

We consider an active visual exploration scenario, where an agent must i...
research
02/22/2022

Coordinate-Aligned Multi-Camera Collaboration for Active Multi-Object Tracking

Active Multi-Object Tracking (AMOT) is a task where cameras are controll...
research
07/15/2020

Active Visual Information Gathering for Vision-Language Navigation

Vision-language navigation (VLN) is the task of entailing an agent to ca...
research
03/07/2023

Proactive Multi-Camera Collaboration For 3D Human Pose Estimation

This paper presents a multi-agent reinforcement learning (MARL) scheme f...
research
02/16/2016

Optimizing Gaze Direction in a Visual Navigation Task

Navigation in an unknown environment consists of multiple separable subt...
research
12/02/2022

Private Multiparty Perception for Navigation

We introduce a framework for navigating through cluttered environments b...

Please sign up or login with your details

Forgot password? Click here to reset