Building Intelligent Autonomous Navigation Agents

06/25/2021
by Devendra Singh Chaplot, et al.

Breakthroughs in machine learning over the last decade have led to 'digital intelligence', i.e., machine learning models capable of learning from vast amounts of labeled data to perform digital tasks such as speech recognition, face recognition, and machine translation. The goal of this thesis is to make progress towards designing algorithms capable of 'physical intelligence', i.e., building intelligent autonomous navigation agents capable of learning to perform complex navigation tasks in the physical world involving visual perception, natural language understanding, reasoning, planning, and sequential decision making. Despite several advances in classical navigation methods in the last few decades, current navigation agents struggle with long-term semantic navigation tasks. In the first part of the thesis, we discuss our work on short-term navigation using end-to-end reinforcement learning to tackle challenges such as obstacle avoidance, semantic perception, language grounding, and reasoning. In the second part, we present a new class of navigation methods based on modular learning and structured explicit map representations, which leverage the strengths of both classical and end-to-end learning methods to tackle long-term navigation tasks. We show that these methods effectively tackle challenges such as localization, mapping, long-term planning, exploration, and learning semantic priors. These modular learning methods are capable of long-term spatial and semantic understanding and achieve state-of-the-art results on various navigation tasks.


