Learning to Seek: Autonomous Source Seeking with Deep Reinforcement Learning Onboard a Nano Drone Microcontroller

by Bardienus P. Duisterhof, et al.

Fully autonomous navigation using nano drones has numerous real-world applications, ranging from search and rescue to source seeking. Nano drones are well suited for source seeking because of their agility, low price, and ubiquity. Unfortunately, their constrained form factor limits flight time, sensor payload, and compute capability. These constraints severely limit the use of source-seeking nano drones in GPS-denied and highly cluttered environments. Here, we introduce a fully autonomous deep reinforcement learning-based light-seeking nano drone. The 33-gram nano drone performs all computation onboard its ultra-low-power microcontroller (MCU). We present a method for efficiently training, converting, and utilizing deep reinforcement learning policies. Our training methodology and novel quantization scheme allow fitting the trained policy in 3 kB of memory. The quantization scheme uses representative input data and input scaling to arrive at a full 8-bit model. Finally, we evaluate the approach in simulation and flight tests using a Bitcraze CrazyFlie, achieving an 80% success rate in a highly cluttered and randomized test environment. Moreover, the drone finds the light source in 29% fewer steps compared to a baseline simulation (obstacle avoidance without source information). To our knowledge, this is the first deep reinforcement learning method that enables source seeking on a highly constrained nano drone while demonstrating robust flight behavior. Our general methodology is suitable for any highly constrained (source-seeking) platform using deep reinforcement learning.
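The abstract describes the quantization scheme only at a high level (representative input data plus input scaling, yielding a full 8-bit model). As a rough illustration of that general idea, and not the authors' actual scheme, the sketch below calibrates a symmetric scale from a handful of sample inputs and maps them to signed 8-bit integers. The function names and the sample readings are hypothetical and chosen only for this example.

```python
def calibrate_scale(samples, num_bits=8):
    """Choose a symmetric scale so the largest observed magnitude maps to the integer limit."""
    qmax = 2 ** (num_bits - 1) - 1  # 127 for 8 bits
    max_abs = max(abs(x) for x in samples)
    return max_abs / qmax if max_abs > 0 else 1.0


def quantize(values, scale, num_bits=8):
    """Round each value to the nearest integer step and clamp to the signed range."""
    qmax = 2 ** (num_bits - 1) - 1
    return [max(-qmax, min(qmax, round(v / scale))) for v in values]


def dequantize(q_values, scale):
    """Map integers back to approximate real values."""
    return [q * scale for q in q_values]


# Hypothetical representative sensor readings (assumed, not taken from the paper).
representative_inputs = [0.0, 0.4, 1.2, 2.5, 3.9]

input_scale = calibrate_scale(representative_inputs)
q_inputs = quantize(representative_inputs, input_scale)
recovered = dequantize(q_inputs, input_scale)

# Worst-case round-trip error over the calibration samples.
max_error = max(abs(a - b) for a, b in zip(representative_inputs, recovered))
```

With a scale derived from representative data, each stored value occupies one byte, and the round-trip error on in-range inputs stays within half a quantization step.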



Autonomous Navigation via Deep Reinforcement Learning for Resource Constraint Edge Nodes using Transfer Learning

Smart and agile drones are fast becoming ubiquitous at the edge of the c...

UAV-Based Search and Rescue in Avalanches using ARVA: An Extremum Seeking Approach

This work deals with the problem of localizing a victim buried by an ava...

Interpretable UAV Collision Avoidance using Deep Reinforcement Learning

The significant components of any successful autonomous flight system ar...

Chasing the Intruder: A Reinforcement Learning Approach for Tracking Intruder Drones

Drones are becoming versatile in a myriad of applications. This has led ...

Deep reinforcement learning for drone navigation using sensor data

Mobile robots such as unmanned aerial vehicles (drones) can be used for ...

Long-Term Planning with Deep Reinforcement Learning on Autonomous Drones

In this paper, we study a long-term planning scenario that is based on d...

Autonomous Drone Racing with Deep Reinforcement Learning

In many robotic tasks, such as drone racing, the goal is to travel throu...
