ROS Rescue : Fault Tolerance System for Robot Operating System

10/02/2019
by   Pushyami Kaveti, et al.
0

In this chapter we discuss the problem of master failure in ROS1.0 and its impact on robotic deployments in the real world. We address this issue in this tutorial chapter where we outline, design and demonstrate a fault tolerant mechanism associated with ROS master failure. Unlike previous solutions which use primary backup replication and external checkpointing libraries which are process heavy, our mechanism adds a lightweight functionality to the ROS master to enable it to recover from failure. We present a modified version of ROS master which is equipped with a logging mechanism to record the meta information and network state of ROS nodes as well as a recovery mechanism to go back to the previous state without having to abort or restart all the nodes. We also implement an additional master monitor node responsible for failure detection on the master by polling it for its availability. Our code is implemented in python and preliminary tests were conducted successfully on a variety of land, aerial and underwater robots and a tele-operating computer running ROS Kinetic on Ubuntu 16.04. The code is publicly available under a creative commons license on github at https://github.com/PushyamiKaveti/fault-tolerant-ros-master

READ FULL TEXT

page 3

page 15

research
04/11/2019

On Byzantine Fault Tolerance in Multi-Master Kubernertes Clusters

Docker container virtualization technology is being widely adopted in cl...
research
03/02/2022

ReStore: In-Memory REplicated STORagE for Rapid Recovery in Fault-Tolerant Algorithms

Fault-tolerant distributed applications require mechanisms to recover da...
research
05/15/2021

An Ergonomic Interaction Workspace Analysis Method for the Optimal Design of a Surgical Master Manipulator

Master control console is a place where robots collaborate with humans i...
research
11/19/2021

Reinforcement Learning with Adaptive Curriculum Dynamics Randomization for Fault-Tolerant Robot Control

This study is aimed at addressing the problem of fault tolerance of quad...
research
05/24/2023

Workrs: Fault Tolerant Horizontal Computation Offloading

The broad development and usage of edge devices has highlighted the impo...
research
09/18/2018

Random problems with R

R (Version 3.5.1 patched) has an issue with its random sampling function...
research
10/09/2018

Collective Strategies with a Master-slave Mechanism Dominate in Spatial Iterated Prisoner's Dilemma

The Iterated Prisoner's Dilemma (IPD) can model transactions among indiv...

Please sign up or login with your details

Forgot password? Click here to reset