Control of Pneumatic Artificial Muscles with SNN-based Cerebellar-like Model

09/22/2021
by   Hongbo Zhang, et al.
0

Soft robotics technologies have gained growing interest in recent years, which allows various applications from manufacturing to human-robot interaction. Pneumatic artificial muscle (PAM), a typical soft actuator, has been widely applied to soft robots. The compliance and resilience of soft actuators allow soft robots to behave compliant when interacting with unstructured environments, while the utilization of soft actuators also introduces nonlinearity and uncertainty. Inspired by Cerebellum's vital functions in control of human's physical movement, a neural network model of Cerebellum based on spiking neuron networks (SNNs) is designed. This model is used as a feed-forward controller in controlling a 1-DOF robot arm driven by PAMs. The simulation results show that this Cerebellar-based system achieves good performance and increases the system's response.

READ FULL TEXT VIEW PDF

page 1

page 2

page 3

page 4

06/25/2019

Keep soft robots soft -- a data-driven based trade-off between feed-forward and feedback control

Tracking control for soft robots is challenging due to uncertainties in ...
10/28/2019

Human Interface for Teleoperated Object Manipulation with a Soft Growing Robot

Soft growing robots are proposed for use in applications such as complex...
03/03/2021

Controlling the Sense of Agency in Dyadic Robot Interaction: An Active Inference Approach

This study investigated how social interaction among robotic agents chan...
12/13/2019

Analysis and Control of Fiber-Reinforced Elastomeric Enclosures (FREEs)

While rigid robots are extensively used in various applications, they ar...
02/04/2021

Regenerating Soft Robots through Neural Cellular Automata

Morphological regeneration is an important feature that highlights the e...
11/05/2019

Cognitive and motor compliance in intentional human-robot interaction

Embodiment and subjective experience in human-robot interaction are impo...
06/16/2020

Lio – A Personal Robot Assistant for Human-Robot Interaction and Care Applications

Lio is a mobile robot platform with a multi-functional arm explicitly de...

1 Introduction

Pneumatic artificial muscles (PAMs), such as Mckibben, are designed with the inspiration of creatures, showing great compatibility to creatures. This kind of muscle emerged in the twentieth century and has various kinds of applications after decades of development. Mckibben is small in size and relatively safe with high power to weight ratio. However, its nonlinearity and viscoelasticity properties increase the difficulty in controller designing.

The best example of control system for soft actuators can be found in animal bodies. As mentioned in [5], Cerebellum as part of creatures’ neural system have attracted vast attention because of that they play important role in controlling function. Therefore, we proposed a Cerebellum-like controller based on its real structure and internal information processing mechanism. Due to the bionic advantages of this controller, it’s appropriate to apply it in controlling Mckibben.

In our system, we conduct a Cerebellum-like controller based on spiking neural networks (SNNs) to control a 1-DOF robotic arm shown in Fig. 2 driven by a pair of PAMs. It can also be refered as an online closed-loop error-correction controller. The controller has one kind of fibers, Mossy Fibers, with four kinds of cells, Granule cells, Purkinje cells Inferior Olive cells and Deep cerebellar nuclei cells. All of them are constructed as the similar structures of Cerebellum and we model its physical functions. Besides, the SNNs functions the feed-forward part in our controller. Previously, scholars have designed some novel SNNs[3][1][6].In our work, a new SNN topology is designed to learn the inverse model of soft actuators and make up for the output of the controller.

2 The Structure of SNN

Here we use a real-time spiking neural network with a cerebellar-like structure that can obtain the inverse model of the Mckibben pairs to act as a feed-forward part of the controller. We use a set of spiking neurons as a basic unit of the network and imitate the structure of Cerebellar to build a neural network, which bases on [3]. The topology of the network is displayed in Fig. 2.

Figure 1: Topological structure diagram of the neural network. The arrow indicates an excitatory effect, and the circle indicates an inhibitory effect.
Figure 2: The robotic arm. Two pneumatic artificial muscles are placed on both sides. When one of them contracts, it pulls the robotic arm to rotate.

2.1 Neurons and Layers

The neural network consists of about 1000 neurons which contains 80 Mossy Fibers (MF), 100 Granule cells (GR), 160 Purkinje cells (PK), 160 Inferior Olive cells (IO), and 160 Deep cerebellar nuclei cells (DCN). The leaky integrate-and-fire (LIF) neuron model is used to build the neurons. Since the cerebellar cortex has hierarchical functional blocks, different blocks are responsible for different types of physical movements. The GR layer is connected to the MF layer hierarchically to imitate this partition mapping pattern. The PK layer and the DCN layer are divided into antagonistic pairs to receive pulses from GR and corresponding error signals from IO. Weights between GR and PK are adjusted and trained according to Spike Timing–Dependent Plasticity (STDP) learning rules.

2.2 Learning Rules

Studies have shown that learning in Cerebellum is mediated by synaptic plasticity. In our network, the connection between the GR layer and the PK layer is carried out according to the learning rules.

STDP is a learning method based on Hebbian learning rules. The dependence of synaptic modification on the order of pre and postsynaptic spiking within a critical window of tens of milliseconds has profound functional utilities in learning and memory.[2]

STDP in this network is divided into long-term potentiation (LTP) mediated by GR pulses and long-term depression (LTD) mediated by IO pulses. The equations are as follows: LTP effect increases the weight at a specific learning rate whenever there is a GR pulse, while the LTD effect adds the historical GR pulse to the kernel function whenever there is an IO input pulse.

(1)
(2)
(3)

where represents learning rate for LTP and LTD respectively, represents pulse signal of the corresponding neurons and represents the kernel function described in Eq.(3).

3 Simulation Platform

Bindsnet[4]

, an open-source spiking neural network building platform, is used to build and train the neural network. Meanwhile, Simulink is used as the platform to build the physical simulation environment and the controller.

3.1 The Robotic Arm

Our robotic arm uses two iron rods as bones and a pair of pneumatic muscles as actuators imitating bicep and tricep of human respectively in Fig. 2.

One end of the link is fixed, and the muscles are installed on both sides of the link. When one of the muscles contracts, it will pull the unfixed link to rotate. We take the deflection angle of the robotic arm as output and model the robotic arm in Simulink.

3.2 Control Loop

A cascade control method including a feed-forward part and a feedback part is applied in our system shown in Fig. 3

Figure 3: The block diagram of the feed-forward and feedback cascade control system.

The air pressure feedback controller shown in Fig. 3 directly controls the air pressure of the two pneumatic muscles by controlling opening and closing of the solenoid valves.

The PD controllers in the outer loop is served as the main controller to obtain the precision of motion control. Cerebellum-inspired feed-forward controller contributes to improve the response speed and deal with the nonlinearity.

4 Results and Discussions

In the simulation, we use two different control strategies:(1) A controller with both feed-forward and feedback blocks (2) A controller with a single feed-forward block in Fig. 4.

Firstly, a controller with a PD feedback part and a feed-forward part built by the network is applied. The feedback controller is added to achieve the rapid response of the control system to disturbances. In order to test the trajectory tracking effect and the anti-interference effect of the end of the manipulator, a sinusoidal trajectory input is applied as the desired trajectory to analyze the control effect.Results in Fig. 4 (a) shows improvement in control accuracy comparing with the PD controller. Results in Fig. 4 (b) also indicate that the feed-forward controller achieves good performance as well.

The experiment verifies that our controller can replace traditional controllers, and we will continue to reduce the effect of feedback part and verify the control effect of our controller on Mckibben artificial muscle. The whole system shows strong bionics and has potentially large applications in many fields.

Figure 4: (a) Results for a controller with both feed-forward and feedback blocks. (b) Results for a controller with a single feed-forward block.

References