A Deep Reinforcement Learning Approach for Dynamically Stable Inverse Kinematics of Humanoid Robots

by S Phaniteja, et al.
IIIT Hyderabad

Real-time calculation of inverse kinematics (IK) with dynamically stable configurations is essential for humanoid robots, as they are highly susceptible to losing balance. This paper proposes a methodology for generating joint-space trajectories of stable configurations for solving inverse kinematics using deep reinforcement learning (RL). Our approach is based on exploring the entire configuration space of the robot and learning the best possible solutions using Deep Deterministic Policy Gradient (DDPG). The proposed strategy was evaluated on the highly articulated upper body of a humanoid model with 27 degrees of freedom (DoF). The trained model was able to solve inverse kinematics for the end effectors with 90% accuracy while maintaining balance in the double support phase.
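
To make the idea of a stability-aware IK objective concrete, the following is a minimal sketch of a reward that an RL agent such as DDPG could be trained on. It is not the reward used in the paper: the planar serial chain, the point-mass links, the static center-of-mass test against a support interval, and all names (`forward_kinematics`, `center_of_mass_x`, `reward`, `support`, `fall_penalty`) are assumptions chosen for illustration. A dynamic criterion would replace the static check in a real humanoid setting.

```python
import math


def forward_kinematics(joint_angles, link_lengths):
    """Return the joint positions of a planar serial chain rooted at the origin."""
    x = y = theta = 0.0
    points = [(x, y)]
    for angle, length in zip(joint_angles, link_lengths):
        theta += angle  # joint angles are relative to the previous link
        x += length * math.cos(theta)
        y += length * math.sin(theta)
        points.append((x, y))
    return points


def center_of_mass_x(points, link_masses):
    """Horizontal center of mass, assuming each link's mass sits at its midpoint."""
    total = sum(link_masses)
    weighted = 0.0
    for p0, p1, mass in zip(points[:-1], points[1:], link_masses):
        weighted += mass * 0.5 * (p0[0] + p1[0])
    return weighted / total


def reward(joint_angles, target, link_lengths, link_masses,
           support=(-0.1, 0.1), fall_penalty=10.0):
    """Hypothetical reward: negative end-effector error, minus a penalty if unstable.

    Stability here is a static proxy (assumption): the horizontal center of
    mass must lie inside the support interval on the ground.
    """
    points = forward_kinematics(joint_angles, link_lengths)
    end_x, end_y = points[-1]
    distance = math.hypot(end_x - target[0], end_y - target[1])
    stable = support[0] <= center_of_mass_x(points, link_masses) <= support[1]
    return -distance - (0.0 if stable else fall_penalty), stable


# Upright pose reaching a target directly overhead: near-zero error, stable.
upright_reward, upright_stable = reward(
    [math.pi / 2, 0.0], (0.0, 2.0), [1.0, 1.0], [1.0, 1.0])

# Fully outstretched pose reaches its target but shifts the mass off the support.
reach_reward, reach_stable = reward(
    [0.0, 0.0], (2.0, 0.0), [1.0, 1.0], [1.0, 1.0])
```

In this sketch the second pose reaches its target exactly yet is penalized, which is the behavior a balance-aware IK solver needs: reaching the goal alone does not make a configuration acceptable.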


Towards continuous control of flippers for a multi-terrain robot using deep reinforcement learning

In this paper we focus on developing a control algorithm for multi-terra...

A Deep Reinforcement Learning Approach towards Pendulum Swing-up Problem based on TF-Agents

Adapting the idea of training CartPole with Deep Q-learning agent, we ar...

Guided Deep Reinforcement Learning for Articulated Swimming Robots

Deep reinforcement learning has recently been applied to a variety of ro...

Robust Recovery Controller for a Quadrupedal Robot using Deep Reinforcement Learning

The ability to recover from a fall is an essential feature for a legged ...

Collisionless Pattern Discovery in Robot Swarms Using Deep Reinforcement Learning

We present a deep reinforcement learning-based framework for automatical...

Real-time Active Vision for a Humanoid Soccer Robot Using Deep Reinforcement Learning

In this paper, we present an active vision method using a deep reinforce...

Reinforced dynamics for enhanced sampling in large atomic and molecular systems. I. Basic Methodology

A new approach for efficiently exploring the configuration space and com...