Robust Walking Control of a Lower Limb Rehabilitation Exoskeleton Coupled with a Musculoskeletal Model via Deep Reinforcement Learning

Mapping Intimacies ◽

10.21203/rs.3.rs-1212542/v1 ◽

2021 ◽

Author(s):

Shuzhen Luo ◽

Ghaith Androwis ◽

Sergei Adamovich ◽

Erick Nunez ◽

Hao Su ◽

...

Keyword(s):

Reinforcement Learning ◽

Neuromuscular Disorders ◽

Control Policy ◽

Policy Network ◽

Control Parameters ◽

Robust Controller ◽

Interaction Forces ◽

Learning Framework ◽

Limb Rehabilitation ◽

Lower Limb Rehabilitation

Abstract Background: Few studies have systematically investigated robust controllers for lower limb rehabilitation exoskeletons (LLREs) that can safely and effectively assist users with a variety of neuromuscular disorders to walk with full autonomy. One of the key challenges for developing such a robust controller is to handle different degrees of uncertain human-exoskeleton interaction forces from the patients. Consequently, conventional walking controllers either are patient-condition specific or involve tuning of many control parameters, which could behave unreliably and even fail to maintain balance. Methods: We present a novel and robust controller for a LLRE based on a decoupled deep reinforcement learning framework with three independent networks, which aims to provide reliable walking assistance against various and uncertain human-exoskeleton interaction forces. The exoskeleton controller is driven by a neural network control policy that acts on a stream of the LLRE’s proprioceptive signals, including joint kinematic states, and subsequently predicts real-time position control targets for the actuated joints. To handle uncertain human-interaction forces, the control policy is trained intentionally with an integrated human musculoskeletal model and realistic human-exoskeleton interaction forces. Two other neural networks are connected with the control policy network to predict the interaction forces and muscle coordination. To further increase the robustness of the control policy, we employ domain randomization during training that includes not only randomization of exoskeleton dynamics properties but, more importantly, randomization of human muscle strength to simulate the variability of the patient’s disability. Through this decoupled deep reinforcement learning framework, the trained controller of LLREs is able to provide reliable walking assistance to the human with different degrees of neuromuscular disorders. Results and Conclusion: A universal, RL-based walking controller is trained and virtually tested on a LLRE system to verify its effectiveness and robustness in assisting users with different disabilities such as passive muscles (quadriplegic), muscle weakness, or hemiplegic conditions. An ablation study demonstrates strong robustness of the control policy under large exoskeleton dynamic property ranges and various human-exoskeleton interaction forces. The decoupled network structure allows us to isolate the LLRE control policy network for testing and sim-to-real transfer since it uses only proprioception information of the LLRE (joint sensory state) as the input. Furthermore, the controller is shown to be able to handle different patient conditions without the need for patient-specific control parameters tuning.

Download Full-text

Reinforcement Learning and Control of a Lower Extremity Exoskeleton for Squat Assistance

Frontiers in Robotics and AI ◽

10.3389/frobt.2021.702845 ◽

2021 ◽

Vol 8 ◽

Author(s):

Shuzhen Luo ◽

Ghaith Androwis ◽

Sergei Adamovich ◽

Hao Su ◽

Erick Nunez ◽

...

Keyword(s):

Reinforcement Learning ◽

Lower Extremity ◽

Center Of Pressure ◽

Control Policy ◽

Human Interaction ◽

Policy Network ◽

Motion Controller ◽

Interaction Forces ◽

Important Indicator ◽

Control Robustness

A significant challenge for the control of a robotic lower extremity rehabilitation exoskeleton is to ensure stability and robustness during programmed tasks or motions, which is crucial for the safety of the mobility-impaired user. Due to various levels of the user’s disability, the human-exoskeleton interaction forces and external perturbations are unpredictable and could vary substantially and cause conventional motion controllers to behave unreliably or the robot to fall down. In this work, we propose a new, reinforcement learning-based, motion controller for a lower extremity rehabilitation exoskeleton, aiming to perform collaborative squatting exercises with efficiency, stability, and strong robustness. Unlike most existing rehabilitation exoskeletons, our exoskeleton has ankle actuation on both sagittal and front planes and is equipped with multiple foot force sensors to estimate center of pressure (CoP), an important indicator of system balance. This proposed motion controller takes advantage of the CoP information by incorporating it in the state input of the control policy network and adding it to the reward during the learning to maintain a well balanced system state during motions. In addition, we use dynamics randomization and adversary force perturbations including large human interaction forces during the training to further improve control robustness. To evaluate the effectiveness of the learning controller, we conduct numerical experiments with different settings to demonstrate its remarkable ability on controlling the exoskeleton to repetitively perform well balanced and robust squatting motions under strong perturbations and realistic human interaction forces.

Download Full-text

End-to-End Deep Reinforcement Learning for Image-Based UAV Autonomous Control

Applied Sciences ◽

10.3390/app11188419 ◽

2021 ◽

Vol 11 (18) ◽

pp. 8419

Author(s):

Jiang Zhao ◽

Jiaming Sun ◽

Zhihao Cai ◽

Longhong Wang ◽

Yingxun Wang

Keyword(s):

Reinforcement Learning ◽

Network Architecture ◽

Control Method ◽

Control Policy ◽

Input Image ◽

Autonomous Control ◽

Policy Network ◽

Model Free ◽

Control Command ◽

End To End

To achieve the perception-based autonomous control of UAVs, schemes with onboard sensing and computing are popular in state-of-the-art work, which often consist of several separated modules with respective complicated algorithms. Most methods depend on handcrafted designs and prior models with little capacity for adaptation and generalization. Inspired by the research on deep reinforcement learning, this paper proposes a new end-to-end autonomous control method to simplify the separate modules in the traditional control pipeline into a single neural network. An image-based reinforcement learning framework is established, depending on the design of the network architecture and the reward function. Training is performed with model-free algorithms developed according to the specific mission, and the control policy network can map the input image directly to the continuous actuator control command. A simulation environment for the scenario of UAV landing was built. In addition, the results under different typical cases, including both the small and large initial lateral or heading angle offsets, show that the proposed end-to-end method is feasible for perception-based autonomous control.

Download Full-text

Towards Autonomous Reinforcement Learning: Automatic Setting of Hyper-parameters using Bayesian Optimization

CLEI electronic journal ◽

10.19153/cleiej.21.2.1 ◽

2018 ◽

Vol 21 (2) ◽

Cited By ~ 2

Author(s):

Juan Cruz Barsce ◽

Jorge Andrés Palombarini ◽

Ernesto Carlos Martínez

Keyword(s):

Reinforcement Learning ◽

Learning Algorithm ◽

Gaussian Process Regression ◽

Control Policy ◽

Bayesian Optimization ◽

Learning Framework ◽

User Expertise ◽

Key Factor ◽

Self Driving Cars ◽

Reinforcement Learning Algorithm

With the increase of machine learning usage by industries and scientific communities in a variety of tasks such as text mining, image recognition and self-driving cars, automatic setting of hyper-parameter in learning algorithms is a key factor for obtaining good performances regardless of user expertise in the inner workings of the techniques and methodologies. In particular, for a reinforcement learning algorithm, the efficiency of an agent learning a control policy in an uncertain environment is heavily dependent on the hyper-parameters used to balance exploration with exploitation. In this work, an autonomous learning framework that integrates Bayesian optimization with Gaussian process regression to optimize the hyper-parameters of a reinforcement learning algorithm, is proposed. Also, a bandits-based approach to achieve a balance between computational costs and decreasing uncertainty about the \textit{Q}-values, is presented. A gridworld example is used to highlight how hyper-parameter configurations of a learning algorithm (SARSA) are iteratively improved based on two performance functions.

Download Full-text

Modeling And Position Control Of Human Lower Limb Rehabilitation Robot Using Pneumatic Muscle Actuators

10.36541/0231-000-023-009 ◽

2015 ◽

pp. 61

Author(s):

محمد يوسف حسن ◽

شهد صبيح غنتاب

Keyword(s):

Lower Limb ◽

Position Control ◽

Rehabilitation Robot ◽

Pneumatic Muscle ◽

Muscle Actuators ◽

Limb Rehabilitation ◽

Lower Limb Rehabilitation

Download Full-text

Recent Advances on Horizontal Lower Limb Rehabilitation Robot

Recent Patents on Mechanical Engineering ◽

10.2174/2212797610666170421165454 ◽

2017 ◽

Vol 10 (2) ◽

Author(s):

Jingang Jiang ◽

Xuefeng Ma ◽

Biao Huo ◽

Xiaoyang Yu ◽

Xiaowei Guo ◽

...

Keyword(s):

Lower Limb ◽

Rehabilitation Robot ◽

Recent Advances ◽

Limb Rehabilitation ◽

Lower Limb Rehabilitation

Download Full-text

Impedance Control Based Sliding Mode for Lower Limb Rehabilitation Robot

Applied Mechanics and Materials ◽

10.4028/www.scientific.net/amm.672-674.1770 ◽

2014 ◽

Vol 672-674 ◽

pp. 1770-1773 ◽

Cited By ~ 1

Author(s):

Fu Cheng Cao ◽

Li Min Du

Keyword(s):

Dynamic Response ◽

Sliding Mode Control ◽

Lower Limb ◽

Control Method ◽

Sliding Mode ◽

Impedance Control ◽

Rehabilitation Robot ◽

Active Rehabilitation ◽

Limb Rehabilitation ◽

Lower Limb Rehabilitation

Aimed at improving the dynamic response of the lower limb for patients, an impedance control method based on sliding mode was presented to implement an active rehabilitation. Impedance control can achieve a target-reaching training without the help of a therapist and sliding mode control has a robustness to system uncertainty and vary limb strength. Simulations demonstrate the efficacy of the proposed method for lower limb rehabilitation.

Download Full-text

Integrated design of a lower limb rehabilitation mechanism using differential evolution

Computers & Electrical Engineering ◽

10.1016/j.compeleceng.2021.107103 ◽

2021 ◽

Vol 92 ◽

pp. 107103

Author(s):

José Saúl Muñoz-Reina ◽

Miguel Gabriel Villarreal-Cervantes ◽

Leonel Germán Corona-Ramírez

Keyword(s):

Differential Evolution ◽

Lower Limb ◽

Integrated Design ◽

Limb Rehabilitation ◽

Lower Limb Rehabilitation

Download Full-text

Lower limb rehabilitation equipment with animation performance for isotonic and isokinetic exercises

10.1063/5.0027586 ◽

2020 ◽

Author(s):

Nurul Hasyikin Hasmuni Chew ◽

Siti Marwangi Mohamad Maharum ◽

Zuhanis Mansor ◽

Irfan Abd Rahim

Keyword(s):

Lower Limb ◽

Limb Rehabilitation ◽

Lower Limb Rehabilitation

Download Full-text

Optimal fuzzy logic-based control strategy for lower limb rehabilitation exoskeleton

Applied Soft Computing ◽

10.1016/j.asoc.2021.107226 ◽

2021 ◽

pp. 107226

Author(s):

Richa Sharma ◽

Prerna Gaur ◽

Shaurya Bhatt ◽

Deepak Joshi

Keyword(s):

Fuzzy Logic ◽

Control Strategy ◽

Lower Limb ◽

Limb Rehabilitation ◽

Lower Limb Rehabilitation

Download Full-text

Mechatronic Exoskeletons for Lower-Limb Rehabilitation: An Innovative Review

2021 IEEE International IOT, Electronics and Mechatronics Conference (IEMTRONICS) ◽

10.1109/iemtronics52119.2021.9422513 ◽

2021 ◽

Author(s):

Deyby Huamanchahua ◽

Yerson Taza-Aquino ◽

Jhon Figueroa-Bados ◽

Jason Alanya-Villanueva ◽

Adriana Vargas-Martinez ◽

...

Keyword(s):

Lower Limb ◽

Limb Rehabilitation ◽

Lower Limb Rehabilitation

Download Full-text