HyQ: compliance and cerebellum-inspired control

Reflex-based locomotion on a 90 kg hydraulic quadruped, studied during a research stay at IIT Genoa.

September 2018 · Ghent University — IIT — HBP RoboticsLegged RobotsCompliant RoboticsNeuroroboticsMachine LearningReinforcement LearningControl & PlanningPython
← All articles

Context

Between June and September 2018 I spent four months as a visiting researcher in the Dynamic Legged Systems (DLS) group at IIT, Genoa, under Claudio Semini and Victor Barasuol, as part of my PhD at Ghent University and within the Human Brain Project. The DLS group designs and operates HyQ, a hydraulically actuated 90 kg quadruped whose active-impedance joints can emulate any ratio of stiffness and damping. I joined the team working on HyQ’s locomotion stack and ran two series of experiments that later became the two papers summarised below: one on the role of joint compliance, one on a cerebellum-inspired stance controller.

Method

Both papers share the same reflex-based architecture, built on top of HyQ’s Reactive Controller Framework (RCF). Three layers:

Three-layer control architecture diagram
The shared three-layer architecture: robot + active compliance (bottom), reflex-based motion controller (middle), cerebellum-inspired posture module (top).

Paper 1: Effect of compliance on morphological control

Using this architecture as a probe, I swept 3,600 impedance settings on HyQ in simulation to understand how does compliance interact with the control task and if we could illustrate a transfer of morphological computation to compliant bodies?

We observed that the cost of Transport grows almost exponentially with stiffness in the closed-loop system. In other words, the biologically-inspired control architecture shows good performance only on compliant bodies.

COT, power, speed and accuracy as a function of stiffness
This figure shows the end-effector trajectories of the right front foot (red), the left front foot (purple), and also the trunk’s center of mass (green). The experiment is repeated for two compliance values discussed in the paper: compliant (top), and stiff (bottom). It shows a deteriorated locomotion cycle at higher stiffness.

More details are discussed in the paper Effect of Compliance on Morphological Control of Dynamic Locomotion with HyQ.

Paper 2: Cerebellum-inspired stance control

In a second paper, I used the same control architecture to answer if a biologically-inspire cerebellum controller (top layer of the architecture) was helping to stabilize the locomotion control. To that goal, I implemented two different architecture for that cerebellum controller:

Over twelve experimental trials on the real HyQ, all PSE trials reproduced the target gait; five out of six RSE trials diverged. From a dynamic point of view, this indicates that reflex-based locomotion requires the clock signal to stabilize its limit cycle and achieve robust locomotion.

NRMSE of neural predictions for PSE and RSE trials
NRMSE of the reflex-network predictions. Green: PSE trials, stable through the test phase. Red: RSE trials, diverging during closing/testing.

More insights are provided in the paper Stance Control Inspired by Cerebellum Stabilizes Reflex-Based Locomotion on HyQ.

Going further

Alongside the two published studies, I also got HyQ to locomote end-to-end in simulation using deep reinforcement learning, specifically using TRPO (Trust Region Policy Optimization), on the same Gazebo simulation model as the experiments presented here. The results and the code were never published however.

To follow-up on this subject: