Rough-Terrain Locomotion and Unilateral Contact Force Regulations With a Multi-Modal Legged Robot
←
→
Page content transcription
If your browser does not render page correctly, please read the page content below
Rough-Terrain Locomotion and Unilateral Contact Force Regulations
With a Multi-Modal Legged Robot
Kaier Liang, Eric Sihite, Pravin Dangol, Andrew Lessieur, and Alireza Ramezani1
ut Thruster Forces
Abstract— Despite many accomplishments by legged robot
designers, state-of-the-art bipedal robots are prone to falling
over, cannot negotiate extremely rough terrains and cannot Thruster Actuator
directly regulate unilateral contact forces. Our objective is to
integrate merits of legged and aerial robots in a single platform.
arXiv:2103.15952v1 [cs.RO] 29 Mar 2021
We will show that the thrusters in a bipedal legged robot called Thruster
Harpy can be leveraged to stabilize the robot’s frontal dynamics
and permit jumping over large obstacles which is an unusual
Hip Frontal
capability not reported before. In addition, we will capitalize on
Actuator
the thrusters action in Harpy and will show that one can avoid
using costly optimization-based schemes by directly regulating
contact forces using an Reference Governor (RGs). We will Hip Sagi al
resolve gait parameters and re-plan them during gait cycles by Actuator
only assuming well-tuned supervisory controllers. Then, we will
focus on RG-based fine-tuning of the joints desired trajectories Knee Actuator
to satisfy unilateral contact force constraints.
ug Ground Reaction
I. I NTRODUCTION Force
Raibert’s hopping robots [1] and Boston Dynamics’ robots
[2] are amongst the most successful examples of legged
robots, as they can hop or trot robustly even in the presence
of significant unplanned disturbances. Other than these suc-
Fig. 1. Illustration of a concept design for Harpy, a thruster-assisted bipedal
cessful examples, a large number of underactuated and fully robot designed by the authors to study robust, efficient and agile legged
actuated bipedal robots have also been introduced. Agility robotics.
Robotics’ Cassie [3], Honda’s ASIMO [4] and Samsung’s
Mahru III [5] are capable of walking, running, dancing and very limited in that, for example, the height of terrain bumps
going up and down stairs, and the Yobotics-IHMC [6] biped should not exceed the size of the legs [7], [8], [9]. However,
can recover from pushes. Despite these accomplishments, a legged robot maintains a superior energetic efficiency of
all of these systems are prone to falling over and cannot locomotion because its overall body weight is supported by
negotiate extremely rough terrains. Even humans, known the legs, can safely operate inside buildings and has no sharp,
for their natural, dynamic and robust gaits cannot recover rotating blades to cause severe laceration injuries to humans.
from severe rough terrain perturbations, external pushes Thruster-assisted legged locomotion has not been explored
or slippage on icy surfaces. Our goal is to enhance the previously except to a limited extent in a few examples
robustness of these systems through a distributed array of that only considered the hardware-related challenges [10].
thrusters and nonlinear control. These robots potentially can offer rich and challenging dy-
In this paper, we will report our efforts in designing namics and control problems. The overactuation and control
closed-loop feedback for the thruster-assisted walking of a allocation problems led by the coexistence of thrusters and
legged system called Harpy (shown in Fig. 1), currently its joint actuators not only can provide opportunities to study
hardware being developed at Northeastern University. This interesting control ideas, but also, from a dynamical behavior
biped is equipped with a total of eight actuators, and a pair standpoint, can permit studying unexplored behaviors such
of coaxial thrusters fixed to its torso. Our motivation stems as walking under buoyancy phenomena [11]. Also, studying
from the merits of aerial and legged systems and we intend multi-modal systems that can switch from one mode to
to integrate these merits in a single platform. Contrary to another in order to overcome the demanding mobility objec-
fixed- or rotary-wing aerial systems, legged robots cannot tives in unstructured environments is a rather new research
exhibit a fast mobility and fly over obstacles. A legged problem and potentially can result in interesting machine-
robot’s capability to negotiate extremely bumpy terrains (e.g., learning and optimization-based motion planning problems
semi-collapsed buildings in the aftermath of an earthquake) is [12].
1 SiliconSynapse Laboratory, ECE Department, Northeastern Univer-
From a feedback design standpoint, the challenge of simul-
sity, Boston, MA, USA. emails: {liang.k, e.sihite, dangol.p, lessieur.a, taneously providing asymptotic stability and gait feasibility
a.ramezani} @northeastern.edu constraints satisfaction in legged systems have been exten-sively addressed [13]. For instance, the method of Hybrid
Zero Dynamics (HZD) has provided a rigorous model-based
approach to assign attributes such as efficiency of locomotion
in an off-line fashion. Other attempts entail optimization-
based, nonlinear approaches to secure safety and perfor-
mance of legged locomotion [14], [15], [16], [17], [18], [19].
Thrusters can result in unparalleled capabilities. For in-
stance, gait trajectory planning (or re-planning), control and
unilateral contact force regulation can be treated significantly
differently as we have shown previously [20], [21], [22]
and will further discuss new details in this paper. That
said, real-time gait trajectory design in legged robots has
been widely studied and the application of optimization-
based methods is very common [23]. In general, in these
paradigms, an optimization-based controller adjusts the gait
parameters throughout the whole gait cycle such that not only
the robot’s posture is adjusted to accommodate the unplanned
posture adjustments but also the joints position, velocity and Fig. 2. Shows the joint movements, key positions (pi ) and dimensions (li )
acceleration are modified to avoid slipping into infeasible of Harpy. The non-conservative forces and torques acting on the system are
scenarios, e.g., the violation of contact forces. What makes denoted by ui .
these methods further cumbersome is that they are widely
defined based on Whole Body Control (WBC) which can This paper is outlined as follows: the dynamic modeling
lead to computationally expensive algorithms [24]. for Harpy and the reduced-order models which will be used
These problems are widely known to suffer from curse of in the numerical simulation and controller design, the discus-
dimensionality and other popular paradigms such as Approx- sion on thruster assisted locomotion, numerical simulation
imate Dynamic Programming (ADP) [25], Reinforcement discussions, and then followed by the concluding remarks.
Learning (RL) [26], decoupled approaches to design control
for nonlinear stochastic systems [27], etc., can potentially II. DYNAMIC M ODELING OF H ARPY
remedy the challenges. However, these approaches are far This section outlines the dynamics formulation of the robot
from providing any practical solutions to the problem in hand which is used in the numerical simulation in Section IV, in
and they are shown to be only effective on simpler practical addition to the reduced order models which are used in the
robots mainly those that can only demonstrate quasi-static controller design. Fig. 2 shows the kinematic configuration
gaits. of Harpy which listed the center of mass (CoM) positions
We will capitalize on the thrusters action in Harpy and of the dynamic components, joint actuation torques, and
will show that one can limit the use of costly optimization- thruster torques. The system model has a combined total of
based schemes by directly regulating contact forces. We 12 degrees-of-freedoms (DoFs): 6 for the body and 3 on each
will resolve gait parameters and re-plan them during the leg. Due to the symmetry, the left and right side of the robot
whole Single Support (SS) phase, which is the longest phase follow a similar derivations so only the general derivations
in a gait cycle, by only assuming well-tuned supervisory are provided in this section.
controllers found in [28], [29], [30] and by focusing on
fine-tuning the joints desired trajectories to satisfy unilateral A. Euler-Lagrange Formalism
contact force constraints. To do this, we will devise inter- The Harpy equations of motion are derived using Euler-
mediary filters based on the celebrated idea of Explicit Ref- Lagrangian dynamics formulation. In order to simplify the
erence Governors (ERG) [31], [32], [33], [34]. ERGs relied system, each linkages are assumed to be massless and the
on provable Lyapunov stability properties can perform the mass are concentrated at the body and the joint motors.
motion planning problem in the state space in a much faster Consequently, the lower leg kinematic chain is considered to
way than widely used optimization-based methods. That said, be massless which significantly simplifies the system. The
these ERG-based gait modifications and impact events (i.e., three leg joints are labeled as the hip frontal (pelvis P ), hip
impulsive effects) can lead to severe deviations from the sagittal (hip H) and knee sagittal (knee K), as illustrated in
desired periodic orbits and standard legged robots cannot Fig. 2. The thrusters are also considered to be massless and
sustain these perturbations. Previously, we demonstrated that capable of providing forces in any directions to simplify the
the thrusters can be leveraged to enforce hybrid invariance in problem.
a robust fashion by applying predictive schemes within the Let γh be the frontal hip angle while φh and φk be the
Double Support (DS) phase [22]. Last, we also will show sagittal hip and knee angles respectively. Let the superscript
that thrusters can be leveraged to stabilize frontal dynamics {B, P, H, K} represent the frame of reference about the
and permit jumping over large obstacles which is an unusual body, pelvis, hip, and knee while the inertial frame is repre-
capability not reported before. sented without the superscript. Let RB be the rotation matrixfrom the body frame to the inertial frame (i.e. x = RB xB ). ug is the ground reaction forces (GRFs). The variables M ,
The pelvis motor mass is added to the body mass. Then h, Bt , and Bg are a function of the full system states:
the positions of the hip and knee CoM are defined using >
x = [rB , q > , φKL , φKR , ωB
B>
, q̇ > , φ̇KL , φ̇KR ]> , (6)
kinematic equations:
where the vector rB contains the elements of RB . Using
pP = pB + RB l1B , pH = pP + RB Rx (γh ) l2P
(1) Bj = [06×6 , I6×6 ] allows uj to actuate the joint angles
pK = pH + RB Rx (γh ) Ry (φh )l3H , directly. Let v = [ωB B>
, q̇ > ]> be the velocity of the gen-
where Rx and Ry are the rotation matrices about the x and eralized coordinates, then Bt and Bg can be defined using
y axis respectively, l is the length vectors representing the the virtual displacement from the velocity as follows:
conformation of Harpy which are constant in their respective
> >
∂ ṗTL /∂v ∂ ṗFL /∂v
local frame of reference. The foot and thruster positions are Bt = ∂ ṗTR /∂v , Bg = ∂ ṗFR /∂v . (7)
defined as: 02×6 02×6
pF = pK + RB Rx (γh ) Ry (φh ) Ry (φk ) l4K
(2) The vector ut = [u> > >
tL , utR ] is formed from the left and
pT = pB + RB ltB thruster forces utL and utR , respectively.
The GRF is modeled using the unilateral compliant ground
where the length vector from the knee to the foot is l4K =
model with undamped rebound while the friction is modeled
[−l4a cos φk , 0, −(l4b + l4a sin φk )]> which is the kinematic
using the Stribeck friction model, defined as follows:
solution to the parallel linkage mechanism of the lower
leg. Let ωB be the angular velocity of the body. Then ug,z = − kg,p pF,z − kg,d ṗF,z
the angular velocities of the hip and knee are defined as: |ṗF,x |2
ωHB
= [γ̇h , 0, 0]> + ωB
B
and ωK H
= [0, φ̇h , 0]> + ωH H
. ug,x = − µc + (µs − µc ) exp − fz sgn(ṗF,x )
vs2
Finally, the energy of the system for the Lagrangian
− µv ṗF,x ,
dynamics formulation are defined as follows: (8)
1P
> i> ˆ i
where pF,x and pF,z are the x and z components of the
K= m i p i p i + ω i I i ω i inertial foot position, kg,p and kg,d are the spring and
2 Pi∈F (3)
> > damping model for the ground, µc , µs , and µv are the
V = − i∈F mi pi [0, 0, −g] ,
Coulomb, static, and viscous friction coefficient respectively,
where F = {B, HL , KL , HR , KR } are the relevant frame and vs is the Stribeck velocity. kg,d = 0 if ṗF,z > 0 for the
of references and mass components (body, hip and knee of undamped rebound model and the friction in the y direction
each side), and the subscripts L and R represent the left follows a similar derivation to ug,x . Then the ground force
and right side of the robot. Furthermore, Iˆi is the inertia model ug is defined as follows:
about its local frame, and g is the gravitational constant.
This forms the Lagrangian of the system L = K − V which ug = [u> > >
gL H(−pFL ,z ), ugR H(−pFR ,z )] , (9)
is used to derive the system’s Euler-Lagrangian equations of where H(x) is the heaviside function, while ugL and ugR
motion. The dynamics of the body angular velocity is derived are the left and right ground forces which are formed using
using the modified Lagrangian for rotation in SO(3) to avoid their respective ug,x , ug,y , and ug,z . Finally, the full system
using Euler angles and the potential gimbal lock associated equation of motion can be derived using (4) to (9) to form
with them. This results in the following equations of motion ẋ = f (x, uj , ut , ug ).
following Hamilton’s principle of least action:
B. Reduced-Order Models
d ∂L B ∂L P3 ∂L
B
+ ωB × B
+ j=1 rBj × = u1 The following reduced-order models are used: a variable
dt ∂ωB ∂ωB ∂rBj
length inverted pendulum (VLIP) model and a two-body
d ∂L ∂L d B pendulum model, as illustrated in Fig. 3. The VLIP model is
− = u2 , RB = RB [ωB ]× ,
dt ∂ q̇ ∂q dt used to describe walking with an ERG, while the two-body
(4) pendulum model is used to describe the flight phase with
>
where [ · ]× is the skew operator, RB = [rB1 , rB2 , rB3 ], a Model Predictive Control (MPC) to track trajectory and
q = [p> B , γhL , γhR , φhL , φhR ]
>
is the dynamical system regulate the appropriate leg postures at the time of landing.
B
states other than (RB , ωB ), and u is the generalized forces. 1) Variable-Length Inverted Pendulum (VLIP) Model:
The knee sagittal angle φk which is not associated with As shown in Fig. 3a, the model is described simply using
any mass is updated using the knee joint acceleration input the inverted pendulum model where the length of r can be
uk = [φ̈kL , φ̈kR ]> . Then the system acceleration can be adjusted through the change in leg conformation. The center
derived as follows: of pressure, c, is defined as the weighted average position of
M a + h = Bj uj + Bt ut + Bg ug (5) the foot, c = λL pFL + λR pFR , where λi = ugi ,z /(ugL ,z +
ugR ,z ), i ∈ {L, R}. Harpy is modeled using a point foot, so
where a = [ω̇BB>
, q̈ > , φ̈kL , φ̈kR ]> , ut is the thruster force, c is equal to the stance foot position during the SS phase.
uj = [uPL , uPR , uHL , uHR , u> >
k ] is the joint actuation, and The VLIP model without thrusters is underactuated, but theaddition of thrusters makes the system fully actuated and
enables it to do trajectory tracking. pB Jsλ
The dynamic model is derived as follows:
pB
mp̈B = mg + ut,c + Js> λ (10) ut pH θ
r
mg
where m is the mass of the pendulum which in this case is
the total mass of the system, and ut,c is the thruster forces q pK
about the body CoM. The constraint force Js> λ is setup to
keep the leg length r equal to the leg conformation using the c m1 g
following constraint equation: m2 g
(a) (b)
Js (p̈B − c̈) = ur , Js = (pB − c)> , (11)
Fig. 3. Illustrates two reduced-order models, a variable-length inverted
which is designed to keep the leg length second derivative pendulum (VLIP) and a two-body pendulum model, which are used to
describe the dominant dynamic response of the system during the walking
equal to ur . This constraint force also forms the GRF as and ballistic motions, respectively.
long as the friction cone constraint is satisfied. Assuming no
slip (c̈ = 0), then the inputs to the system are ur which
controls the body position about the vector r = pB − c by parameters P0 , P2 , and P4 define the initial, middle, and
adjusting the leg length, and the thrusters ut which controls final positions of the gait. Each swing and stance feet-end
the remaining DoFs. have a constant Bezier curve parameters to form the 2D
2) Two-body Pendulum Model: As shown in Fig. 3b, the open-loop walking gait. The following parameters defined
model is described as a planar double pendulum in the x- in the x-z plane are used: P0,sw = [−0.21, −0.60]> m ,
z plane where the mass is concentrated in the body CoM P2,sw = [−0.20, −0.50]> m, and P4,sw = [0.10, −0.60]> m
and knee, m1 and m2 respectively. Here, the mass of the for the swing foot, and P0,st = [0.10, −0.60]> m , P2,st =
pelvis motors are combined into the m1 while the mass [0.01, −0.63]> m, and P4,st = [−0.21, −0.60]> m for the
of both hips are combined into m2 to ensure a similar stance foot. These Bezier parameters define the feet positions
kinematics behavior. The control action is defined as udp = and then the joint angles are found simply by resolving
[ux , uz , uh ]> where ux and uz are the sum of thruster the corresponding inverse kinematics problem. Finally, these
forces in the x and z directions, respectively, and uh is the joint trajectories are tracked using an asymptotically stable
sagittal hip motor torque. The model uses the following states controller [35].
qdp = [pB,x , pB,z , q, θ]> , where q is the hip joint angle The thrusters are used to stabilize the roll and yaw motion
and θ is the body absolute pitch angle. Then the system of the robot using the following controller
acceleration can be derived as follows:
utL ,F = [uyaw , 0, uroll ]> , utR ,F = −utL ,F , (13)
Mdp q̈dp + hdp = Bdp udp , (12)
where utL ,F and utR ,F are the left and right thruster force
where Bdp = [I3×3 , 01×3 ], allows the direct control signal components for frontal dynamics stabilization, while uroll
to the system states except θ. and uyaw are simple PD controllers to track zero roll and yaw
reference angles. This controller is sufficient to stabilize the
III. C ONTROL OF T HRUSTER -A SSISTED L OCOMOTION
frontal dynamics and the robot’s heading even when using
This section discusses and outlines the applications of a gait designed for a 2D bipedal robot. During walking, the
thrusters in our robot, particularly to stabilizes the frontal combined thruster forces are formed by combining ut,c in
dynamics, apply the ERG framework to regulate GRFs, and (10) and (13), as follows
to perform ballistic motions to avoid obstacles.
ut = [u> > > > > >
t,c , ut,c ] /2 + [utL ,F , utR ,F ] . (14)
A. Frontal Dynamics Stabilization
In order to show the application of thruster-assisted walk- B. Thruster-Assisted Enforcement of GRF Constraints Using
ing in our robot, we use a walking gait designed for a planar RG-based Methods
bipedal robot. The absence of the frontal dynamics means The ERG framework is utilized to enforce the friction
that this gait is unstable for 3D walking. Hence the thrusters pyramid constraint of the robot by manipulating the applied
are used to stabilize the robot’s roll and yaw to achieve a reference to the controller, which is useful to avoid using
stable walking gait in the full 3D system. optimization or nonlinear MPC framework to enforce con-
The gait is designed by constraining the system dynamics straints on the harder-to-model GRFs. The VLIP model in
in the x-z plane using 4th order Bezier polynomials to (10) can be fully actuated due to the addition of thrusters,
define the feet-end positions relative to the hip in the robot which enables us to utilize a more advanced controller such
body frame (pB B
F − pP ). Let Pi , i ∈ {0, . . . , 4} be the as ERG.
Bezier polynomial parameters. This 4th order polynomial The ERG manipulates the applied reference (xw ) to avoid
can be constrained to have zero initial and final gait ve- violating the constraint equation hw (x, xw ) ≥ 0 while
locity by setting P0 = P1 and P3 = P4 . Then the free also be as close as possible to the desired reference (xr ),as illustrated in Fig. 4. Consider the Lyapunov equation hw < 0 V
V = (xr − xw )> P (xr − xw ). xw is updated through the hr < 0 min
update law:
ẋw = vr + vt + vn , (15)
vn xr
where vr drives xw directly to xr , while vt and vn drives xw vr V=0
along the surface and into the constraint equation boundary xw vt
hw = 0, respectively. The objective of this ERG algorithm is xw,t hw = 0
to drive xw to the state xw,t which is the minimum energy hw > 0
solution Vmin that satisfies the constraint hw ≥ 0.
Let hr (x, xr ) = Jr xr + dr ≥ 0 be the constraint Fig. 4. Shows how an Explicit Reference Governor (ERG) can manipulate
equation using the desired reference xr , and Cr be the the applied reference states xw to be as close as possible to the desired
reference xr while the constraint equations hw ≥ 0 remain feasible.
rowspace of the violated constraints of hr . Define Nr =
null(Cr ) = [n1 , . . . , nn ] where n is the size of the nullspace.
Additionally, let rk be the k’th row of Jr . Then the following is aligned about the body CoM sagittal axis. Therefore, the
update law is used for vr , vt , and vn : reduced-order model in (5) is used to model the robot during
the ballistic motion which is represented as a two-body
vr = α̂r (xr − xw ), vn = α̂n rk rk> (xr − xw )
Pn (16) pendulum system. Then an MPC framework is developed
vt = k=1 α̂t nk n>
k (xr − xw ), based on this model to regulate the feet-end positions and
where α̂ are scalars defined as follows: ensuring the proper landing configuration.
( The hip frontal and knee angles are setup to be constant
αr , if min(hw ) ≥ 0 or min(hr ) ≥ 0 during the ballistic maneuver and the feet are positioned in a
α̂r =
0, else neutral stance configuration within which legs are parallel to
( each other. An MPC with a prediction and control horizons
αt , if min(hw ) ≥ 0 or min(hr ) < 0
α̂t = Nh = 10 is developed using the following optimizer
0, else (17) PNh k>
min k=0 (e We ek + ∆uk> k
dp Wu ∆udp )∆t
αn ,
if min(hw ) ≤ min(hr ) < 0 udp
α̂n = −αn , if min(hr ) < min(hw ) < 0 subject to ẋdp = f (xdp , udp ) (19)
k+1 k
0, else, qref − θ + qc = 0
where α is a positive scalar which determines the rate of |udp | − udp,max ≤ 0,
convergence. where e = [p> > >
B −pB,ref , q −qref ] is the tracking error, We
Assuming ẋr = 0 and using the update law defined from and Wu are the cost weighting matrices, and qc is a constant
(16) and (17), we willP have V̇ = −2(xr −xw )> Q (xr −xw ), angle to determine the desired landing posture. qref denotes
n
with Q = P (α̂r I + k=1 α̂t nk n> >
k + α̂n rk rk ). We have the reference hip sagittal angle which is updated each time
the gradient V̇ = 0 if min(hw ) ≤ 0 and nk ⊥(xr − xw ), step relative to the body pitch angle θ and qc . This reference
while V̇ < 0 when min(hr ) ≥ 0 or when min(hw ) ≥ 0. In can ensure the legs do not lag behind the body at the landing
case both applied reference and target constraints equation moment, and qc can be properly adjusted to achieve the
are violated, we have V̇ > 0 which drives the xw towards the desired landing posture. The references pB,ref are designed
constraint boundary. This allows the xw to converge to xw,t as a ballistic motion for the body CoM as follows
which is the minimum energy solution that satisfies hw ≥ 0 >
as illustrated in Fig. 4. pk+1 k
B,ref = pB,ref + ∆t [a, b sin(2πt/1.5)] (20)
The GRF constraints for this robot can be derived from where pB denotes the body position in the x-z plane, while a
the reduced-order model constraint equation in (11) by ap- and b are some constants, and ∆t is the controller time step.
plying the ground pyramid constraint. We use the following Finally, the resulting udp = [ux , uz , uh ] is fed to the full
constraints for the ERG: model, where the thruster forces ux and uz are combined
|ug,x |≤ µs ug,z , |ug,y |≤ µs ug,z , ug,z ≥ uz,min , (18) with the roll and yaw stabilization controller in (13). This
forms the combined thruster forces
where [ug,x , ug,y , ug,z ]> = Js> λ is the ground reaction force
model from (10) and (11). This forms the ground friction ut = [ux , 0, uz , ux , 0, uz ]> /2 + [u> > >
t,l , ut,r ] , (21)
pyramid constraint and the minimum ground normal force which tracks the flight trajectory and stabilizes the robot’s
acting on the leg to ensure that the foot doesn’t slip. roll and heading angles.
C. MPC Design and Control of Robot’s Ballistic Motion IV. S IMULATION R ESULTS
The thrusters can also be used to traverse rough terrain by The optimization and numerical simulation are done in
simply flying over obstacles. However, it is not possible to Matlab where we used interior-point algorithm and RK4
stabilize pitch dynamics due to the thrusters position which scheme, respectively. The MPC and the ERG filter are(a)
(b)
(c)
Fig. 5. Shows the simulation result for the walking and jumping maneuvers. (a) Illustrates the stick-diagram plot when the robot walks on flat ground
and jumps over obstacles. (b) Depicts the states evolution, the thrusters action and ground contact forces. Note that the thrusters are employed to stabilize
the frontal dynamics. The RG algorithm is disabled during the ballistic motion (grey area). (c) Close-up view of the constraints at t = [9.5, 11.5]s where
the applied references (xw ) are manipulated such that the constraints (hw (x, xw ) ≥ 0) remain feasible.
simulated using a zero-order hold at a frequency 20 times and kg,d = 268. The frequency of MPC controller applied
smaller than the numerical integration to better capture the in the two-body pendulum model is 100 Hz with a 10
behavior of the onboard computer in Harpy. steps prediction and control horizons. The entire simulation
contains 16 gait cycles (12 s) using the following sequence:
A. Simulation Specifications 3 walking steps, 1 quiet standing cycle, 2 jumping cycles, 4
In this section, all units are in N, kg, m, s. The robot walking steps, 1 quiet standing cycle, 2 jumping cycles, and
has the following dimensions: l1 = [0, 0.1, −0.1]> , l2 = 3 walking steps. The weighting matrices for the MPC are
[0, 0.5, 0]> , l3 = [0, 0, −0.3]> , and l4 = [0, 0.1, 0]> for the We = diag(5, 20), Wu = diag(1/10, 1/10, 1/5), qc = 23◦ ,
left side. The right side simply has the y axis component and the parameters for the jumping trajectories are a = 0.9,
inverted. This gives the robot CoM height of around 0.6 m b = 0.8 for the first jumping cycle, and a = 1.2, b = 0.3 for
for standing and walking. The following mass and inertias the second jump. The ERG is applied using α = 5 which
are used: mB = 2, mH = mK = 0.5, IB = 10−3 , provides a sufficient convergence rate.
IH = IK = 10−4 . Finally, the following ground parameters
are used: µs = 0.6, µc = 0.54, µv = 0.85, kg,p = 8000,B. Simulation Results and Discussions [3] Y. Gong, R. Hartley, X. Da, A. Hereid, O. Harib, J.-K. Huang, and
J. Grizzle, “Feedback control of a cassie bipedal robot: Walking,
The simulation results can be seen in Fig. 5, where Fig. 5a standing, and riding a segway,” in 2019 American Control Conference
shows the key frames of the simulated robot trajectory where (ACC), 2019, pp. 4559–4566.
[4] M. Hirose and K. Ogawa, “Honda humanoid robots development,”
it walks and jumps over obstacles. Figure 5b shows the data Philosophical Transactions of the Royal Society A: Mathematical,
of the thruster forces, ground normal forces, and body states Physical and Engineering Sciences, vol. 365, no. 1850, pp. 11–19,
during the simulation. The walking gait designed for a 2D 2006.
bipedal robot is stable when used in the full 3D system [5] W. Kwon et al., “Biped humanoid robot mahru iii,” in IEEE-RAS
International Conf. on Humanoid Robots. IEEE, 2007, pp. 583–588.
as the pitch and yaw angles are stabilized by the thruster [6] J. E. Pratt et al., “The yobotics-ihmc lower body humanoid robot,” in
actions. Additionally, the trajectory of the body positions IEEE/RSJ International Conference on Intelligent Robots and Systems,
match closely towards the desired trajectories throughout 10 2009, pp. 410–411.
[7] H.-W. Park, A. Ramezani, and J. W. Grizzle, “A finite-state ma-
the walking and flight phases. The robot’s pitch angle is chine for accommodating unexpected large ground-height variations
uncontrollable as shown in Fig. 5b throughout the entire in bipedal robot walking,” IEEE Transactions on Robotics, vol. 29,
simulation. However, the MPC has successfully regulated no. 2, pp. 331–345, 2012.
[8] B. G. Buss, A. Ramezani, K. A. Hamed, B. A. Griffin, K. S. Galloway,
the foot landing positions such that the foot is positioned and J. W. Grizzle, “Preliminary walking experiments with under-
below or in front of the body at the time of landing through actuated 3d bipedal robot marlo,” in 2014 IEEE/RSJ International
the appropriate hip sagittal angles, as shown in Fig. 5a. Conference on Intelligent Robots and Systems, 2014, pp. 2529–2536.
This allows a smooth transition between landing and walking [9] H.-W. Park, K. Sreenath, A. Ramezani, and J. W. Grizzle, “Switching
control design for accommodating large step-down disturbances in
which is the primary objective of using the MPC. bipedal robot walking,” in 2012 IEEE International Conference on
The ERG is used to regulate the ground friction forces Robotics and Automation, 2012, pp. 45–50.
by manipulating the applied state references during walking [10] G. Picardi, H. Hauser, C. Laschi, and M. Calisti, “Morphologically
induced stability on an underwater legged robot with a deformable
to prevent slips. Figure 5b shows the constraint equation body,” The International Journal of Robotics Research, 2019.
values using the applied and target reference (hw and hr [11] Y. Kojio et al., “Walking control in water considering reaction forces
respectively), where the target reference trajectories satisfy from water for humanoid robots with a waterproof suit,” in 2016
IEEE/RSJ International Conference on Intelligent Robots and Systems
the constraints except at around the time range of t = (IROS), 2016, pp. 658–665.
[10, 12]s as shown in Fig. 5c. Within this time range, the [12] B. Araki, J. Strang, S. Pohorecky, C. Qiu, T. Naegeli, and D. Rus,
desired reference trajectory results in min(hr ) < 0 and the “Multi-robot path planning for a swarm of robots that can both fly
and drive,” in 2017 IEEE International Conference on Robotics and
ERG has successfully manipulate the applied reference xw Automation (ICRA), 2017, pp. 5575–5582.
such that the constraint equation hw ≥ 0 is satisfied. The [13] E. Westervelt and J. Grizzle, Feedback Control of Dynamic Bipedal
control output of the ERG is disabled during the jumping Robot Locomotion, ser. Control and Automation Series. CRC
PressINC, 2007.
period (shaded gray in Fig. 5b) as the thruster components
[14] K. Galloway, K. Sreenath, A. D. Ames, and J. W. Grizzle, “Torque sat-
for position stabilization is handled by the MPC. uration in bipedal robotic walking through control lyapunov function-
based quadratic programs,” IEEE Access, vol. 3, pp. 323–332, 2015.
V. C ONCLUSIONS AND F UTURE W ORK [15] H. Dai and R. Tedrake, “Planning robust walking motion on uneven
terrain via convex optimization,” in IEEE-RAS International Confer-
The concept design and dynamics simulation of a thruster- ence on Humanoid Robots (Humanoids), 11 2016, pp. 579–586.
assisted bipedal robot called Harpy is presented in this paper. [16] S. Feng, E. Whitman, X. Xinjilefu, and C. G. Atkeson, “Optimization
based full body control for the atlas robot,” in IEEE-RAS International
The addition of the thrusters allows the robot to stabilize its Conference on Humanoid Robots, 11 2014, pp. 120–127.
frontal dynamics easily and be able to regulate the ground [17] A. Ramezani, J. W. Hurst, K. Akbari Hamed, and J. W. Grizzle, “Per-
contact forces directly. To do this, a control algorithm based formance analysis and feedback control of atrias, a three-dimensional
bipedal robot,” Journal of Dynamic Systems, Measurement, and Con-
on Reference Governors (RGs) was proposed. In addition, trol, vol. 136, no. 2, 2014.
the thrusters allowed multimodal locomotion where the robot [18] A. Ramezani, “Feedback control design for marlo, a 3d-bipedal robot.”
can transition between walking and jumping over obstacles. Ph.D. dissertation, 2013.
[19] J. Grizzle, A. Ramezani, B. Buss, B. Griffin, K. A. Hamed, and
Through simulations we showed that the integration of the K. Galloway, “Progress on controlling marlo, an atrias-series 3d
thrusters can yield a dynamically robust biped. Future work underactuated bipedal robot,” Dynamic Walking, 2013.
includes the completion of Harpy. In addition, we will [20] P. Dangol, A. Ramezani, and N. Jalili, “Performance satisfaction in
midget, a thruster-assisted bipedal robot,” in 2020 American Control
explore the cost of transport in multi-modal systems. Fur- Conference (ACC), 2020, pp. 3217–3223.
ther investigations will be made in developing high fidelity [21] A. C. de Oliveira and A. Ramezani, “Thruster-assisted center man-
models that incorporate realistic thruster dynamics which ifold shaping in bipedal legged locomotion,” in 2020 IEEE/ASME
are more appropriate for the real-world implementation of International Conference on Advanced Intelligent Mechatronics (AIM).
IEEE, 2020, pp. 508–513.
closed-loop feedback on Harpy. [22] P. Dangol and A. Ramezani, “Towards thruster-assisted bipedal loco-
motion for enhanced efficiency and robustness,” International Feder-
R EFERENCES ation of Automatic Control (IFAC), 2020.
[23] A. Hereid, E. A. Cousineau, C. M. Hubicki, and A. D. Ames,
[1] M. H. Raibert, H. B. Brown Jr, and M. Chepponis, “Experiments in “3d dynamic walking with underactuated humanoid robots: A direct
balance with a 3d one-legged hopping machine,” The International collocation framework for optimizing hybrid zero dynamics,” in 2016
Journal of Robotics Research, vol. 3, no. 2, pp. 75–92, 1984. IEEE International Conference on Robotics and Automation (ICRA),
[2] M. Raibert, K. Blankespoor, G. Nelson, and R. Playter, “Bigdog, the 2016, pp. 1447–1454.
rough-terrain quadruped robot,” IFAC Proceedings Volumes, vol. 41, [24] L. Sentis and O. Khatib, “A whole-body control framework for
no. 2, pp. 10 822–10 825, 2008. humanoids operating in human environments,” in Proceedings 2006IEEE International Conference on Robotics and Automation, 2006.
ICRA 2006., 2006, pp. 2641–2648.
[25] W. B. Powell, Approximate Dynamic Programming: Solving the curses
of dimensionality. John Wiley & Sons, 2007, vol. 703.
[26] R. S. Sutton and A. G. Barto, Reinforcement learning: An introduction.
MIT press, 2018.
[27] M. Rafieisakhaei, S. Chakravorty, and P. Kumar, “A near-optimal
decoupling principle for nonlinear stochastic systems arising in robotic
path planning and control,” in 2017 IEEE 56th Annual Conference on
Decision and Control (CDC), 2017, pp. 1–6.
[28] E. D. Sontag, “A lyapunov-like characterization of asymptotic con-
trollability,” SIAM journal on control and optimization, vol. 21, no. 3,
pp. 462–471, 1983.
[29] P. V. Kokotovic, M. Krstic, and I. Kanellakopoulos, “Backstepping to
passivity: recursive design of adaptive systems,” in IEEE Conference
on Decision and Control, vol. 4, 12 1992, pp. 3276–3280.
[30] S. P. Bhat and D. S. Bernstein, “Continuous finite-time stabilization of
the translational and rotational double integrators,” IEEE Transactions
on Automatic Control, vol. 43, no. 5, pp. 678–682, 1998.
[31] E. Garone and M. M. Nicotra, “Explicit reference governor for con-
strained nonlinear systems,” IEEE Transactions on Automatic Control,
vol. 61, no. 5, pp. 1379–1384, 2015.
[32] E. G. Gilbert, I. Kolmanovsky, and Kok Tin Tan, “Nonlinear control
of discrete-time linear systems with state and control constraints:
a reference governor with global convergence properties,” in IEEE
Conference on Decision and Control, vol. 1, 12 1994, pp. 144–149.
[33] A. Bemporad, “Reference governor for constrained nonlinear systems,”
IEEE Trans. on Automatic Control, vol. 43, no. 3, pp. 415–419, 1998.
[34] E. Gilbert and I. Kolmanovsky, “Nonlinear tracking control in the
presence of state and control constraints: a generalized reference
governor,” Automatica, vol. 38, no. 12, pp. 2063–2073, 2002.
[35] H. Khalil, Nonlinear Systems. Pearson Education, Prentice Hall, 2002.You can also read