Rough-Terrain Locomotion and Unilateral Contact Force Regulations With a Multi-Modal Legged Robot
←
→
Page content transcription
If your browser does not render page correctly, please read the page content below
Rough-Terrain Locomotion and Unilateral Contact Force Regulations With a Multi-Modal Legged Robot Kaier Liang, Eric Sihite, Pravin Dangol, Andrew Lessieur, and Alireza Ramezani1 ut Thruster Forces Abstract— Despite many accomplishments by legged robot designers, state-of-the-art bipedal robots are prone to falling over, cannot negotiate extremely rough terrains and cannot Thruster Actuator directly regulate unilateral contact forces. Our objective is to integrate merits of legged and aerial robots in a single platform. arXiv:2103.15952v1 [cs.RO] 29 Mar 2021 We will show that the thrusters in a bipedal legged robot called Thruster Harpy can be leveraged to stabilize the robot’s frontal dynamics and permit jumping over large obstacles which is an unusual Hip Frontal capability not reported before. In addition, we will capitalize on Actuator the thrusters action in Harpy and will show that one can avoid using costly optimization-based schemes by directly regulating contact forces using an Reference Governor (RGs). We will Hip Sagi al resolve gait parameters and re-plan them during gait cycles by Actuator only assuming well-tuned supervisory controllers. Then, we will focus on RG-based fine-tuning of the joints desired trajectories Knee Actuator to satisfy unilateral contact force constraints. ug Ground Reaction I. I NTRODUCTION Force Raibert’s hopping robots [1] and Boston Dynamics’ robots [2] are amongst the most successful examples of legged robots, as they can hop or trot robustly even in the presence of significant unplanned disturbances. Other than these suc- Fig. 1. Illustration of a concept design for Harpy, a thruster-assisted bipedal cessful examples, a large number of underactuated and fully robot designed by the authors to study robust, efficient and agile legged actuated bipedal robots have also been introduced. Agility robotics. Robotics’ Cassie [3], Honda’s ASIMO [4] and Samsung’s Mahru III [5] are capable of walking, running, dancing and very limited in that, for example, the height of terrain bumps going up and down stairs, and the Yobotics-IHMC [6] biped should not exceed the size of the legs [7], [8], [9]. However, can recover from pushes. Despite these accomplishments, a legged robot maintains a superior energetic efficiency of all of these systems are prone to falling over and cannot locomotion because its overall body weight is supported by negotiate extremely rough terrains. Even humans, known the legs, can safely operate inside buildings and has no sharp, for their natural, dynamic and robust gaits cannot recover rotating blades to cause severe laceration injuries to humans. from severe rough terrain perturbations, external pushes Thruster-assisted legged locomotion has not been explored or slippage on icy surfaces. Our goal is to enhance the previously except to a limited extent in a few examples robustness of these systems through a distributed array of that only considered the hardware-related challenges [10]. thrusters and nonlinear control. These robots potentially can offer rich and challenging dy- In this paper, we will report our efforts in designing namics and control problems. The overactuation and control closed-loop feedback for the thruster-assisted walking of a allocation problems led by the coexistence of thrusters and legged system called Harpy (shown in Fig. 1), currently its joint actuators not only can provide opportunities to study hardware being developed at Northeastern University. This interesting control ideas, but also, from a dynamical behavior biped is equipped with a total of eight actuators, and a pair standpoint, can permit studying unexplored behaviors such of coaxial thrusters fixed to its torso. Our motivation stems as walking under buoyancy phenomena [11]. Also, studying from the merits of aerial and legged systems and we intend multi-modal systems that can switch from one mode to to integrate these merits in a single platform. Contrary to another in order to overcome the demanding mobility objec- fixed- or rotary-wing aerial systems, legged robots cannot tives in unstructured environments is a rather new research exhibit a fast mobility and fly over obstacles. A legged problem and potentially can result in interesting machine- robot’s capability to negotiate extremely bumpy terrains (e.g., learning and optimization-based motion planning problems semi-collapsed buildings in the aftermath of an earthquake) is [12]. 1 SiliconSynapse Laboratory, ECE Department, Northeastern Univer- From a feedback design standpoint, the challenge of simul- sity, Boston, MA, USA. emails: {liang.k, e.sihite, dangol.p, lessieur.a, taneously providing asymptotic stability and gait feasibility a.ramezani} @northeastern.edu constraints satisfaction in legged systems have been exten-
sively addressed [13]. For instance, the method of Hybrid Zero Dynamics (HZD) has provided a rigorous model-based approach to assign attributes such as efficiency of locomotion in an off-line fashion. Other attempts entail optimization- based, nonlinear approaches to secure safety and perfor- mance of legged locomotion [14], [15], [16], [17], [18], [19]. Thrusters can result in unparalleled capabilities. For in- stance, gait trajectory planning (or re-planning), control and unilateral contact force regulation can be treated significantly differently as we have shown previously [20], [21], [22] and will further discuss new details in this paper. That said, real-time gait trajectory design in legged robots has been widely studied and the application of optimization- based methods is very common [23]. In general, in these paradigms, an optimization-based controller adjusts the gait parameters throughout the whole gait cycle such that not only the robot’s posture is adjusted to accommodate the unplanned posture adjustments but also the joints position, velocity and Fig. 2. Shows the joint movements, key positions (pi ) and dimensions (li ) acceleration are modified to avoid slipping into infeasible of Harpy. The non-conservative forces and torques acting on the system are scenarios, e.g., the violation of contact forces. What makes denoted by ui . these methods further cumbersome is that they are widely defined based on Whole Body Control (WBC) which can This paper is outlined as follows: the dynamic modeling lead to computationally expensive algorithms [24]. for Harpy and the reduced-order models which will be used These problems are widely known to suffer from curse of in the numerical simulation and controller design, the discus- dimensionality and other popular paradigms such as Approx- sion on thruster assisted locomotion, numerical simulation imate Dynamic Programming (ADP) [25], Reinforcement discussions, and then followed by the concluding remarks. Learning (RL) [26], decoupled approaches to design control for nonlinear stochastic systems [27], etc., can potentially II. DYNAMIC M ODELING OF H ARPY remedy the challenges. However, these approaches are far This section outlines the dynamics formulation of the robot from providing any practical solutions to the problem in hand which is used in the numerical simulation in Section IV, in and they are shown to be only effective on simpler practical addition to the reduced order models which are used in the robots mainly those that can only demonstrate quasi-static controller design. Fig. 2 shows the kinematic configuration gaits. of Harpy which listed the center of mass (CoM) positions We will capitalize on the thrusters action in Harpy and of the dynamic components, joint actuation torques, and will show that one can limit the use of costly optimization- thruster torques. The system model has a combined total of based schemes by directly regulating contact forces. We 12 degrees-of-freedoms (DoFs): 6 for the body and 3 on each will resolve gait parameters and re-plan them during the leg. Due to the symmetry, the left and right side of the robot whole Single Support (SS) phase, which is the longest phase follow a similar derivations so only the general derivations in a gait cycle, by only assuming well-tuned supervisory are provided in this section. controllers found in [28], [29], [30] and by focusing on fine-tuning the joints desired trajectories to satisfy unilateral A. Euler-Lagrange Formalism contact force constraints. To do this, we will devise inter- The Harpy equations of motion are derived using Euler- mediary filters based on the celebrated idea of Explicit Ref- Lagrangian dynamics formulation. In order to simplify the erence Governors (ERG) [31], [32], [33], [34]. ERGs relied system, each linkages are assumed to be massless and the on provable Lyapunov stability properties can perform the mass are concentrated at the body and the joint motors. motion planning problem in the state space in a much faster Consequently, the lower leg kinematic chain is considered to way than widely used optimization-based methods. That said, be massless which significantly simplifies the system. The these ERG-based gait modifications and impact events (i.e., three leg joints are labeled as the hip frontal (pelvis P ), hip impulsive effects) can lead to severe deviations from the sagittal (hip H) and knee sagittal (knee K), as illustrated in desired periodic orbits and standard legged robots cannot Fig. 2. The thrusters are also considered to be massless and sustain these perturbations. Previously, we demonstrated that capable of providing forces in any directions to simplify the the thrusters can be leveraged to enforce hybrid invariance in problem. a robust fashion by applying predictive schemes within the Let γh be the frontal hip angle while φh and φk be the Double Support (DS) phase [22]. Last, we also will show sagittal hip and knee angles respectively. Let the superscript that thrusters can be leveraged to stabilize frontal dynamics {B, P, H, K} represent the frame of reference about the and permit jumping over large obstacles which is an unusual body, pelvis, hip, and knee while the inertial frame is repre- capability not reported before. sented without the superscript. Let RB be the rotation matrix
from the body frame to the inertial frame (i.e. x = RB xB ). ug is the ground reaction forces (GRFs). The variables M , The pelvis motor mass is added to the body mass. Then h, Bt , and Bg are a function of the full system states: the positions of the hip and knee CoM are defined using > x = [rB , q > , φKL , φKR , ωB B> , q̇ > , φ̇KL , φ̇KR ]> , (6) kinematic equations: where the vector rB contains the elements of RB . Using pP = pB + RB l1B , pH = pP + RB Rx (γh ) l2P (1) Bj = [06×6 , I6×6 ] allows uj to actuate the joint angles pK = pH + RB Rx (γh ) Ry (φh )l3H , directly. Let v = [ωB B> , q̇ > ]> be the velocity of the gen- where Rx and Ry are the rotation matrices about the x and eralized coordinates, then Bt and Bg can be defined using y axis respectively, l is the length vectors representing the the virtual displacement from the velocity as follows: conformation of Harpy which are constant in their respective > > ∂ ṗTL /∂v ∂ ṗFL /∂v local frame of reference. The foot and thruster positions are Bt = ∂ ṗTR /∂v , Bg = ∂ ṗFR /∂v . (7) defined as: 02×6 02×6 pF = pK + RB Rx (γh ) Ry (φh ) Ry (φk ) l4K (2) The vector ut = [u> > > tL , utR ] is formed from the left and pT = pB + RB ltB thruster forces utL and utR , respectively. The GRF is modeled using the unilateral compliant ground where the length vector from the knee to the foot is l4K = model with undamped rebound while the friction is modeled [−l4a cos φk , 0, −(l4b + l4a sin φk )]> which is the kinematic using the Stribeck friction model, defined as follows: solution to the parallel linkage mechanism of the lower leg. Let ωB be the angular velocity of the body. Then ug,z = − kg,p pF,z − kg,d ṗF,z the angular velocities of the hip and knee are defined as: |ṗF,x |2 ωHB = [γ̇h , 0, 0]> + ωB B and ωK H = [0, φ̇h , 0]> + ωH H . ug,x = − µc + (µs − µc ) exp − fz sgn(ṗF,x ) vs2 Finally, the energy of the system for the Lagrangian − µv ṗF,x , dynamics formulation are defined as follows: (8) 1P > i> ˆ i where pF,x and pF,z are the x and z components of the K= m i p i p i + ω i I i ω i inertial foot position, kg,p and kg,d are the spring and 2 Pi∈F (3) > > damping model for the ground, µc , µs , and µv are the V = − i∈F mi pi [0, 0, −g] , Coulomb, static, and viscous friction coefficient respectively, where F = {B, HL , KL , HR , KR } are the relevant frame and vs is the Stribeck velocity. kg,d = 0 if ṗF,z > 0 for the of references and mass components (body, hip and knee of undamped rebound model and the friction in the y direction each side), and the subscripts L and R represent the left follows a similar derivation to ug,x . Then the ground force and right side of the robot. Furthermore, Iˆi is the inertia model ug is defined as follows: about its local frame, and g is the gravitational constant. This forms the Lagrangian of the system L = K − V which ug = [u> > > gL H(−pFL ,z ), ugR H(−pFR ,z )] , (9) is used to derive the system’s Euler-Lagrangian equations of where H(x) is the heaviside function, while ugL and ugR motion. The dynamics of the body angular velocity is derived are the left and right ground forces which are formed using using the modified Lagrangian for rotation in SO(3) to avoid their respective ug,x , ug,y , and ug,z . Finally, the full system using Euler angles and the potential gimbal lock associated equation of motion can be derived using (4) to (9) to form with them. This results in the following equations of motion ẋ = f (x, uj , ut , ug ). following Hamilton’s principle of least action: B. Reduced-Order Models d ∂L B ∂L P3 ∂L B + ωB × B + j=1 rBj × = u1 The following reduced-order models are used: a variable dt ∂ωB ∂ωB ∂rBj length inverted pendulum (VLIP) model and a two-body d ∂L ∂L d B pendulum model, as illustrated in Fig. 3. The VLIP model is − = u2 , RB = RB [ωB ]× , dt ∂ q̇ ∂q dt used to describe walking with an ERG, while the two-body (4) pendulum model is used to describe the flight phase with > where [ · ]× is the skew operator, RB = [rB1 , rB2 , rB3 ], a Model Predictive Control (MPC) to track trajectory and q = [p> B , γhL , γhR , φhL , φhR ] > is the dynamical system regulate the appropriate leg postures at the time of landing. B states other than (RB , ωB ), and u is the generalized forces. 1) Variable-Length Inverted Pendulum (VLIP) Model: The knee sagittal angle φk which is not associated with As shown in Fig. 3a, the model is described simply using any mass is updated using the knee joint acceleration input the inverted pendulum model where the length of r can be uk = [φ̈kL , φ̈kR ]> . Then the system acceleration can be adjusted through the change in leg conformation. The center derived as follows: of pressure, c, is defined as the weighted average position of M a + h = Bj uj + Bt ut + Bg ug (5) the foot, c = λL pFL + λR pFR , where λi = ugi ,z /(ugL ,z + ugR ,z ), i ∈ {L, R}. Harpy is modeled using a point foot, so where a = [ω̇BB> , q̈ > , φ̈kL , φ̈kR ]> , ut is the thruster force, c is equal to the stance foot position during the SS phase. uj = [uPL , uPR , uHL , uHR , u> > k ] is the joint actuation, and The VLIP model without thrusters is underactuated, but the
addition of thrusters makes the system fully actuated and enables it to do trajectory tracking. pB Jsλ The dynamic model is derived as follows: pB mp̈B = mg + ut,c + Js> λ (10) ut pH θ r mg where m is the mass of the pendulum which in this case is the total mass of the system, and ut,c is the thruster forces q pK about the body CoM. The constraint force Js> λ is setup to keep the leg length r equal to the leg conformation using the c m1 g following constraint equation: m2 g (a) (b) Js (p̈B − c̈) = ur , Js = (pB − c)> , (11) Fig. 3. Illustrates two reduced-order models, a variable-length inverted which is designed to keep the leg length second derivative pendulum (VLIP) and a two-body pendulum model, which are used to describe the dominant dynamic response of the system during the walking equal to ur . This constraint force also forms the GRF as and ballistic motions, respectively. long as the friction cone constraint is satisfied. Assuming no slip (c̈ = 0), then the inputs to the system are ur which controls the body position about the vector r = pB − c by parameters P0 , P2 , and P4 define the initial, middle, and adjusting the leg length, and the thrusters ut which controls final positions of the gait. Each swing and stance feet-end the remaining DoFs. have a constant Bezier curve parameters to form the 2D 2) Two-body Pendulum Model: As shown in Fig. 3b, the open-loop walking gait. The following parameters defined model is described as a planar double pendulum in the x- in the x-z plane are used: P0,sw = [−0.21, −0.60]> m , z plane where the mass is concentrated in the body CoM P2,sw = [−0.20, −0.50]> m, and P4,sw = [0.10, −0.60]> m and knee, m1 and m2 respectively. Here, the mass of the for the swing foot, and P0,st = [0.10, −0.60]> m , P2,st = pelvis motors are combined into the m1 while the mass [0.01, −0.63]> m, and P4,st = [−0.21, −0.60]> m for the of both hips are combined into m2 to ensure a similar stance foot. These Bezier parameters define the feet positions kinematics behavior. The control action is defined as udp = and then the joint angles are found simply by resolving [ux , uz , uh ]> where ux and uz are the sum of thruster the corresponding inverse kinematics problem. Finally, these forces in the x and z directions, respectively, and uh is the joint trajectories are tracked using an asymptotically stable sagittal hip motor torque. The model uses the following states controller [35]. qdp = [pB,x , pB,z , q, θ]> , where q is the hip joint angle The thrusters are used to stabilize the roll and yaw motion and θ is the body absolute pitch angle. Then the system of the robot using the following controller acceleration can be derived as follows: utL ,F = [uyaw , 0, uroll ]> , utR ,F = −utL ,F , (13) Mdp q̈dp + hdp = Bdp udp , (12) where utL ,F and utR ,F are the left and right thruster force where Bdp = [I3×3 , 01×3 ], allows the direct control signal components for frontal dynamics stabilization, while uroll to the system states except θ. and uyaw are simple PD controllers to track zero roll and yaw reference angles. This controller is sufficient to stabilize the III. C ONTROL OF T HRUSTER -A SSISTED L OCOMOTION frontal dynamics and the robot’s heading even when using This section discusses and outlines the applications of a gait designed for a 2D bipedal robot. During walking, the thrusters in our robot, particularly to stabilizes the frontal combined thruster forces are formed by combining ut,c in dynamics, apply the ERG framework to regulate GRFs, and (10) and (13), as follows to perform ballistic motions to avoid obstacles. ut = [u> > > > > > t,c , ut,c ] /2 + [utL ,F , utR ,F ] . (14) A. Frontal Dynamics Stabilization In order to show the application of thruster-assisted walk- B. Thruster-Assisted Enforcement of GRF Constraints Using ing in our robot, we use a walking gait designed for a planar RG-based Methods bipedal robot. The absence of the frontal dynamics means The ERG framework is utilized to enforce the friction that this gait is unstable for 3D walking. Hence the thrusters pyramid constraint of the robot by manipulating the applied are used to stabilize the robot’s roll and yaw to achieve a reference to the controller, which is useful to avoid using stable walking gait in the full 3D system. optimization or nonlinear MPC framework to enforce con- The gait is designed by constraining the system dynamics straints on the harder-to-model GRFs. The VLIP model in in the x-z plane using 4th order Bezier polynomials to (10) can be fully actuated due to the addition of thrusters, define the feet-end positions relative to the hip in the robot which enables us to utilize a more advanced controller such body frame (pB B F − pP ). Let Pi , i ∈ {0, . . . , 4} be the as ERG. Bezier polynomial parameters. This 4th order polynomial The ERG manipulates the applied reference (xw ) to avoid can be constrained to have zero initial and final gait ve- violating the constraint equation hw (x, xw ) ≥ 0 while locity by setting P0 = P1 and P3 = P4 . Then the free also be as close as possible to the desired reference (xr ),
as illustrated in Fig. 4. Consider the Lyapunov equation hw < 0 V V = (xr − xw )> P (xr − xw ). xw is updated through the hr < 0 min update law: ẋw = vr + vt + vn , (15) vn xr where vr drives xw directly to xr , while vt and vn drives xw vr V=0 along the surface and into the constraint equation boundary xw vt hw = 0, respectively. The objective of this ERG algorithm is xw,t hw = 0 to drive xw to the state xw,t which is the minimum energy hw > 0 solution Vmin that satisfies the constraint hw ≥ 0. Let hr (x, xr ) = Jr xr + dr ≥ 0 be the constraint Fig. 4. Shows how an Explicit Reference Governor (ERG) can manipulate equation using the desired reference xr , and Cr be the the applied reference states xw to be as close as possible to the desired reference xr while the constraint equations hw ≥ 0 remain feasible. rowspace of the violated constraints of hr . Define Nr = null(Cr ) = [n1 , . . . , nn ] where n is the size of the nullspace. Additionally, let rk be the k’th row of Jr . Then the following is aligned about the body CoM sagittal axis. Therefore, the update law is used for vr , vt , and vn : reduced-order model in (5) is used to model the robot during the ballistic motion which is represented as a two-body vr = α̂r (xr − xw ), vn = α̂n rk rk> (xr − xw ) Pn (16) pendulum system. Then an MPC framework is developed vt = k=1 α̂t nk n> k (xr − xw ), based on this model to regulate the feet-end positions and where α̂ are scalars defined as follows: ensuring the proper landing configuration. ( The hip frontal and knee angles are setup to be constant αr , if min(hw ) ≥ 0 or min(hr ) ≥ 0 during the ballistic maneuver and the feet are positioned in a α̂r = 0, else neutral stance configuration within which legs are parallel to ( each other. An MPC with a prediction and control horizons αt , if min(hw ) ≥ 0 or min(hr ) < 0 α̂t = Nh = 10 is developed using the following optimizer 0, else (17) PNh k> min k=0 (e We ek + ∆uk> k dp Wu ∆udp )∆t αn , if min(hw ) ≤ min(hr ) < 0 udp α̂n = −αn , if min(hr ) < min(hw ) < 0 subject to ẋdp = f (xdp , udp ) (19) k+1 k 0, else, qref − θ + qc = 0 where α is a positive scalar which determines the rate of |udp | − udp,max ≤ 0, convergence. where e = [p> > > B −pB,ref , q −qref ] is the tracking error, We Assuming ẋr = 0 and using the update law defined from and Wu are the cost weighting matrices, and qc is a constant (16) and (17), we willP have V̇ = −2(xr −xw )> Q (xr −xw ), angle to determine the desired landing posture. qref denotes n with Q = P (α̂r I + k=1 α̂t nk n> > k + α̂n rk rk ). We have the reference hip sagittal angle which is updated each time the gradient V̇ = 0 if min(hw ) ≤ 0 and nk ⊥(xr − xw ), step relative to the body pitch angle θ and qc . This reference while V̇ < 0 when min(hr ) ≥ 0 or when min(hw ) ≥ 0. In can ensure the legs do not lag behind the body at the landing case both applied reference and target constraints equation moment, and qc can be properly adjusted to achieve the are violated, we have V̇ > 0 which drives the xw towards the desired landing posture. The references pB,ref are designed constraint boundary. This allows the xw to converge to xw,t as a ballistic motion for the body CoM as follows which is the minimum energy solution that satisfies hw ≥ 0 > as illustrated in Fig. 4. pk+1 k B,ref = pB,ref + ∆t [a, b sin(2πt/1.5)] (20) The GRF constraints for this robot can be derived from where pB denotes the body position in the x-z plane, while a the reduced-order model constraint equation in (11) by ap- and b are some constants, and ∆t is the controller time step. plying the ground pyramid constraint. We use the following Finally, the resulting udp = [ux , uz , uh ] is fed to the full constraints for the ERG: model, where the thruster forces ux and uz are combined |ug,x |≤ µs ug,z , |ug,y |≤ µs ug,z , ug,z ≥ uz,min , (18) with the roll and yaw stabilization controller in (13). This forms the combined thruster forces where [ug,x , ug,y , ug,z ]> = Js> λ is the ground reaction force model from (10) and (11). This forms the ground friction ut = [ux , 0, uz , ux , 0, uz ]> /2 + [u> > > t,l , ut,r ] , (21) pyramid constraint and the minimum ground normal force which tracks the flight trajectory and stabilizes the robot’s acting on the leg to ensure that the foot doesn’t slip. roll and heading angles. C. MPC Design and Control of Robot’s Ballistic Motion IV. S IMULATION R ESULTS The thrusters can also be used to traverse rough terrain by The optimization and numerical simulation are done in simply flying over obstacles. However, it is not possible to Matlab where we used interior-point algorithm and RK4 stabilize pitch dynamics due to the thrusters position which scheme, respectively. The MPC and the ERG filter are
(a) (b) (c) Fig. 5. Shows the simulation result for the walking and jumping maneuvers. (a) Illustrates the stick-diagram plot when the robot walks on flat ground and jumps over obstacles. (b) Depicts the states evolution, the thrusters action and ground contact forces. Note that the thrusters are employed to stabilize the frontal dynamics. The RG algorithm is disabled during the ballistic motion (grey area). (c) Close-up view of the constraints at t = [9.5, 11.5]s where the applied references (xw ) are manipulated such that the constraints (hw (x, xw ) ≥ 0) remain feasible. simulated using a zero-order hold at a frequency 20 times and kg,d = 268. The frequency of MPC controller applied smaller than the numerical integration to better capture the in the two-body pendulum model is 100 Hz with a 10 behavior of the onboard computer in Harpy. steps prediction and control horizons. The entire simulation contains 16 gait cycles (12 s) using the following sequence: A. Simulation Specifications 3 walking steps, 1 quiet standing cycle, 2 jumping cycles, 4 In this section, all units are in N, kg, m, s. The robot walking steps, 1 quiet standing cycle, 2 jumping cycles, and has the following dimensions: l1 = [0, 0.1, −0.1]> , l2 = 3 walking steps. The weighting matrices for the MPC are [0, 0.5, 0]> , l3 = [0, 0, −0.3]> , and l4 = [0, 0.1, 0]> for the We = diag(5, 20), Wu = diag(1/10, 1/10, 1/5), qc = 23◦ , left side. The right side simply has the y axis component and the parameters for the jumping trajectories are a = 0.9, inverted. This gives the robot CoM height of around 0.6 m b = 0.8 for the first jumping cycle, and a = 1.2, b = 0.3 for for standing and walking. The following mass and inertias the second jump. The ERG is applied using α = 5 which are used: mB = 2, mH = mK = 0.5, IB = 10−3 , provides a sufficient convergence rate. IH = IK = 10−4 . Finally, the following ground parameters are used: µs = 0.6, µc = 0.54, µv = 0.85, kg,p = 8000,
B. Simulation Results and Discussions [3] Y. Gong, R. Hartley, X. Da, A. Hereid, O. Harib, J.-K. Huang, and J. Grizzle, “Feedback control of a cassie bipedal robot: Walking, The simulation results can be seen in Fig. 5, where Fig. 5a standing, and riding a segway,” in 2019 American Control Conference shows the key frames of the simulated robot trajectory where (ACC), 2019, pp. 4559–4566. [4] M. Hirose and K. Ogawa, “Honda humanoid robots development,” it walks and jumps over obstacles. Figure 5b shows the data Philosophical Transactions of the Royal Society A: Mathematical, of the thruster forces, ground normal forces, and body states Physical and Engineering Sciences, vol. 365, no. 1850, pp. 11–19, during the simulation. The walking gait designed for a 2D 2006. bipedal robot is stable when used in the full 3D system [5] W. Kwon et al., “Biped humanoid robot mahru iii,” in IEEE-RAS International Conf. on Humanoid Robots. IEEE, 2007, pp. 583–588. as the pitch and yaw angles are stabilized by the thruster [6] J. E. Pratt et al., “The yobotics-ihmc lower body humanoid robot,” in actions. Additionally, the trajectory of the body positions IEEE/RSJ International Conference on Intelligent Robots and Systems, match closely towards the desired trajectories throughout 10 2009, pp. 410–411. [7] H.-W. Park, A. Ramezani, and J. W. Grizzle, “A finite-state ma- the walking and flight phases. The robot’s pitch angle is chine for accommodating unexpected large ground-height variations uncontrollable as shown in Fig. 5b throughout the entire in bipedal robot walking,” IEEE Transactions on Robotics, vol. 29, simulation. However, the MPC has successfully regulated no. 2, pp. 331–345, 2012. [8] B. G. Buss, A. Ramezani, K. A. Hamed, B. A. Griffin, K. S. Galloway, the foot landing positions such that the foot is positioned and J. W. Grizzle, “Preliminary walking experiments with under- below or in front of the body at the time of landing through actuated 3d bipedal robot marlo,” in 2014 IEEE/RSJ International the appropriate hip sagittal angles, as shown in Fig. 5a. Conference on Intelligent Robots and Systems, 2014, pp. 2529–2536. This allows a smooth transition between landing and walking [9] H.-W. Park, K. Sreenath, A. Ramezani, and J. W. Grizzle, “Switching control design for accommodating large step-down disturbances in which is the primary objective of using the MPC. bipedal robot walking,” in 2012 IEEE International Conference on The ERG is used to regulate the ground friction forces Robotics and Automation, 2012, pp. 45–50. by manipulating the applied state references during walking [10] G. Picardi, H. Hauser, C. Laschi, and M. Calisti, “Morphologically induced stability on an underwater legged robot with a deformable to prevent slips. Figure 5b shows the constraint equation body,” The International Journal of Robotics Research, 2019. values using the applied and target reference (hw and hr [11] Y. Kojio et al., “Walking control in water considering reaction forces respectively), where the target reference trajectories satisfy from water for humanoid robots with a waterproof suit,” in 2016 IEEE/RSJ International Conference on Intelligent Robots and Systems the constraints except at around the time range of t = (IROS), 2016, pp. 658–665. [10, 12]s as shown in Fig. 5c. Within this time range, the [12] B. Araki, J. Strang, S. Pohorecky, C. Qiu, T. Naegeli, and D. Rus, desired reference trajectory results in min(hr ) < 0 and the “Multi-robot path planning for a swarm of robots that can both fly and drive,” in 2017 IEEE International Conference on Robotics and ERG has successfully manipulate the applied reference xw Automation (ICRA), 2017, pp. 5575–5582. such that the constraint equation hw ≥ 0 is satisfied. The [13] E. Westervelt and J. Grizzle, Feedback Control of Dynamic Bipedal control output of the ERG is disabled during the jumping Robot Locomotion, ser. Control and Automation Series. CRC PressINC, 2007. period (shaded gray in Fig. 5b) as the thruster components [14] K. Galloway, K. Sreenath, A. D. Ames, and J. W. Grizzle, “Torque sat- for position stabilization is handled by the MPC. uration in bipedal robotic walking through control lyapunov function- based quadratic programs,” IEEE Access, vol. 3, pp. 323–332, 2015. V. C ONCLUSIONS AND F UTURE W ORK [15] H. Dai and R. Tedrake, “Planning robust walking motion on uneven terrain via convex optimization,” in IEEE-RAS International Confer- The concept design and dynamics simulation of a thruster- ence on Humanoid Robots (Humanoids), 11 2016, pp. 579–586. assisted bipedal robot called Harpy is presented in this paper. [16] S. Feng, E. Whitman, X. Xinjilefu, and C. G. Atkeson, “Optimization based full body control for the atlas robot,” in IEEE-RAS International The addition of the thrusters allows the robot to stabilize its Conference on Humanoid Robots, 11 2014, pp. 120–127. frontal dynamics easily and be able to regulate the ground [17] A. Ramezani, J. W. Hurst, K. Akbari Hamed, and J. W. Grizzle, “Per- contact forces directly. To do this, a control algorithm based formance analysis and feedback control of atrias, a three-dimensional bipedal robot,” Journal of Dynamic Systems, Measurement, and Con- on Reference Governors (RGs) was proposed. In addition, trol, vol. 136, no. 2, 2014. the thrusters allowed multimodal locomotion where the robot [18] A. Ramezani, “Feedback control design for marlo, a 3d-bipedal robot.” can transition between walking and jumping over obstacles. Ph.D. dissertation, 2013. [19] J. Grizzle, A. Ramezani, B. Buss, B. Griffin, K. A. Hamed, and Through simulations we showed that the integration of the K. Galloway, “Progress on controlling marlo, an atrias-series 3d thrusters can yield a dynamically robust biped. Future work underactuated bipedal robot,” Dynamic Walking, 2013. includes the completion of Harpy. In addition, we will [20] P. Dangol, A. Ramezani, and N. Jalili, “Performance satisfaction in midget, a thruster-assisted bipedal robot,” in 2020 American Control explore the cost of transport in multi-modal systems. Fur- Conference (ACC), 2020, pp. 3217–3223. ther investigations will be made in developing high fidelity [21] A. C. de Oliveira and A. Ramezani, “Thruster-assisted center man- models that incorporate realistic thruster dynamics which ifold shaping in bipedal legged locomotion,” in 2020 IEEE/ASME are more appropriate for the real-world implementation of International Conference on Advanced Intelligent Mechatronics (AIM). IEEE, 2020, pp. 508–513. closed-loop feedback on Harpy. [22] P. Dangol and A. Ramezani, “Towards thruster-assisted bipedal loco- motion for enhanced efficiency and robustness,” International Feder- R EFERENCES ation of Automatic Control (IFAC), 2020. [23] A. Hereid, E. A. Cousineau, C. M. Hubicki, and A. D. Ames, [1] M. H. Raibert, H. B. Brown Jr, and M. Chepponis, “Experiments in “3d dynamic walking with underactuated humanoid robots: A direct balance with a 3d one-legged hopping machine,” The International collocation framework for optimizing hybrid zero dynamics,” in 2016 Journal of Robotics Research, vol. 3, no. 2, pp. 75–92, 1984. IEEE International Conference on Robotics and Automation (ICRA), [2] M. Raibert, K. Blankespoor, G. Nelson, and R. Playter, “Bigdog, the 2016, pp. 1447–1454. rough-terrain quadruped robot,” IFAC Proceedings Volumes, vol. 41, [24] L. Sentis and O. Khatib, “A whole-body control framework for no. 2, pp. 10 822–10 825, 2008. humanoids operating in human environments,” in Proceedings 2006
IEEE International Conference on Robotics and Automation, 2006. ICRA 2006., 2006, pp. 2641–2648. [25] W. B. Powell, Approximate Dynamic Programming: Solving the curses of dimensionality. John Wiley & Sons, 2007, vol. 703. [26] R. S. Sutton and A. G. Barto, Reinforcement learning: An introduction. MIT press, 2018. [27] M. Rafieisakhaei, S. Chakravorty, and P. Kumar, “A near-optimal decoupling principle for nonlinear stochastic systems arising in robotic path planning and control,” in 2017 IEEE 56th Annual Conference on Decision and Control (CDC), 2017, pp. 1–6. [28] E. D. Sontag, “A lyapunov-like characterization of asymptotic con- trollability,” SIAM journal on control and optimization, vol. 21, no. 3, pp. 462–471, 1983. [29] P. V. Kokotovic, M. Krstic, and I. Kanellakopoulos, “Backstepping to passivity: recursive design of adaptive systems,” in IEEE Conference on Decision and Control, vol. 4, 12 1992, pp. 3276–3280. [30] S. P. Bhat and D. S. Bernstein, “Continuous finite-time stabilization of the translational and rotational double integrators,” IEEE Transactions on Automatic Control, vol. 43, no. 5, pp. 678–682, 1998. [31] E. Garone and M. M. Nicotra, “Explicit reference governor for con- strained nonlinear systems,” IEEE Transactions on Automatic Control, vol. 61, no. 5, pp. 1379–1384, 2015. [32] E. G. Gilbert, I. Kolmanovsky, and Kok Tin Tan, “Nonlinear control of discrete-time linear systems with state and control constraints: a reference governor with global convergence properties,” in IEEE Conference on Decision and Control, vol. 1, 12 1994, pp. 144–149. [33] A. Bemporad, “Reference governor for constrained nonlinear systems,” IEEE Trans. on Automatic Control, vol. 43, no. 3, pp. 415–419, 1998. [34] E. Gilbert and I. Kolmanovsky, “Nonlinear tracking control in the presence of state and control constraints: a generalized reference governor,” Automatica, vol. 38, no. 12, pp. 2063–2073, 2002. [35] H. Khalil, Nonlinear Systems. Pearson Education, Prentice Hall, 2002.
You can also read