forked from RussTedrake/underactuated
-
Notifications
You must be signed in to change notification settings - Fork 0
/
Copy pathlimit_cycles.html
546 lines (452 loc) · 29.1 KB
/
limit_cycles.html
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
124
125
126
127
128
129
130
131
132
133
134
135
136
137
138
139
140
141
142
143
144
145
146
147
148
149
150
151
152
153
154
155
156
157
158
159
160
161
162
163
164
165
166
167
168
169
170
171
172
173
174
175
176
177
178
179
180
181
182
183
184
185
186
187
188
189
190
191
192
193
194
195
196
197
198
199
200
201
202
203
204
205
206
207
208
209
210
211
212
213
214
215
216
217
218
219
220
221
222
223
224
225
226
227
228
229
230
231
232
233
234
235
236
237
238
239
240
241
242
243
244
245
246
247
248
249
250
251
252
253
254
255
256
257
258
259
260
261
262
263
264
265
266
267
268
269
270
271
272
273
274
275
276
277
278
279
280
281
282
283
284
285
286
287
288
289
290
291
292
293
294
295
296
297
298
299
300
301
302
303
304
305
306
307
308
309
310
311
312
313
314
315
316
317
318
319
320
321
322
323
324
325
326
327
328
329
330
331
332
333
334
335
336
337
338
339
340
341
342
343
344
345
346
347
348
349
350
351
352
353
354
355
356
357
358
359
360
361
362
363
364
365
366
367
368
369
370
371
372
373
374
375
376
377
378
379
380
381
382
383
384
385
386
387
388
389
390
391
392
393
394
395
396
397
398
399
400
401
402
403
404
405
406
407
408
409
410
411
412
413
414
415
416
417
418
419
420
421
422
423
424
425
426
427
428
429
430
431
432
433
434
435
436
437
438
439
440
441
442
443
444
445
446
447
448
449
450
451
452
453
454
455
456
457
458
459
460
461
462
463
464
465
466
467
468
469
470
471
472
473
474
475
476
477
478
479
480
481
482
483
484
485
486
487
488
489
490
491
492
493
494
495
496
497
498
499
500
501
502
503
504
505
506
507
508
509
510
511
512
513
514
515
516
517
518
519
520
521
522
523
524
525
526
527
528
529
530
531
532
533
534
535
536
537
538
539
540
541
542
543
544
545
546
<!DOCTYPE html>
<html>
<head>
<title>Ch. 16 - Algorithms
for Limit Cycles</title>
<meta name="Ch. 16 - Algorithms
for Limit Cycles" content="text/html; charset=utf-8;" />
<link rel="canonical" href="http://underactuated.mit.edu/limit_cycles.html" />
<script src="https://hypothes.is/embed.js" async></script>
<script type="text/javascript" src="chapters.js"></script>
<script type="text/javascript" src="htmlbook/book.js"></script>
<script src="htmlbook/mathjax-config.js" defer></script>
<script type="text/javascript" id="MathJax-script" defer
src="htmlbook/MathJax/es5/tex-chtml.js">
</script>
<script>window.MathJax || document.write('<script type="text/javascript" src="https://cdn.jsdelivr.net/npm/mathjax@3/es5/tex-chtml.js" defer><\/script>')</script>
<link rel="stylesheet" href="htmlbook/highlight/styles/default.css">
<script src="htmlbook/highlight/highlight.pack.js"></script> <!-- http://highlightjs.readthedocs.io/en/latest/css-classes-reference.html#language-names-and-aliases -->
<script>hljs.initHighlightingOnLoad();</script>
<link rel="stylesheet" type="text/css" href="htmlbook/book.css" />
</head>
<body onload="loadChapter('underactuated');">
<div data-type="titlepage">
<header>
<h1><a href="index.html" style="text-decoration:none;">Underactuated Robotics</a></h1>
<p data-type="subtitle">Algorithms for Walking, Running, Swimming, Flying, and Manipulation</p>
<p style="font-size: 18px;"><a href="http://people.csail.mit.edu/russt/">Russ Tedrake</a></p>
<p style="font-size: 14px; text-align: right;">
© Russ Tedrake, 2023<br/>
Last modified <span id="last_modified"></span>.</br>
<script>
var d = new Date(document.lastModified);
document.getElementById("last_modified").innerHTML = d.getFullYear() + "-" + (d.getMonth()+1) + "-" + d.getDate();</script>
<a href="misc.html">How to cite these notes, use annotations, and give feedback.</a><br/>
</p>
</header>
</div>
<p><b>Note:</b> These are working notes used for <a
href="https://underactuated.csail.mit.edu/Spring2023/">a course being taught
at MIT</a>. They will be updated throughout the Spring 2023 semester. <a
href="https://www.youtube.com/channel/UChfUOAhz7ynELF-s_1LPpWg">Lecture videos are available on YouTube</a>.</p>
<table style="width:100%;"><tr style="width:100%">
<td style="width:33%;text-align:left;"><a class="previous_chapter" href=output_feedback.html>Previous Chapter</a></td>
<td style="width:33%;text-align:center;"><a href=index.html>Table of contents</a></td>
<td style="width:33%;text-align:right;"><a class="next_chapter" href=contact.html>Next Chapter</a></td>
</tr></table>
<script type="text/javascript">document.write(notebook_header('limit_cycles'))
</script>
<!-- EVERYTHING ABOVE THIS LINE IS OVERWRITTEN BY THE INSTALL SCRIPT -->
<chapter style="counter-reset: chapter 15"><h1>Algorithms
for Limit Cycles</h1>
<p>The discussion of walking and running robots in Chapter 4 motivated the
notion of limit cycle stability. Linear systems are not capable of producing
stable limit cycle behavior, so this rich topic is unique to nonlinear systems
design and analysis. Furthermore, the tools that are required to design,
stabilize, and verify limit cycles will have applicability beyond simple
periodic motions.</p>
<p>The first natural question we must ask is, given a system $\dot{\bx} =
f(\bx)$, or a control system $\dot{x} = f(\bx,\bu)$, how do we go about
finding periodic solutions which may be passively stable, open-loop stable, or
stabilizable via closed-loop feedback? It turns out that the trajectory
optimization tools that we developed already are very well suited to this
task.</p>
<section id="trajopt"><h1>Trajectory optimization</h1>
<p>I introduced the trajectory optimization tools as means for optimizing a
control trajectory starting from a particular known initial condition. But
the fundamental idea of optimizing over individual trajectories of the
system is useful more broadly. Even for a passive system, we can formulate
the search for a periodic solution as an optimization over trajectories that
satisfy the dynamic constraints and <em>periodicity constraints</em>,
$\bx[0] = \bx[N]$: \begin{align*} \find_{\bx[\cdot]} \quad \subjto \quad &
\bx[n+1] = f(\bx[n]), \quad \forall n\in[0, N-1] \\ & \bx[0] = \bx[N].
\end{align*} Certainly we can add control inputs back into the formulation,
too, but let's start with this simple case. Take a moment to think about
the feasible solutions to this problem formulation. Certainly a fixed point
$\bx[n] = \bx^*$ will satisfy the constraints; if we don't want these
solutions to come out of the solver we might need to exclude them with
constraints or add an objective that guides the solver towards the desired
solutions. The other possible solutions are trajectories that are periodic
in exactly $N$ steps. That's pretty restrictive.
</p>
<p>We can do better if we use the continuous-time formulations. For
instance, in our introduction of <a
href="trajopt.html#direct_collocation">direct collocation</a>, we wrote
\begin{align*} \min_{\bx[\cdot],\bu[\cdot]} \quad & \ell_f(\bx[N]) +
\sum_{n_0}^{N-1} h_n \ell(\bx[n],\bu[n]) \\ \subjto \quad & \dot\bx(t_{c,n})
= f(\bx(t_{c,n}), \bu(t_{c,n})), & \forall n \in [0,N-1] \\ & \bx[0] = \bx_0
\\ & + \text{additional constraints}. \end{align*} But we can also add
$h_n$ as decision variables in the optimization (reminder: I recommend
setting a lower-bound $h_n \ge h_{min} > 0$). This allows our $N$-step
trajectory optimization to scale and shrink time in order to satisfy the
periodicity constraint. The result is simple and powerful.</p>
<example><h1>Finding the limit cycle of the
Van der Pol oscillator</h1>
<p>Recall the dynamics of the Van der Pol oscillator given by $$\ddot{q} +
\mu (q^2 - 1) \dot{q} + q = 0, \quad \mu>0,$$ which exhibited a stable
limit cycle.</p>
<p>Formulate the direct collocation optimization: \begin{align*}
\find_{\bx[\cdot],h} \quad \subjto \quad & q[0] = 0, \quad \dot{q}[0] > 0,
\\ & \bx[N] = \bx[0], \text{(periodicity constraint)}\\ &
\text{collocation dynamic constraints} \\ & 0.01 \le h \le 0.5
\end{align*}
</p>
<p>Try it yourself:</p>
<script>document.write(notebook_link('limit_cycles'))</script>
<p>As always, make sure that you take a look at the code. Poke around.
Try changing some things.</p>
<p>One of the things that you should notice in the code is that I provide
an initial guess for the solver. In most of the examples so far I've been
able to avoid doing that--the solver takes small random numbers as a
default initial guess and solves from there. But for this problem, I
found that it was getting stuck in a local minima. Adding the initial
guess that the solution moves around a circle in state space was enough.
</p>
</example>
<todo>Show how to take the local linearization along the trajectory: adjoint + projection at the boundary conditions.</todo>
<todo> example: simplest flapping robot </todo>
</section> <!-- end trajectory optimization -->
<section id="lyapunov"><h1>Lyapunov analysis</h1>
<p>Recall the important distinction between stability of a trajectory in
time and stability of a limit cycle was that the limit cycle does not
converge in phase -- trajectories near the cycle converge to the cycle,
but trajectories on the cycle may not converge with each other. This is
type of stability, also known as <em>orbital stability</em> can be
written as stability to the manifold described by a trajectory $\bx^*(t)$,
\[ \min_\tau || x(t) - x^*(\tau) || \rightarrow 0.\] In the case of limit
cycles, this manifold is a periodic solution with $\bx^*(t+t_{period}) =
\bx^*(t)$. Depending on exactly how that convergence happens, we can define
orbital stability in the sense of Lyapunov, asymptotic orbital stability,
exponential orbital stability, or even finite-time orbital stability.</p>
<p>In order to prove that a system is orbitally stable (locally, over a
region, or globally), or to analyze the region of attraction of a limit
cycle, we can use a Lyapunov function. When we analyzed our <a
href="simple_legs.html">simple models of walking and running</a>, we
carried out the analysis on the Poincare map. Indeed, if we can find a
Lyapunov function that proves (i.s.L., asympototic, or exponential)
stability of the discrete Poincare map, then this implies (i.s.L,
asymptotic, or exponential) <i>orbital</i> stability of the cycle. But
Lyapunov functions of this form are difficult to verify for a pretty
fundamental reason: we rarely have an analytical expression for the
Poincare map, since it is the result of integrating a nonlinear dynamics
over the cycle.</p>
<p>Instead, we will focus our attention on constructing Lyapunov functions
in the full state space, which use the continuous dynamics. In particular,
we would like to consider Lyapunov functions which have the form cartooned
below; they vanish to zero everywhere along the cycle, and are strictly
positive everywhere away from the cycle.</p>
<figure><img
src="figures/limitCycleLyapunovCartoon.svg"/><figcaption>Cartoon of a
Lyapunov function which vanishes on a limit cycle, and is strictly positive
everywhere else. (a small segment has been removed solely for the purposes
of visualization).</figcaption></figure>
<subsection><h1>Transverse coordinates</h1>
<p>How can we parameterize this class of functions? For arbitrary cycles
this could be very difficult in the original coordinates. For simple
cycles like in the cartoon, one could imagine using polar coordinates.
More generally, we will define a new coordinate system relative to the
orbit, with coordinates</p> <ul> <li>$\tau$ - the phase along the
orbit</li> <li>$\bx_\perp(\tau)$ - the remaining coordinates, linearly
independent from $\tau$.</li> </ul> <p>Given a state $\bx$ in the
original coordinates, we must define a smooth mapping $\bx \rightarrow
(\tau, \bx_\perp)$ to this new coordinate system. For example, for a
simple ring oscillator we might have:</p> <figure><img width="80%"
src="figures/transverseCoordinates.svg"/><figcaption>A moving coordinate
system along the limit cycle.</figcaption></figure> In general, for an
$n$-dimensional state space, $\tau$ will always be a scalar, and
$\bx_\perp$ will be an $(n-1)$-dimensional vector defining the remaining
coordinates relative to $\bx^*(\tau)$. In fact, although we use the
notation $\bx_\perp$ the coordinate system need not be strictly
orthogonal to the orbit, but must simply be <em>transversal</em> (not
parallel). Having defined the smooth mapping $\bx \rightarrow
(\tau,\bx_\perp)$, we can always rewrite the dynamics in this new
coordinate system: \begin{gather*} \dot{\tau} = f_\tau(\bx_\perp,\tau) \\
\dot\bx_\perp = f_\perp(\bx_\perp,\tau). \end{gather*} To keep our
notation simpler for the remainder of these notes, we will assume that
the origin of this new coordinate system is on the nominal trajectory
($\min_\tau |\bx - \bx^*(\tau)| = 0 \Rightarrow \bx_\perp = 0$).
Similarly, by taking $\tau$ to be the phase variable, we will leverage
the fact that on the nominal trajectory, we have $\dot\tau = f_\tau(0,
\tau) = 1.$</p>
<p>The value of
this construction for Lyapunov analysis was proposed in
<elib>Hauser94a</elib>
and has been extended nicely to control design in <elib>Shiriaev08</elib>
and for region of attraction estimation in <elib>Manchester10b</elib>. A
quite general numerical strategy for defining the transversal coordinates
is given in <elib>Manchester10a</elib>.</p>
<theorem><h1>A Lyapunov theorem for orbital stability</h1>
<p>For a dynamical system $\dot\bx = f(\bx)$ with $\bx \in \Re^n$, $f$
continuous, a continuous periodic solution $\bx^*(\tau)$, and a smooth
mapping $\bx \rightarrow (\tau,\bx_\perp)$ where $\bx_\perp$ vanishes on
$\bx^*$, then for some $n-1$ dimensional ball ${\cal B}$ around the
origin, if I can produce a $V(\bx_\perp,\tau)$ such that \begin{gather*}
\forall \tau, V(0,\tau) = 0, \\ \forall \tau, \forall \bx_\perp \in
{\cal B}, \bx_\perp \ne 0, V(\bx_\perp,\tau) > 0, \end{gather*} with
\begin{gather*} \forall \tau, \dot{V}(0,\tau) = 0, \\ \forall \tau,
\forall \bx_\perp \in {\cal B}, \bx_\perp \ne 0, \dot{V}(\bx_\perp,\tau)
< 0, \end{gather*} then the solution $\bx^*(t)$ is <em>locally orbitally
asymptotically stable</em>.</p>
</theorem>
<p>Orbital stability in the sense of Lyapunov and exponential orbital
stability can also be verified (with $\dot{V}_\perp \le 0$ and
$\dot{V}_\perp \le \alpha V_\perp$, respectively).</p>
<example><h1>Simple ring oscillator</h1>
<p> Perhaps the simplest oscillator is the first-order system which
converges to the unit circle. In cartesian coordinates, the dynamics
are \begin{align*} \dot{x}_1 =& x_2 -\alpha x_1 \left( 1 -
\frac{1}{\sqrt{x_1^2+x_2^2}}\right) \\ \dot{x}_2 =& -x_1 -\alpha x_2
\left( 1 - \frac{1}{\sqrt{x_1^2+x_2^2}}\right), \end{align*} where
$\alpha$ is a positive scalar gain.</p>
<figure><img width=80%
src="figures/ringOscillator.svg"/><figcaption>Vector field of the
ring oscillator</figcaption></figure>
<p>If we take the transverse coordinates to be the polar coordinates,
shifted so that $x_\perp$ is zero on the unit circle, \begin{align*}\tau
=& \text{atan2}(-x_2,x_1) \\ x_\perp =& \sqrt{x_1^2+x_2^2}-1,
\end{align*} which is valid when $x_\perp>-1$, then the simple
transverse dynamics are revealed: \begin{align*} \dot\tau =& 1 \\
\dot{x}_\perp =& -\alpha x_\perp. \end{align*} Taking a Lyapunov
candidate $V(x_\perp,\tau) = x_\perp^2$, we can verify that $$\dot{V} =
-2 \alpha x_\perp^2 < 0, \quad \forall x_\perp > -1.$$ This
demonstrates that the limit cycle is locally asymptotically stable, and
furthermore that the invariant open-set $V < 1$ is inside the region of
attraction of that limit cycle. In fact, we know that all $x_\perp > -1$
are in the region of attraction that limit cycle, but this is not proven
by the Lyapunov argument above. </p>
</example>
<p>Let's compare this approach again with the approach that we used in
the chapter on walking robots, where we used a Poincaré map
analysis to investigate limit cycle stability. In transverse coordinates
approach, there is an additional burden to construct the coordinate
system along the entire trajectory, instead of only at a single surface
of section. In fact, the transverse coordinates approach is sometimes
referred to as a "moving Poincaré section". Again, the reward for
this extra work is that we can check a condition that only involves the
instantaneous dynamics, $f(\bx)$, as opposed to having to integrate the
dynamics over an entire cycle to generate the discrete Poincaré
map, $\bx_p[n+1] = P(\bx_p[n])$. As we will see below, this approach
will also be more compatible with designing continuous feedback
controller that stabilize the limit cycle.</p>
</subsection> <!-- end transverse -->
<subsection><h1>Transverse linearization</h1>
<p>In the case of Lyapunov analysis around a fixed point, there was an
important special case: for stable linear systems, we actually have a
recipe for constructing a Lyapunov function. As a result, for nonlinear
systems, we often found it convenient to begin our search by linearizing
around the fixed point and using the Lyapunov candidate for the linear
system as an initial guess. That same approach can be extended to limit
cycle analysis.</p>
<p>In particular, let us make a linear approximation of the transverse
dynamics: $$\dot{\bx}_\perp = f_\perp(\bx_\perp, \tau) \approx
\pd{f_\perp}{\bx_\perp} \bx_\perp = \bA_\perp(\tau)\bx_\perp.$$ Note that
I've assumed that $\bx_\perp$ is zero on the nominal trajectory, so don't
need the additional notation of working in the error coordinates here.
Remember that $\tau$ is the phase along the trajectory, but
<elib>Hauser94a</elib> showed that (exponential) stability of the
<i>time-varying</i> linear system $\dot\bx_\perp = \bA_\perp(t)
\bx_\perp$ implies local exponential orbital stability of the original
system. In particular, if this transverse linear system is periodic and
orbitally stable, then for each $\bQ=\bQ^T \succ 0$, there exists a
unique positive definite solution to the periodic Riccati equation:
\begin{gather*} V(\bx_\perp, \tau) = \bx_\perp^T {\bf P}(\tau) \bx_\perp,
\qquad {\bf P}^T(\tau) = {\bf P}(\tau), \quad {\bf P}(\tau + t_{period})
= {\bf P}(\tau), \\ -\dot{\bf P}(\tau) = {\bf P}(\tau) \bA(\tau) +
\bA(\tau){\bf P}^T(\tau) + \bQ, \qquad \bQ=\bQ^T\succ 0. \end{gather*}
There is a surprisingly rich literature on the numerical methods for
finding the periodic solutions to these Lyapunov (and corresponding
Riccati) equations. In practice, for every problem I've ever tried, I've
simply integrated the equations backwards in time until the integration
converges to a periodic solution (this is not guaranteed to work, but
almost always does).
</p>
<p>This is very powerful. It gives us a general way to construct a
Lyapunov candidate that certifies local stability of the cycle, and which
can be used as a candidate for nonlinear Lyapunov analysis.</p>
</subsection>
<subsection><h1>Region of attraction estimation using sums-of-squares</h1>
<p>Coming soon. If you are interested, see <elib>Manchester10a+Manchester10b</elib>.</p>
</subsection>
</section> <!-- end lyapunov -->
<section><h1>Feedback design</h1>
<p>Many of the tools we've developed for stabilization of fixed points or
stabilization of trajectories can be adapted to this new setting. When we
discussed <a href="simple_legs.html#slip_control">control for the
spring-loaded inverted pendulum model</a>, we considered discrete control
decisions, made once per cycle, which could stabilize the discrete Poincare
map, $\bx_p[n+1] = f_p(\bx_p[n], \bu[n]),$ with $\bu[n]$ the once-per-cycle
decisions. While it's possible to get the local linear approximations of
this map using our adjoint methods, remember that we rarely have an
analytical expression for the nonlinear $f_p$.</p>
<p>How limiting is it to restrict ourselves to once-per-cycle decisions? Of
course we can improve performance in general with continuous feedback. But
there are also some cases where once-per-cycle decisions feel quite
reasonable. Foot placement for legged robots is a good example -- once
we've chosen a foot placement we might make minor corrections just before
we place the foot, but rarely change that decision once the foot is on the
ground; once per footstep seems like a reasonable approximation. Another
good example is flapping flight: here the wing beat cycle times can be much
faster than the dynamic timescales of the center of mass, and changing the
parameters of the wing stroke one per flap seems pretty reasonable.</p>
<p>But the tools we've developed above also give us the machinery we need
to consider continuous feedback throughout the trajectory. Let's look at a
few important formulations.</p>
<subsection><h1>For underactuation degree one.</h1>
<p>It turns out many of our simple walking models -- particularly ones
with a point foot that are derived in the minimal coordinates -- are only
short one actuator (between the foot and the ground). One can represent
even fairly complex robots this way; much of the theory that I'll elude
to here was originally developed by Jessy Grizzle and his group in the
context of the bipedal robot <a
href="https://web.eecs.umich.edu/~grizzle/papers/RABBITExperiments.html">RABBIT</a>.
Jessy's key observation was that limit cycle stability is effectively
stability in $n-1$ degrees of freedom, and you can often achieve it
easily with $n-1$ actuators -- he called this line of work "Hybrid Zero
Dynamics" (HZD). We'll deal with the "hybrid" part of that in the next
chapter, but here is a simple example to illustrate the "zero dynamics"
concept. [Coming soon...] </p>
<todo>Add a simple example here.</todo>
<p>Stabilizing the zero dynamics is synonymous with achieving orbital
stability. Interestingly, it doesn't actually guarantee that you will go
around the limit cycle. One "feature" of the HZD walkers is that you can
actually stand in front of them and stop their forward progress with your
hand -- the control system will continue to act, maintaining itself on the
cycle, but progress along the cycle has stopped. In some of the robots
you could even push them backwards and they would move through the same
walking cycle in reverse. If one wants to certify that forward progress
will be achieved, then the tasks is reduced to inspecting the
one-dimensional dynamics of the phase variable along the cycle.</p>
<p>The notion of "zero dynamics" is certainly not restricted to systems
with underactuation of degree one. In general, we can easily stabilize a
manifold of dimension $m$ with $m$ actuators (we saw this in the section
on <a href="acrobot.html#task_space_pfl"> task-space partial feedback
linearization</a>), and if being on that manifold is sufficient to achieve
our task, or if we can certify that the resulting dynamics on the manifold
are sufficient for our task, then life is good. But in the
"underactuation degree one" case, the manifold under study is a
trajectory/orbit, and the tools from this chapter are immediately
applicable.</p>
</subsection>
<subsection><h1>Transverse LQR</h1>
<p>Another natural approach to stabilizing a cycle uses the transverse
linearization. In particular, for a nominal (control) trajectory,
$[\bx_0(t), \bu_0(t)],$ with $\dot{\bx}_0 = f(\bx_0, \bu_0)$, we can
approximate the transverse dynamics: $$\dot{\bx}_\perp =
f_\perp(\bx_\perp, \tau, \bu) \approx \pd{f_\perp}{\bx_\perp} \bx_\perp +
\pd{f_\perp}{\bu}(\bu - \bu_0(\tau)) = \bA_\perp(\tau)\bx_\perp +
\bB_\perp(\tau) \bar{\bu}.$$ <elib>Shiriaev08</elib> showed that the
(periodic) time-varying LQR controller that stabilized this system
achieves <i>orbital</i> stabilization of the original system.</p>
<p>Let's take a minute to appreciate the difference between this approach
and time-varying LQR in the full coordinates. Of course, the behavior is
quite different: time-varying LQR in the full coordinates will try to
slow down or speed up to keep time with the nominal trajectory, where
this controller makes no attempt to stabilize the phase variable. You
might think you could design the original time-varying controller,
$\bar\bu = \bK(t)\bar\bx,$ and just "project to the closest time" during
execution (e.g. find the time $t = \argmin_\tau |\bx - \bx_0(\tau)|)$ and
use the feedback corresponding to that time. But this projection is not
guaranteed to be safe; you can find systems that are stabilizable by the
transverse linearization are unstable in closed-loop using a feedback
based on this projection.</p>
<example><h1>Time-varying LQR simply projected in time can lead to
instability.</h1>
<p>Coming soon...</p>
</example>
<p>But there is an even more important reason to like the transverse LQR
approach. If orbital stability is all you need, then this is a better
formulation of the problem because you are asking for less. As we've
just discussed with hybrid zero-dynamics, stabilizing $m$ degrees of
freedom with $m$ actuators is potentially very different than stabilizing
$m+1$ degrees of freedom. In the transverse formulation, we are asking
LQR to minimize the cost only in the transverse coordinates (it is
mathematically equivalent to designing a cost function in the full
coordinates that is zero in the direction of the nominal trajectory). In
practice, this can result is much smaller cost-to-go matrices, ${\bf
S}(t)$, and smaller feedback gains, $\bK(t)$. For underactuated systems,
this difference can be dramatic.</p>
</subsection>
<subsection><h1>Orbital stabilization for non-periodic trajectories</h1>
<p>The observation that orbital stabilization of a trajectory can ask
much less of your underactuated control system (and lead to smaller LQR
feedback gains), is quite powerful. It is not limited to stabilizing
periodic motions. If your aim is to stabilize a non-periodic path
through state-space, but do not actually care about the timing, then
formulating your stabilization can be a very good choice. You can find
examples where a system is stabilizable in the transverse coordinates,
but not in the full coordinates.</p>
<example><h1>Stabilizable in the transverse coordinates, but not the full
coordinates.</h1>
<p>Coming soon...</p>
</example>
<p>Take care, though, as the converse can also be true: it's possible
that a system could be stabilizable in the full coordinates but
<i>not</i> in some transverse coordinates; if you choose those
coordinates badly.</p>
<example><h1>Stabilizable in the full coordinates, but not the transverse
coordinates.</h1>
<p>Consider a trajectory with $u(t) = 1$ (from any initial condition)
for <a href="dp.html#minimum_time_double_integrator">
the double integrator</a>. We can certainly stablize this trajectory in
the full coordinates using LQR. Technically, one can choose transverse
coordinates $\tau = \dot{q},$ and $x_\perp = q.$ Clearly $x_\perp$ is
transverse to the nominal trajectory everywhere. But this transverse
linearization is also clearly not stabilizable. It's a bad choice!</p>
</example>
<p>The general approach to designing the transverse coordinates
<elib>Manchester10a</elib> could address this concern by including a
criteria for controllability when optimizing the transverse
coordinates.</p>
</subsection>
</section> <!-- end feedback -->
</chapter>
<!-- EVERYTHING BELOW THIS LINE IS OVERWRITTEN BY THE INSTALL SCRIPT -->
<div id="references"><section><h1>References</h1>
<ol>
<li id=Hauser94a>
<span class="author">John Hauser and Chung Choo Chung</span>,
<span class="title">"Converse {L}yapunov functions for exponentially stable periodic orbits"</span>,
<span class="publisher">Systems & Control Letters</span>, vol. 23, no. 1, pp. 27 -- 34, <span class="year">1994</span>.
</li><br>
<li id=Shiriaev08>
<span class="author">Anton S. Shiriaev and Leonid B. Freidovich and Ian R. Manchester</span>,
<span class="title">"Can We Make a Robot Ballerina Perform a Pirouette? Orbital Stabilization of Periodic Motions of Underactuated Mechanical Systems"</span>,
<span class="publisher">Annual Reviews in Control</span>, vol. 32, no. 2, pp. 200--211, Dec, <span class="year">2008</span>.
</li><br>
<li id=Manchester10b>
<span class="author">Ian R. Manchester and Mark M. Tobenkin and Michael Levashov and Russ Tedrake</span>,
<span class="title">"Regions of Attraction for Hybrid Limit Cycles of Walking Robots"</span>,
<span class="publisher">Proceedings of the 18th IFAC World Congress, extended version available online: arXiv:1010.2247 [math.OC]</span>, <span class="year">2011</span>.
</li><br>
<li id=Manchester10a>
<span class="author">Ian R. Manchester</span>,
<span class="title">"Transverse Dynamics and Regions of Stability for Nonlinear Hybrid Limit Cycles"</span>,
<span class="publisher">Proceedings of the 18th IFAC World Congress, extended version available online: arXiv:1010.2241 [math.OC]</span>, Aug-Sep, <span class="year">2011</span>.
</li><br>
</ol>
</section><p/>
</div>
<table style="width:100%;"><tr style="width:100%">
<td style="width:33%;text-align:left;"><a class="previous_chapter" href=output_feedback.html>Previous Chapter</a></td>
<td style="width:33%;text-align:center;"><a href=index.html>Table of contents</a></td>
<td style="width:33%;text-align:right;"><a class="next_chapter" href=contact.html>Next Chapter</a></td>
</tr></table>
<div id="footer">
<hr>
<table style="width:100%;">
<tr><td><a href="https://accessibility.mit.edu/">Accessibility</a></td><td style="text-align:right">© Russ
Tedrake, 2023</td></tr>
</table>
</div>
</body>
</html>