Initial Value Problem Differential Equations 175090

Partial capture of text on file.
            CORE                                                                                                                                                                                                                  Metadata, citation and similar papers at core.ac.uk
       Provided by CONICET Digital
                                    Latin American Applied Research                                                                                                                                                                        39:207-211 (2009) 
                                                                     PARTIAL DIFFERENTIAL EQUATIONS FOR MISSING  
                                                                BOUNDARY CONDITIONS IN THE LINEAR-QUADRATIC  
                                                                                                            OPTIMAL CONTROL PROBLEM 
                                                                                                                                                                           
                                                                                                                                                          †                                                                 ‡ 
                                                                                                              V. COSTANZA    and   C. E. NEUMAN
                                                                                         † INTEC (UNL-CONICET), Güemes 3450, 3000 Santa Fe, Argentina 
                                                                                                                                              tsinoli@ceride.gov.ar 
                                              ‡ Dep. de Matemática (FIQ), Universidad Nac. del Litoral, Sgo. del Estero 2829, 3000 Santa Fe, Argentina 
                                                                                                                                       ceneuman@fiqus.unl.edu.ar 
                                     
                                            Abstract−− New equations involving the unknown                                                                                the Hamiltonian of the problem can be uniquely opti-
                                                                                                                                                                                                                                      0
                                    final states and initial costates corresponding to  mized by a control value u  depending on the remaining 
                                    families of LQR problems are found, and their solu-                                                                                   variables (t,x,λ), then a set of 2n ordinary differential 
                                    tions are computed and validated.  Having the initial                                                                                 equations (ODEs) with a two-point boundary-value 
                                    values of the costates, the optimal control can then                                                                                  condition, known as Hamilton's (or Hamiltonian) equa-
                                    be constructed, for each particular problem, from                                                                                     tions (HE), has to be solved. This is often a rather diffi-
                                    the solution to the Hamiltonian equations, now  cult numerical problem.  For the linear-quadratic regula-
                                    achievable through on-line integration. The missing                                                                                   tor (LQR) with a finite horizon there exist well known 
                                    boundary conditions are obtained by solving (off-                                                                                     methods (see for instance Sontag, 1998) to transform 
                                    line) two uncoupled, first-order, quasi-linear, partial                                                                               the boundary-value problem into an initial-value one.  
                                    differential equations for two auxiliary n × n matri-                                                                                 In the infinite-horizon, bilinear-quadratic regulator and 
                                    ces, whose independent variables are the time-                                                                                        change of set-point servo, there is a recent attempt to 
                                    horizon duration T and the final-penalty matrix S.                                                                                    find the missing initial condition for the costate vari-
                                    The solutions to these PDEs give information on the                                                                                   able, which allows to integrate the equations on-line 
                                    behavior of the whole two-parameter family of con-                                                                                    with the underlying control process (Costanza and 
                                    trol problems, which can be used for design pur-                                                                                      Neuman, 2006). 
                                    poses.  The mathematical treatment takes advantage                                                                                            Hamiltonian systems (modelled by a 2n-dimensional 
                                    of the symplectic structure of the Hamiltonian for-                                                                                   ODE whose vector field can be expressed in terms of 
                                    malism, which allows to reformulate one of Bell-                                                                                      the partial derivatives of an underlying “total energy” 
                                    man's conjectures related to the “invariant-                                                                                          function -called “the Hamiltonian”-, constant along tra-
                                    imbedding” methodology.  Results are tested against                                                                                   jectories), are key objects in Mathematical Physics.  The 
                                    solutions of the differential Riccati equations associ-                                                                               ODEs for the state and costate of an optimal control 
                                    ated with these problems, and the attributes of the                                                                                   problem referred above constitute a Hamiltonian system 
                                    two approaches are illustrated and discussed.                                                                                         from this general point of view.  Richard Bellman has 
                                            Keywords−− optimal control, linear-quadratic  contributed in both fields, but was particularly interested 
                                    problem, first order PDEs, boundary-value prob-                                                                                       in symplectic systems coming from Physics (see for in-
                                    lems, Riccati equations.                                                                                                              stance Abraham and Marsden, 1978) when he devised a 
                                                                                                                                                                          partial differential equation (PDE) for the final value of 
                                                                          I. INTRODUCTION                                                                                 the state x(t)=r(T,c) as a function of the duration of the 
                                                                                                                                                                                                    f
                                                                                                                                                                          process T=t-t , and of the final value imposed to the co-
                                    The linear-quadratic regulator (LQR) problem is proba-                                                                                                          f   0
                                                                                                                                                                          state λ(t)=c (one of the boundary conditions, the other 
                                    bly the most studied and used in the state-space optimal                                                                                                f
                                                                                                                                                                          being the fixed initial value of the state x(t )=x , see 
                                    control literature.  The main line of work in this direc-                                                                                                                                                                                     0        0
                                    tion has evolved around the algebraic (ARE, for infi-                                                                                 Bellman and Kalaba, 1963).  Bellman exploited in that 
                                    nite-horizon problems) and differential (DRE, for finite-                                                                             case ideas common to the “invariant imbedding” nu-
                                    horizon ones) Riccati equations.  When expressed in 2n-                                                                               merical techniques, also associated with his name.  
                                    phase space, i.e. introducing the costate (the spacial de-                                                                                    In Costanza (2008) the invariant imbedding ap-
                                    rivative of the value function), the dynamics of the op-                                                                              proach is generalized and proved for the one-
                                    timal control problem takes the form of the classical                                                                                 dimensional nonlinear-quadratic optimal control situa-
                                    Hamilton's equations of fundamental Physics.                                                                                          tion, where the final value of the costate depends on the 
                                            Since early sixties, Hamiltonian formalism has been                                                                           final value of the state, i.e. c=c(r).  The procedure fol-
                                    at the core of the development of modern optimal con-                                                                                 lowed in this proof induces another PDE for the initial 
                                                                                                                                                                          value σ of the costate λ(t ), which was actually the main 
                                    trol theory (Pontryagin et al., 1962).  When the problem                                                                                                                                      0
                                    concerning an n-dimensional system and an additive  concern from the optimal control point of view. The 
                                    cost objective is regular (Kalman et al., 1969), i.e. when                                                                            first-order quasilinear equation for σ developed here is 
                                                                                                                                                                          new.  It can be integrated after the PDE for the final 
                                                                                                                                                                  207 
                                                                      V. CONSTANZA, C. E. NEUMAN 
                    state ρ (independent of σ) has been solved.  The “initial”                                        0                                   (7) 
                    condition for σ depends on the final value of the state                                  λ =−Hx(x,λ); λ(T)=2Sx(T),
                                                                                                          0                              0                  0      0
                    and the weight matrix S involved in the quadratic final                    where H (x,λ) stands for H(x,λ,u (x,λ)), and Hλ ,  Hx  
                    penalty x’(T)Sx(T).  Therefore it seems more natural to                    for the column vectors with i-components  ∂H0 ∂λ , 
                    consider here (T,S) as the independent parameters of the                                                                                       i
                                                                                               ∂H0 ∂x  respectively, which here take the form 
                    family of control problems under consideration.  Having                              i
                    found the solution σ(T,S) the HE can be integrated for                                                      1            (8) 
                    each particular value of the parameters.  However, the                                            x = Ax− 2Wλ
                    whole curves ρ(.,S), σ(.,S) can be useful in real time, as                                                                 (9) 
                                                                                                                      λ = −2Qx−A'λ
                    a kind of safeguard against unexpected departures of the                   with  W ≡ BR−1B'.  There are no general solutions to 
                    numerical solution to the HE.  Normal solutions to  boundary-value problems.  In the following section a 
                    Hamiltonian systems are unstable near equilibrium, a                       novel approach to overcome this difficulty, by imbed-
                    characteristic inherent to the spectrum of their lineariza-                ding the individual situation into a two-parameter fam-
                    tions (if λ is an eigenvalue of the linear approximation,                  ily of similar problems, will be presented and substanti-
                    so is –λ, see Abraham and Marsden, 1978).                                  ated.  
                         In what follows, and after some notation and general 
                    characteristics of the problem are exposed in section II,                           III. EQUATIONS FOR THE MISSING  
                    the main PDE equations for the missing boundary con-                                      BOUNDARY CONDITIONS 
                    ditions are proved in section III.  Numerical validations                  For the nonlinear-quadratic one-dimensional case two 
                    and illustrations are provided in section IV, and the                      quasilinear first-order PDEs (53, 55) have been found 
                    whole approach discussed in the Conclusions.  An Ap-                       (see the Appendix), one for each of the missing bound-
                    pendix is added to substantiate the general set-up valid                   ary conditions of the problem, namely the final state 
                    for the nonlinear case, and the corresponding equations                    x(T) and the initial costate λ(0).  Unfortunately such 
                    for the one-dimensional case reviewed (see Costanza,                       PDEs can not be extrapolated to higher dimensions in 
                    2008; for additional details).                                             an obvious way.  However, the immersion of the prob-
                           II. FORMULATION OF THE PROBLEM                                      lem into a two-parameter (T,S) family is still fruitful, as 
                                                                                               will be evident from what follows.  
                    The classical finite-horizon formulation of the “LQR                           It is well known that the LQR problem has a unique 
                    problem” for finite-dimensional, constant systems, at-                     solution via the Riccati differential equation (DRE) 
                    tempts to minimize the (quadratic) cost                                                                                             (10) 
                                                                                                       π =πWπ −πA−A'π −Q; π(T)=S,
                                          T                                                    leading to the optimal feedback 
                                            []
                          JT,0,x (u(.)) = ∫ x'(τ)Qx(τ)+u'(τ)Ru(τ) dτ +
                               0                                                   (1)                          u*(t) = −R−1B'π(t)x(t).     (11) 
                                           0
                                         x'(T)Sx(T)                                                An alternative classical approach (see for instance 
                    with respect to all admissible control trajectories u(.) of                Bernard, 1972) transforms the original boundary-value 
                    duration T applied to some fixed, deterministic (linear)                   problem into an initial-value one, by introducing the fol-
                                                            n                                  lowing auxiliary objects:  
                    plant; i.e. those affecting the ℜ  - valued states x of the                (i)   the Hamiltonian matrix H, 
                    system through some dynamic restriction                                                               ⎡            1    ⎤
                                                                                (2) 
                              x = f (x,u) = Ax+bu,          x(0) = x0 ≠ 0.                                          H=⎢ A           −2W⎥,    (12) 
                    The (real, time-constant) matrices in Eqs. (1,2) will                                                 ⎢−2Q        −A' ⎥
                    have the following properties: Q,R,S symmetric, Q,S≥0,                                                ⎣                 ⎦
                    R>0, A∈M (ℜ), B is n × m.  The expression under the                        (ii)   and the augmented Hamiltonian system (a linear 
                                  n                                                                  matrix ODE) defined for two n×n matrices X(t), 
                    integral is usually known as the “Lagrangian” L of the                           Λ(t), t∈[0,T] through 
                    cost, i.e.,                                                                                 ⎛  ⎞        X         X(T)         I
                                       L(x,u)≡ x'Qx+u'Ru.     (3)                                               ⎜ X ⎟      ⎛    ⎞    ⎛       ⎞    ⎛    ⎞    (13) 
                                                                                                                           ⎜    ⎟    ⎜       ⎟    ⎜    ⎟
                                                                                                                      =H         ;             =        .
                                                                                         n                      ⎜  ⎟      ⎜Λ⎟ ⎜Λ(T)⎟ ⎜2S⎟
                         The Hamiltonian of such a problem, namely the ℜ                                        ⎝Λ⎠        ⎝    ⎠    ⎝       ⎠    ⎝    ⎠
                    × ℜn              n
                            ×ℜ→ℜ function defined by                                               The solution to system (13) is 
                                   H(x,λ,u)≡L(x,u)+λ' f(x,u),   (4)                                                 ⎛ X(0)⎞       −HT⎛ I ⎞     (14) 
                                                                                                                    ⎜       ⎟         ⎜     ⎟
                                                                                                                              =e             ,
                    is known to be regular, i.e. that H is uniquely minimized                                       ⎜Λ(0)⎟            ⎜2S⎟
                    with respect to u by the control value                                                          ⎝       ⎠         ⎝     ⎠
                                                      1                                        and since in this case Eqs. (6-7) read 
                                         0                 −1          (3)                                                 
                                       u (x,λ)=− R B'λ                                                                   ⎛ x⎞      ⎛ x⎞       (15) 
                                                                                                                         ⎜  ⎟      ⎜   ⎟
                                                      2                                                                       =H        ,
                                                                                                                         ⎜ ⎟      ⎜   ⎟
                    (in this case, independent of x), which is usually called                                            ⎝λ⎠       ⎝λ⎠
                    “the H-minimal control.”  The “Hamiltonian” form of                        the missing boundary conditions can be explicitly found 
                    the problem (see for instance Sontag, 1998) requires                                                 x(T)= X−1(0)x ,    (16) 
                    then to solve the two-point boundary-value problem                                                                      0
                                                                                                                    λ(0)=Λ(0)X−1(0)x ,    (17) 
                                         0                             (6)                                                                   0
                                   x = Hλ(x,λ);       x(0) = x0,                               (see Sontag, 1998, for the invertibility of X and other 
                                                                                          208 
                 Latin American Applied Research                                                                    39:207-211 (2009) 
                 details).  Actually, the whole solution to DRE can be re-       It can be easily checked that the PDEs and boundary 
                 cuperated from the solution to the augmented system         conditions (53-54, 55-56), whose validity is already 
                 (13), namely                                                known in the one-dimensional nonlinear case (see the 
                                            −1              .     (18)  Appendix), are also verified by the scalar version of 
                              π(t) = Λ(t)X    (t),  t ∈[]0,T
                 However it is desirable to count with the missing  Eqs. (30), namely 
                 boundary values for different values of the parameters                    ρ = x0 α,    σ =βx0 α,    (31) 
                 T,S without solving either the DRE or the augmented         (α is always nonzero), provided that α, β satisfy Eqs. 
                 system described above.  A method to solve the whole        (27, 28, 29).  
                 (T,S)-family of LQR problems (with common A, B, Q,              Two other possible reformulations of these equa-
                 R, x  values) was then developed. To be precise, just the   tions for linear n-dimensional systems may be explored 
                     0
                 case with S=sI, with s≥0 will be exposed, the extension     (with theoretical rather than practical purposes): the sca-
                 to non-scalar matrices being more operationally in-         lar (internal) product  
                 volved but not conceptually different.  Nevertheless, the                             (           )       (32) 
                 preset set-up is pertinent to many applications.  From             ρ'ρT −(Mρ)'ρS ≈ (A−WS)ρ 'ρ
                 now on, the notation for the relevant missing boundary      and the matrix (external product) form: 
                                                                                                       (           ) ,    (33) 
                 values and matrices will be                                        ρTρ'−ρS(Mρ)'≈ρ (A−WS)ρ '
                           ρ ≡ x(T);   σ ≡λ(0); U ≡e−HT .             (19)   (and the corresponding analogous equations for σ). Both 
                 The method starts by defining, for each particular (T,S)-   proved to be insufficient to predict the desired values of 
                 problem                                                     ρ, σ, so it was necessary to generalize to matrix equa-
                                  ⎛α(T,S)⎞ ⎛X(0)⎞                            tions for α and β in order to solve the general linear 
                                                           (20) 
                                  ⎜        ⎟   ⎜     ⎟                       problem. In the next section some results of numerical 
                                             ≡        ,
                                  ⎜β(T,S)⎟ ⎜Λ(0)⎟                            calculations involving the solutions to Eqs. (27,28) are 
                                  ⎝        ⎠   ⎝     ⎠
                 which allows to rewrite Eq. (14) in the form                examined and compared against the DRE approach. 
                                      ⎛α⎞      ⎛ I ⎞      (21)                    IV. NUMERICAL CALCULATIONS AND  
                                      ⎜  ⎟     ⎜   ⎟
                                           ≡U       .
                                      ⎜β⎟      ⎜2S⎟                                     ADDITIONAL VALIDATIONS 
                                      ⎝  ⎠     ⎝   ⎠                         Equations (27, 28, 29) were solved numerically with 
                    Since the subjacent Hamiltonian system is linear, so-    standard software in several cases, and the solutions 
                 lutions depend smoothly on parameters and initial con-      were tested to verify the following identities stemming 
                 ditions, and then derivatives of Eq. (21) with respect to   from the symplectic structure of the problem (see Katok 
                 (T,S) can be taken                                          and Hasselblatt, 1999; Jacobson, 1974) 
                                  ⎛αT ⎞         ⎛ I ⎞      (22)                     U 'U −U 'U =I =U 'U −U 'U ,   (34) 
                                  ⎜   ⎟         ⎜   ⎟
                                        =−HU         ,                                1  4     3  2        4   1    2   3
                                  ⎜β ⎟          ⎜2S⎟
                                  ⎝ T ⎠         ⎝   ⎠                               U 'U −U 'U = I =U 'U −U 'U .   (35) 
                                      ⎛α ⎞      ⎛ 0 ⎞                                 1  3     3  1         2   4    4   2
                                         S                                   Also, to illustrate the theoretical results, some compo-
                                      ⎜   ⎟     ⎜   ⎟      (23) 
                                            =U       .
                                      ⎜β ⎟      ⎜2I⎟                         nents of the solutions (ρ, σ) for the linear system with 
                                      ⎝ S ⎠     ⎝   ⎠
                 Now, by partitioning in the obvious way                     matrices 
                                          ⎛U    U ⎞                                             ⎛−2    0 ⎞       ⎛1⎞     (36) 
                                             1     2                                            ⎜        ⎟       ⎜ ⎟
                                          ⎜         ⎟      (24)                            A=             ,  b=     ,
                                     U=              .                                          ⎜        ⎟       ⎜ ⎟
                                          ⎜         ⎟                                             3   −1          1
                                          ⎝U3 U4⎠                                               ⎝        ⎠       ⎝ ⎠
                 Then, Eq. (23) reads                                        subject to the quadratic Lagrangian defined by matrices 
                                  1             1                                                 ⎛ 1    0.3⎞          
                                                                                                  ⎜          ⎟
                                    α =U ,        β =U ,   (25)                               Q=             ,   r =1,
                                      S    2       S     4                                        ⎜0.3    2 ⎟
                                  2             2                                                 ⎝          ⎠
                 which combined with Eq. (21) gives                          were plotted for the range (T,S)∈[0,1]×[0,1] in Figs. 1 
                              U1 =α −SαS, U3 = β −SβS,  (26) and 2.  
                 and then, by inserting these results in Eq. (22), the fol-      The results were also compared with the DRE solu-
                 lowing (main) relations are obtained                        tion for several intermediate (T,S)-problems, which 
                                  α −α M =−αN,     (27) showed almost no difference, as can be seen in Figure 3.  
                                    T    S                                       The time trajectories were also calculated from the 
                                  β −β M =−βN,     (28) initial conditions 
                                    T    S                                                     σ(0.5,sI) for several values of s, just to 
                 where M ≡ A'S +SA+Q−SWS, N ≡ A−WS.                          illustrate how the regulation capacity (the approaching 
                    Boundary conditions for a process of zero horizon        to (0,0)) increases with increasing final penalty (Fig. 4).  
                 are imposed in view of Eqs. (20, 16, 17), i.e.              It is also interesting to observe that the two components 
                                                                             of the state tend to the origin, since the optimal LQR 
                              α(0,S)=I, β(0,S)=2S.    (29) 
                    The desired values of the missing values for the state   control is stabilizing, but they do not decrease mono-
                 and costate, for any (T,S)-problem may then be recuper-     tonically as the results in Fig. 1 may suggest.  Actually, 
                                                                             for small final penalty (s=0.25), the second component 
                 ated from the solutions α and β through                     first grows from its initial value (0.1), and only after 
                                 −1             −1                           some time it heads towards equilibrium (Figs. 4 and 5). 
                           ρ =α x0, σ =βα x0 =βρ.    (30) 
                                                                          209 
                                                             V. CONSTANZA, C. E. NEUMAN 
                                                                                   tonian formulation of finite-horizon LQR problems, into 
                                                                                   an initial-value set-up with unique solution, have been 
                                                                                   derived, solved and illustrated.  The approach is based 
                                                                                   on invariant-imbedding ideas, although the original 
                                                                                   Bellman methodology resulted somehow inadequate in 
                      0.1                                                          this case.  By solving two matrix, quasilinear, first-order 
                    0.075                                                          PDEs for auxiliary variables α,  β proposed here, the 
                      0.05                                               1         missing boundary conditions are effectively recuperated 
                     0.025                                            0.75         after simple manipulations.  Actually, the auxiliary vari-
                                                                                   ables are found for a two-parameter family of LQR 
                           0                                       0.5 S           problems posed for fixed plant dynamics and trajectory 
                               0.25                             0.25               costs, but with variable final penalty and horizon spans.  
                                  T 0.5                                            This immersion allows a whole range of (T,S)-problems 
                                               0.75         0                      to be assessed by looking at the final reachable state 
                                                         1                         ρ(T,S) and the associated marginal cost σ(T,S). 
                  Figure 1: First component ρ (T,s) of final state value calcu-        It is remarkable that the solution to a twice-infinite 
                                               1                                   family of LQR problems requires little numerical effort, 
                  lated from matrix α.                                             roughly similar to the one involved in running the asso-
                                                                                   ciated DRE\ for just one individual situation.  The solu-
                                                                                   tion for a range of (T,S)-values provides design informa-
                                                                                   tion, useful when flexible choice of the parameters to 
                                                                                   improve performance is present. 
                      0.2                                                              The soundness of this approach for linear plants 
                    0.15                                                1          seems also promising in suggesting an algorithm to 
                       0.1                                                         solve multidimensional nonlinear problems with regular 
                      0.05                                           0.75          Hamiltonians, even allowing for more general Lagran-
                          0                                       0.5              gians than those described by quadratic forms.  This 
                             0.25                                      S           idea is already under development. 
                                      0.5                      0.25
                                   T          0.75
                                                        1 0                     
                  Figure 2: First component σ (T,s) of initial costate, from ma-
                                              1
                  trices α and β. 
                              x 10-4
                           5
                        δ  0
                                                                                                                                                 
                         -5                                                        Figure 4: Trajectories in phase space. 
                           1                                          1
                                 0.5                       0.5
                                    s       0 0          T                                 0.1
                                                                                                                            x
                  Figure 3: Difference between ρ  calculated from matriz α and         s                                      2
                  by solving de DRE.             1
                      For this example both members of relations (32) and              state0.05
                                                                              -3                                                 x
                  (33) were also evaluated. An agreement of the order 10                                                           1
                  was observed.  Since the values of the variables in-
                  volved in the calculations are relatively small, firm hy-
                  potheses on the soundness of Eqs. (32, 33) can not be                       0
                  raised yet.                                                                  0              0.2              0.4
                                     V. CONCLUSIONS                                                                  t                           
                  New equations to transform the classical two-point  Figure 5: States time-trajectories for small final penalty 
                  boundary-value ODE system associated with the Hamil-             (s=0.25).  
                                                                               210
The words contained in this file might help you see if this file matches what you are looking for:

...Core metadata citation and similar papers at ac uk provided by conicet digital latin american applied research partial differential equations for missing boundary conditions in the linear quadratic optimal control problem v costanza c e neuman intec unl guemes santa fe argentina tsinoli ceride gov ar dep de matematica fiq universidad nac del litoral sgo estero ceneuman fiqus edu abstract new involving unknown hamiltonian of can be uniquely opti final states initial costates corresponding to mized a value u depending on remaining families lqr problems are found their solu variables t x then set n ordinary tions computed validated having odes with two point values condition known as hamilton s or equa constructed each particular from he has solved this is often rather diffi solution now cult numerical regula achievable through line integration tor finite horizon there exist well obtained solving off methods see instance sontag transform uncoupled first order quasi into an one auxiliary m...
Related files

Share

Help

Related files

Share

Share to social media

Help

Login Area