Skip to content

The Duality and the Failure of LQG Control

Slide: Explore the duality between state observers and feedback controllers, focusing on KF and LQR. Understand why combining the "optimal observer" with the "optimal controller" might fail. Inspired by Dominikus Noll's page "A generalization of the Linear Quadratic Gaussian Loop Transfer Recovery procedure (LQG/LTR)".

Introduction

System Model

Consider a n-th order linear time-invariant (LTI) discrete-time dynamic system with m-dimensional input and p-dimensional output:

xk+1=Axk+Buk+ωk,ωkN(0,Wk)yk=Cxk+νk,νkN(0,Vk)
  • xkRn: state vector at time step k
  • ukRm: control input vector at time step k
  • ykRp: measurement vector at time step k
  • ARn×n: state transition matrix
  • BRn×m: control input matrix
  • CRp×n: observation matrix

Controllability

A LTI system is said to be controllable if,

x0,x,k>0,uk=[uk1,,u1,u0],such thatxk=x.

This is equivalent to rank(Mc)=n, where Mc=[B,AB,A2B,,An1B]Rn×nm is the controllability matrix.

xn=Axn1+Bun1=A(Axn2+Bun2)+Bun1=A2xn2+ABun2+Bun1=Anx0+An1Bu0++ABun2+Bun1=Anx0+Mcunun=Mc(McMc)1(xAnx0)

Observability

A LTI system is said to be observable if,

x0Rnk>0,yk=[y0,y1,,yk1]x0.

This is equivalent to rank(Mo)=n, where Mo=[C,(CA),(CA2),,(CAn1)]Rnp×n is the observability matrix.

y0=Cx0y1=Cx1=CAx0  yn1=CAn1x0yn=[y0y1yn1]=[CCACAn1]x0=Mox0x0=(MoMo)1Moyn

The Optimality

Optimal Estimator: Kalman Filter

Goal:

minx^k|kE[(xkx^k|k)(xkx^k|k)y1,,yk]

Solution:

x^k|k1=Ax^k1|k1+Buk1P^k|k1=AP^k1|k1A+Wk1Kk=P^k|k1C(CP^k|k1C+Vk)1x^k|k=x^k|k1+Kk(ykCx^k|k1)P^k|k=P^k|k1KkCP^k|k1=(P^k|k11+CVk1C)1

Optimal Regulator: LQR

Goal:

min{uk}E[xNQNxN+k=0N1(xkQkxk+ukRkuk)]

Solution:

SN=QNLk=(Rk+BkSk+1Bk)1BkSk+1AkSk=Qk+AkSk+1(AkBkLk)uk=Lkxk

Linear Quadratic Gaussian (LQG)

The separation principle states that the design of the optimal controller and the optimal observer can be separated. The optimal control law is given by:

uk=Lkx^k|k

where x^k|k is the state estimate provided by the Kalman filter.

'Inertial-Based LQG Control' by Daniel Engelsman

The Duality

The Duality in Control Theory

Controllability vs Observability For the original system Σ=(A,B,C), the dual system is defined as Σ=(A,C,B).

  • Σ is controllable Σ is observable
  • Σ is observable Σ is controllable

Controller vs Observer

  • Feedback controller uk=Lkxk "suppresses" the state deviation xk through inputs
  • State observer x^k|k=x^k|k1+Kk(ykCx^k|k1) "corrects" the state estimate x^k|k through measurements
  • The design of Lk and Kk are dual problems

The Duality in LQR and Kalman Filter (Optimization)

Optimization formulation of LQR:

minx1:N,u1:N1xNQNxN+k=0N1[xkQkxk+ukRkuk]

Optimization formulation of Kalman Filter:

minx1:N,ω1:N1(x0x^0|0)P01(x0x^0|0)+k=0N1[(ykCxk)Vk1(ykCxk)+ωkWk1ωk]

subject to xk+1=Axk+Buk+ωk.

Duality:

AA,BC,QW,RV

The Duality in LQR and Kalman Filter (Riccati)

Riccati Equation in LQR:

{Lk=(Rk+BkSk+1Bk)1BkSk+1AkSk=Qk+AkSk+1(AkBkLk)S=ASA+QASB(BSB+R)1BSA

Riccati Equation in Kalman Filter:

{P^k|k1=AP^k1|k1A+Wk1Kk=P^k|k1C(CP^k|k1C+Vk)1P^k|k=P^k|k1KkCP^k|k1=(P^k|k11+CVk1C)1P=APA+WAPC(CPC+V)1CPA

Duality:

AA,BC,QW,RV,SP

The Failure

The Paradox of Optimality

LQR Robustness (SISO systems):

  • 60deg Phase Margin
  • 6 dB Gain Margin
  • Infinite gain reduction margin

Kalman Filter Robustness:

  • Dual robustness properties at sensor output
  • Excellent margins against sensor errors

The Fundamental Trade-Off

LQR's Need for High-Gain Feedback:

  • Large Q & Small R
  • Excellent stability margins

KF's Need for High-Gain Feedback:

  • Large W & Small V
  • Prompt response to new measurements

Optimizing for individual robustness leads to a fragile combined LQG system.

The Destructive Feedback Loop:

  1. High-gain L reacts aggressively to state deviations
  2. High-gain K amplifies sensor noise
  3. This creates a positive feedback loop
  4. Resulting in potential instability of the system

No stability guarantee for imperfect models, leading to the development of H Control