The Duality and the Failure of LQG Control

Slide: Explore the duality between state observers and feedback controllers, focusing on KF and LQR. Understand why combining the "optimal observer" with the "optimal controller" might fail. Inspired by Dominikus Noll's page "A generalization of the Linear Quadratic Gaussian Loop Transfer Recovery procedure (LQG/LTR)".

Introduction

System Model

Consider an $n$-th order linear time-invariant (LTI) discrete-time dynamic system with $m$-dimensional input and $p$-dimensional output:

\begin{aligned} x_{k + 1} & = A x_{k} + B u_{k} + ω_{k}, & ω_{k} \sim N (0, W_{k}) \\ y_{k} & = C x_{k} + ν_{k}, & ν_{k} \sim N (0, V_{k}) \end{aligned}

$x_{k} \in R^{n}$ : state vector at time step $k$
$u_{k} \in R^{m}$ : control input vector at time step $k$
$y_{k} \in R^{p}$ : measurement vector at time step $k$
$A \in R^{n \times n}$ : state transition matrix
$B \in R^{n \times m}$ : control input matrix
$C \in R^{p \times n}$ : observation matrix

Controllability

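An LTI system is said to be controllable if any initial state can be steered to any target state in finite time:

\forall x_{0}, x^{*}, \exists k > 0, \bar{u}_{k} = [u_{k - 1}^{⊤}, \dots, u_{1}^{⊤}, u_{0}^{⊤}]^{⊤}, \text{ such that } x_{k} = x^{*} .

This is equivalent to $rank (M_{c}) = n$ , where $M_{c} = [B, A B, A^{2} B, \dots, A^{n - 1} B] \in R^{n \times n m}$ is the controllability matrix. To see why, ignore the process noise and unroll the dynamics over $n$ steps:

\begin{aligned} x_{n} & = A x_{n - 1} + B u_{n - 1} \\ & = A (A x_{n - 2} + B u_{n - 2}) + B u_{n - 1} \\ & = A^{2} x_{n - 2} + A B u_{n - 2} + B u_{n - 1} \\ & = \dots \\ & = A^{n} x_{0} + A^{n - 1} B u_{0} + \dots + A B u_{n - 2} + B u_{n - 1} \\ & = A^{n} x_{0} + M_{c} \bar{u}_{n} \end{aligned}

If $rank (M_{c}) = n$ , then $M_{c} M_{c}^{⊤}$ is invertible, and the minimum-norm input sequence that reaches $x^{*}$ in $n$ steps is

\bar{u}_{n} = M_{c}^{⊤} (M_{c} M_{c}^{⊤})^{- 1} (x^{*} - A^{n} x_{0})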
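The rank test and the minimum-norm input above can be checked numerically. The following is a minimal NumPy sketch, not part of the original slides: the double-integrator matrices, the states `x0` and `x_target`, and the helper name `controllability_matrix` are all illustrative assumptions.

```python
import numpy as np

def controllability_matrix(A, B):
    """Stack [B, AB, ..., A^(n-1) B] column-wise (hypothetical helper)."""
    n = A.shape[0]
    blocks = [B]
    for _ in range(n - 1):
        blocks.append(A @ blocks[-1])
    return np.hstack(blocks)

# Assumed example system: a discrete-time double integrator.
A = np.array([[1.0, 1.0],
              [0.0, 1.0]])
B = np.array([[0.0],
              [1.0]])
n = A.shape[0]

Mc = controllability_matrix(A, B)
is_controllable = np.linalg.matrix_rank(Mc) == n

x0 = np.array([1.0, -2.0])        # assumed initial state
x_target = np.array([3.0, 0.5])   # assumed target state

# Minimum-norm stacked input [u_{n-1}; ...; u_0] reaching x_target in n steps.
residual = x_target - np.linalg.matrix_power(A, n) @ x0
u_stacked = Mc.T @ np.linalg.solve(Mc @ Mc.T, residual)

# Replay the inputs in time order (u_0 first) on the noise-free dynamics.
x = x0.copy()
for u in u_stacked.reshape(n, -1)[::-1]:
    x = A @ x + B @ u
```

After the replay loop, `x` should coincide with `x_target` up to rounding, which is a quick sanity check on the ordering of the stacked inputs.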

Observability

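An LTI system is said to be observable if the initial state can be uniquely determined from a finite sequence of outputs:

\forall x_{0} \in R^{n}, \exists k > 0 \text{ such that } \bar{y}_{k} = [y_{0}^{⊤}, y_{1}^{⊤}, \dots, y_{k - 1}^{⊤}]^{⊤} \text{ uniquely determines } x_{0} .

This is equivalent to $rank (M_{o}) = n$ , where $M_{o} = [C^{⊤}, (C A)^{⊤}, (C A^{2})^{⊤}, \dots, (C A^{n - 1})^{⊤}]^{⊤} \in R^{n p \times n}$ is the observability matrix. To see why, set the input and the noise to zero and stack the first $n$ outputs:

\begin{aligned} y_{0} & = C x_{0} \\ y_{1} & = C x_{1} = C A x_{0} \\ & ⋮ \\ y_{n - 1} & = C A^{n - 1} x_{0} \end{aligned} \Rightarrow \bar{y}_{n} = \begin{bmatrix} y_{0} \\ y_{1} \\ ⋮ \\ y_{n - 1} \end{bmatrix} = \begin{bmatrix} C \\ C A \\ ⋮ \\ C A^{n - 1} \end{bmatrix} x_{0} = M_{o} x_{0}

If $rank (M_{o}) = n$ , then $M_{o}^{⊤} M_{o}$ is invertible, and the initial state is recovered by least squares:

x_{0} = (M_{o}^{⊤} M_{o})^{- 1} M_{o}^{⊤} \bar{y}_{n}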
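The same kind of numerical check works for observability. This is a minimal NumPy sketch under assumed values (double integrator with a position-only measurement, zero input, no noise); the helper name `observability_matrix` is hypothetical.

```python
import numpy as np

def observability_matrix(A, C):
    """Stack [C; CA; ...; C A^(n-1)] row-wise (hypothetical helper)."""
    n = A.shape[0]
    blocks = [C]
    for _ in range(n - 1):
        blocks.append(blocks[-1] @ A)
    return np.vstack(blocks)

# Assumed example system: double integrator, only the first state is measured.
A = np.array([[1.0, 1.0],
              [0.0, 1.0]])
C = np.array([[1.0, 0.0]])
n = A.shape[0]

Mo = observability_matrix(A, C)
is_observable = np.linalg.matrix_rank(Mo) == n

# Generate n noise-free outputs from an initial state treated as unknown.
x0_true = np.array([0.7, -1.3])
x = x0_true.copy()
outputs = []
for _ in range(n):
    outputs.append(C @ x)
    x = A @ x
y_stacked = np.concatenate(outputs)

# Least-squares solution of Mo x0 = y, i.e. (Mo^T Mo)^{-1} Mo^T y.
x0_hat, *_ = np.linalg.lstsq(Mo, y_stacked, rcond=None)
```

Using `np.linalg.lstsq` instead of forming $(M_{o}^{⊤} M_{o})^{- 1}$ explicitly is a numerical convenience; both give the same estimate when $M_{o}$ has full column rank.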

The Optimality

Optimal Estimator: Kalman Filter

Goal:

min_{{\hat{x}}_{k | k}} E [(x_{k} - {\hat{x}}_{k | k}) (x_{k} - {\hat{x}}_{k | k})^{⊤} ∣ y_{1}, \dots, y_{k}]

Solution:

\begin{aligned} {\hat{x}}_{k | k - 1} & = A {\hat{x}}_{k - 1 | k - 1} + B u_{k - 1} \\ {\hat{P}}_{k | k - 1} & = A {\hat{P}}_{k - 1 | k - 1} A^{⊤} + W_{k - 1} \\ K_{k} & = {\hat{P}}_{k | k - 1} C^{⊤} (C {\hat{P}}_{k | k - 1} C^{⊤} + V_{k})^{- 1} \\ {\hat{x}}_{k | k} & = {\hat{x}}_{k | k - 1} + K_{k} (y_{k} - C {\hat{x}}_{k | k - 1}) \\ {\hat{P}}_{k | k} & = {\hat{P}}_{k | k - 1} - K_{k} C {\hat{P}}_{k | k - 1} = ({\hat{P}}_{k | k - 1}^{- 1} + C^{⊤} V_{k}^{- 1} C)^{- 1} \end{aligned}
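One predict/update cycle of the recursion above can be sketched as a single function. This is an illustrative NumPy version with constant noise covariances `W` and `V`; the function name `kalman_step` and all numeric values are assumptions, not taken from the slides.

```python
import numpy as np

def kalman_step(x_post, P_post, u_prev, y, A, B, C, W, V):
    """One Kalman filter cycle: predict with the model, then correct with y."""
    # Prediction: propagate the previous estimate and its covariance.
    x_prior = A @ x_post + B @ u_prev
    P_prior = A @ P_post @ A.T + W
    # Kalman gain from the predicted covariance.
    innovation_cov = C @ P_prior @ C.T + V
    K = P_prior @ C.T @ np.linalg.inv(innovation_cov)
    # Correction: blend in the measurement residual.
    x_new = x_prior + K @ (y - C @ x_prior)
    P_new = P_prior - K @ C @ P_prior
    return x_new, P_new, K

# Assumed example values.
A = np.array([[1.0, 1.0], [0.0, 1.0]])
B = np.array([[0.0], [1.0]])
C = np.array([[1.0, 0.0]])
W = 0.01 * np.eye(2)
V = np.array([[0.1]])

x_hat = np.zeros(2)
P_hat = np.eye(2)
x_hat, P_new, K = kalman_step(x_hat, P_hat, np.array([0.0]), np.array([1.0]),
                              A, B, C, W, V)

# Information form of the same update, for comparison with the last line
# of the recursion: (P_prior^{-1} + C^T V^{-1} C)^{-1}.
P_prior = A @ P_hat @ A.T + W
P_info = np.linalg.inv(np.linalg.inv(P_prior) + C.T @ np.linalg.inv(V) @ C)
```

`P_new` and `P_info` should agree up to rounding, which mirrors the two equivalent expressions for ${\hat{P}}_{k | k}$.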

Optimal Regulator: LQR

Goal:

min_{\{u_{k}\}} E [x_{N}^{⊤} Q_{N} x_{N} + \sum_{k = 0}^{N - 1} (x_{k}^{⊤} Q_{k} x_{k} + u_{k}^{⊤} R_{k} u_{k})]

Solution:

\begin{aligned} S_{N} & = Q_{N} \\ L_{k} & = (R_{k} + B_{k}^{⊤} S_{k + 1} B_{k})^{- 1} B_{k}^{⊤} S_{k + 1} A_{k} \\ S_{k} & = Q_{k} + A_{k}^{⊤} S_{k + 1} (A_{k} - B_{k} L_{k}) \\ u_{k} & = - L_{k} x_{k} \end{aligned}
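The backward recursion can be sketched in a few lines. This NumPy version assumes constant $A$, $B$, $Q$, $R$ for brevity (the recursion itself allows time-varying matrices); the function name `lqr_backward`, the horizon, and the weights are illustrative assumptions.

```python
import numpy as np

def lqr_backward(A, B, Q, R, QN, N):
    """Finite-horizon LQR: return gains [L_0, ..., L_{N-1}] and S_0."""
    S = QN
    gains = [None] * N
    for k in range(N - 1, -1, -1):
        # L_k = (R + B^T S_{k+1} B)^{-1} B^T S_{k+1} A
        L = np.linalg.solve(R + B.T @ S @ B, B.T @ S @ A)
        # S_k = Q + A^T S_{k+1} (A - B L_k)
        S = Q + A.T @ S @ (A - B @ L)
        gains[k] = L
    return gains, S

# Assumed example values.
A = np.array([[1.0, 1.0], [0.0, 1.0]])
B = np.array([[0.0], [1.0]])
Q = np.eye(2)
R = np.array([[1.0]])
QN = np.eye(2)
N = 20

gains, S0 = lqr_backward(A, B, Q, R, QN, N)

# Roll out the noise-free closed loop and accumulate the quadratic cost.
x0 = np.array([1.0, 0.0])
x, cost = x0.copy(), 0.0
for k in range(N):
    u = -gains[k] @ x
    cost += x @ Q @ x + u @ R @ u
    x = A @ x + B @ u
cost += x @ QN @ x
```

In the noise-free case the accumulated `cost` should equal `x0 @ S0 @ x0`, since $S_{k}$ is the cost-to-go matrix of the optimal policy.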

Linear Quadratic Gaussian (LQG)

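The separation principle states that the optimal controller and the optimal observer can be designed independently: the LQR gain $L_{k}$ is computed as if the state were measured exactly, and the Kalman gain $K_{k}$ is computed without reference to the cost weights. The optimal control law then applies the LQR gain to the state estimate: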

u_{k} = - L_{k} {\hat{x}}_{k | k}

where ${\hat{x}}_{k | k}$ is the state estimate provided by the Kalman filter.

'Inertial-Based LQG Control' by Daniel Engelsman

The Duality

The Duality in Control Theory

Controllability vs Observability

For the original system $Σ = (A, B, C)$ , the dual system is defined as $Σ^{*} = (A^{⊤}, C^{⊤}, B^{⊤})$ .

$Σ$ is controllable $\Leftrightarrow$ $Σ^{*}$ is observable
$Σ$ is observable $\Leftrightarrow$ $Σ^{*}$ is controllable
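The two equivalences follow from the fact that the controllability matrix of one system is the transpose of the observability matrix of its dual. A minimal NumPy sketch of that identity, using randomly generated matrices and hypothetical helper names `ctrb` and `obsv`:

```python
import numpy as np

def ctrb(A, B):
    """Controllability matrix [B, AB, ..., A^(n-1) B]."""
    blocks = [B]
    for _ in range(A.shape[0] - 1):
        blocks.append(A @ blocks[-1])
    return np.hstack(blocks)

def obsv(A, C):
    """Observability matrix [C; CA; ...; C A^(n-1)]."""
    blocks = [C]
    for _ in range(A.shape[0] - 1):
        blocks.append(blocks[-1] @ A)
    return np.vstack(blocks)

# Assumed random system with n = 3, m = 2, p = 1.
rng = np.random.default_rng(0)
A = rng.standard_normal((3, 3))
B = rng.standard_normal((3, 2))
C = rng.standard_normal((1, 3))

# Dual system (A^T, C^T, B^T).
A_dual, B_dual, C_dual = A.T, C.T, B.T

# Controllability of the original vs observability of the dual.
same_ctrb = np.allclose(ctrb(A, B).T, obsv(A_dual, C_dual))
# Observability of the original vs controllability of the dual.
same_obsv = np.allclose(obsv(A, C).T, ctrb(A_dual, B_dual))
```

Since a matrix and its transpose have the same rank, the rank conditions for the original and the dual system coincide.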

Controller vs Observer

Feedback controller $u_{k} = - L_{k} x_{k}$ "suppresses" the state deviation $x_{k}$ through inputs
State observer ${\hat{x}}_{k | k} = {\hat{x}}_{k | k - 1} + K_{k} (y_{k} - C {\hat{x}}_{k | k - 1})$ "corrects" the state estimate ${\hat{x}}_{k | k}$ through measurements
The design of $L_{k}$ and $K_{k}$ are dual problems

The Duality in LQR and Kalman Filter (Optimization)

Optimization formulation of LQR:

min_{x_{1 : N}, u_{0 : N - 1}} x_{N}^{⊤} Q_{N} x_{N} + \sum_{k = 0}^{N - 1} [x_{k}^{⊤} Q_{k} x_{k} + u_{k}^{⊤} R_{k} u_{k}]

subject to $x_{k + 1} = A x_{k} + B u_{k}$ .

Optimization formulation of Kalman Filter:

min_{x_{0 : N}, ω_{0 : N - 1}} (x_{0} - {\hat{x}}_{0 | 0})^{⊤} P_{0}^{- 1} (x_{0} - {\hat{x}}_{0 | 0}) + \sum_{k = 0}^{N - 1} [(y_{k} - C x_{k})^{⊤} V_{k}^{- 1} (y_{k} - C x_{k}) + ω_{k}^{⊤} W_{k}^{- 1} ω_{k}]

subject to $x_{k + 1} = A x_{k} + B u_{k} + ω_{k}$ .

Duality:

A \leftrightarrow A^{⊤}, B \leftrightarrow C^{⊤}, Q \leftrightarrow W, R \leftrightarrow V

The Duality in LQR and Kalman Filter (Riccati)

Riccati Equation in LQR:

{\begin{cases} L_{k} = (R_{k} + B_{k}^{⊤} S_{k + 1} B_{k})^{- 1} B_{k}^{⊤} S_{k + 1} A_{k} \\ S_{k} = Q_{k} + A_{k}^{⊤} S_{k + 1} (A_{k} - B_{k} L_{k}) \end{cases}

S = A^{⊤} S A + Q - A^{⊤} S B (B^{⊤} S B + R)^{- 1} B^{⊤} S A

Riccati Equation in Kalman Filter:

{\begin{cases} {\hat{P}}_{k | k - 1} = A {\hat{P}}_{k - 1 | k - 1} A^{⊤} + W_{k - 1} \\ K_{k} = {\hat{P}}_{k | k - 1} C^{⊤} (C {\hat{P}}_{k | k - 1} C^{⊤} + V_{k})^{- 1} \\ {\hat{P}}_{k | k} = {\hat{P}}_{k | k - 1} - K_{k} C {\hat{P}}_{k | k - 1} = ({\hat{P}}_{k | k - 1}^{- 1} + C^{⊤} V_{k}^{- 1} C)^{- 1} \end{cases}

P = A P A^{⊤} + W - A P C^{⊤} (C P C^{⊤} + V)^{- 1} C P A^{⊤}

Duality:

A \leftrightarrow A^{⊤}, B \leftrightarrow C^{⊤}, Q \leftrightarrow W, R \leftrightarrow V, S \leftrightarrow P
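The substitution can be tested numerically: running an LQR Riccati iteration on the dual data $(A^{⊤}, C^{⊤}, W, V)$ should reproduce the Kalman filter covariance. The sketch below uses a plain fixed-point iteration rather than a dedicated Riccati solver; the function names, the iteration count, and the numeric values are assumptions for illustration.

```python
import numpy as np

def lqr_riccati(A, B, Q, R, iters=500):
    """Iterate S = Q + A^T S (A - B L) starting from S = Q; return (S, L)."""
    S = Q.copy()
    for _ in range(iters):
        L = np.linalg.solve(R + B.T @ S @ B, B.T @ S @ A)
        S = Q + A.T @ S @ (A - B @ L)
    L = np.linalg.solve(R + B.T @ S @ B, B.T @ S @ A)
    return S, L

def kalman_riccati(A, C, W, V, iters=500):
    """Iterate the predicted covariance starting from P = W; return (P, K)."""
    P = W.copy()
    for _ in range(iters):
        K = P @ C.T @ np.linalg.inv(C @ P @ C.T + V)
        P = A @ (P - K @ C @ P) @ A.T + W
    K = P @ C.T @ np.linalg.inv(C @ P @ C.T + V)
    return P, K

# Assumed example values.
A = np.array([[1.0, 1.0], [0.0, 1.0]])
C = np.array([[1.0, 0.0]])
W = 0.01 * np.eye(2)
V = np.array([[0.1]])

# Kalman filter solved directly, and via LQR on the dual system.
P_kf, K_kf = kalman_riccati(A, C, W, V)
P_dual, L_dual = lqr_riccati(A.T, C.T, W, V)
```

Under this mapping `P_dual` matches `P_kf`, and the dual LQR gain satisfies `L_dual.T == A @ K_kf` up to rounding: the extra factor $A$ appears because $P$ here is the predicted covariance, so the dual of the LQR gain is the predictor-form Kalman gain $A K$.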

The Failure

The Paradox of Optimality

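LQR Robustness (SISO systems, classical continuous-time guarantees; discrete-time margins are generally smaller):

$\geq 60 \deg$ Phase Margin
Infinite gain margin (the loop gain can be increased without bound)
$6 dB$ gain reduction margin (the loop gain can be halved)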

Kalman Filter Robustness:

Dual robustness properties at sensor output
Excellent margins against sensor errors

The Fundamental Trade-Off

LQR's Need for High-Gain Feedback:

Large Q & Small R
Excellent stability margins

KF's Need for High-Gain Feedback:

Large W & Small V
Prompt response to new measurements

Optimizing for individual robustness leads to a fragile combined LQG system.

The Destructive Feedback Loop:

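High-gain L reacts aggressively to any deviation in the state estimate
High-gain K passes sensor noise and model mismatch straight into the state estimate
The resulting estimation error is fed through the controller into the plant and returns through the measurements
With an imperfect model, this loop can become unstable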

LQG provides no guaranteed stability margins when the model is imperfect, which motivated robust designs such as LQG/LTR and $H_{\infty}$ control.
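One way to probe this fragility is to perturb the plant input gain by a factor $g$ that the controller does not know about and watch the closed-loop spectral radius. The sketch below is an exploratory tool under stated assumptions, not a reproduction of any result from the slides: the plant, the weights, the list of gain factors, and the helper names are all hypothetical, and the steady-state gains come from a plain fixed-point Riccati iteration. The controller uses $u_{k} = - L {\hat{x}}_{k | k}$ with the estimator built on the nominal model.

```python
import numpy as np

def riccati_gain(A, B, Q, R, iters=2000):
    """Steady-state LQR gain and Riccati solution by fixed-point iteration."""
    S = Q.copy()
    for _ in range(iters):
        L = np.linalg.solve(R + B.T @ S @ B, B.T @ S @ A)
        S = Q + A.T @ S @ (A - B @ L)
    return np.linalg.solve(R + B.T @ S @ B, B.T @ S @ A), S

def lqg_spectral_radius(A, B, C, L, K, g=1.0):
    """Spectral radius of the loop with states (x_k, xhat_{k|k-1}).

    The true plant applies g * B u_k; the estimator assumes B u_k.
    """
    n = A.shape[0]
    I_KC = np.eye(n) - K @ C
    top = np.hstack([A - g * B @ L @ K @ C, -g * B @ L @ I_KC])
    bottom = np.hstack([(A - B @ L) @ K @ C, (A - B @ L) @ I_KC])
    return max(abs(np.linalg.eigvals(np.vstack([top, bottom]))))

# Assumed plant and deliberately aggressive weights (large Q, W; small R, V).
A = np.array([[1.0, 1.0], [0.0, 1.0]])
B = np.array([[0.0], [1.0]])
C = np.array([[1.0, 0.0]])
Q, R = 100.0 * np.eye(2), np.array([[1.0]])
W, V = 100.0 * np.eye(2), np.array([[1.0]])

L, _ = riccati_gain(A, B, Q, R)
_, P = riccati_gain(A.T, C.T, W, V)           # Kalman covariance via duality
K = P @ C.T @ np.linalg.inv(C @ P @ C.T + V)  # steady-state Kalman gain

rho_nominal = lqg_spectral_radius(A, B, C, L, K, g=1.0)
for g in (0.5, 0.8, 1.0, 1.2, 1.5, 2.0):
    rho = lqg_spectral_radius(A, B, C, L, K, g)
    print(f"g = {g:.1f}: spectral radius = {rho:.3f}")
```

A spectral radius below 1 means the loop is stable for that $g$. The nominal case $g = 1$ is stable by the separation principle; how far $g$ can move before stability is lost depends on the chosen weights, which is the quantity worth exploring by varying `Q`, `R`, `W`, and `V`.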
