
PHR Conic Augmented Lagrangian Method

Slide summary: Starting from the penalty method, we extend to the augmented Lagrangian method for improved stability. By introducing slack variables $s$, symmetric cone constraints are integrated, forming a unified framework for solving constrained optimization problems iteratively. Inspired by Dr. Zhepei Wang's lecture "Numerical Optimization for Robotics".

Introduction

Penalty Method

Consider the constrained optimization problem:

$$\min_x\; f(x) \quad \text{s.t.}\quad h(x) = 0$$

The penalty method solves a sequence of unconstrained problems:

$$Q_\rho(x) = f(x) + \frac{\rho}{2}\,\|h(x)\|^2$$
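As a concrete illustration, here is a minimal penalty-method sketch in Python. The toy problem ($\min\, x_1 + x_2$ s.t. $x_1^2 + x_2^2 = 2$, with solution $(-1,-1)$), the solver choice, and the schedule for $\rho$ are assumptions of this note, not the lecture's code.

```python
# Minimal penalty-method sketch on an assumed toy problem.
import numpy as np
from scipy.optimize import minimize

f = lambda x: x[0] + x[1]                          # objective
h = lambda x: np.array([x[0]**2 + x[1]**2 - 2.0])  # equality constraint h(x) = 0

def penalty_method(x0, rho=1.0, gamma=10.0, iters=8):
    x = np.asarray(x0, dtype=float)
    for _ in range(iters):
        Q = lambda z, r=rho: f(z) + 0.5 * r * np.sum(h(z) ** 2)
        x = minimize(Q, x, method="BFGS").x  # inner unconstrained solve
        rho *= gamma                         # rho must grow; conditioning degrades
    return x

print(penalty_method([0.0, 0.0]))  # approaches (-1, -1) only as rho grows
```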

Challenges:

  • Requires $\rho \to \infty$ for an exact solution, which causes an ill-conditioned Hessian of $Q_\rho$.
  • Any finite $\rho$ leads to constraint violation, $h(x) \neq 0$.

Lagrangian Relaxation

The Lagrangian is defined as:

$$\mathcal{L}(x, \lambda) = f(x) + \lambda^\top h(x)$$

At the optimal solution $x^\star$, there exists $\lambda^\star$ such that $\nabla_x \mathcal{L}(x^\star, \lambda^\star) = 0$.

Uzawa's method iteratively updates x and λ:

$$\begin{cases} x^{k+1} = \arg\min_x\; \mathcal{L}(x, \lambda^k) \\[4pt] \lambda^{k+1} = \lambda^k + \alpha_k\, h(x^{k+1}) \end{cases}$$

where $\alpha_k > 0$ is a step size.
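A minimal Uzawa sketch on the same assumed toy problem (the step size, initial multiplier, and solver are illustrative choices; the multiplier must stay in a region where the inner problem is bounded, per the caveats below):

```python
# Minimal Uzawa iteration sketch on an assumed toy problem.
import numpy as np
from scipy.optimize import minimize

f = lambda x: x[0] + x[1]
h = lambda x: np.array([x[0]**2 + x[1]**2 - 2.0])

def uzawa(x0, lam0=1.0, alpha=0.1, iters=100):
    x = np.asarray(x0, dtype=float)
    lam = np.array([lam0])
    for _ in range(iters):
        L = lambda z, l=lam: f(z) + l @ h(z)  # plain Lagrangian, lam fixed
        x = minimize(L, x, method="BFGS").x   # primal: argmin_x L(x, lam)
        lam = lam + alpha * h(x)              # dual ascent with step alpha
    return x, lam

print(uzawa([-1.0, -1.0]))  # lam approaches 0.5, x approaches (-1, -1)
```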

However, when $\lambda$ is fixed and one attempts to minimize $\mathcal{L}(x, \lambda)$:

  • $\min_x \mathcal{L}(x, \lambda)$, viewed as a function of $\lambda$, can be non-smooth even for smooth $f$ and $h$.
  • $\min_x \mathcal{L}(x, \lambda)$ may be unbounded below and have no finite solution.

Equality Constraint

PHR Augmented Lagrangian Method

Consider the optimization problem with a penalty on the deviation from a prior $\bar{\lambda}$:

$$\min_x \max_\lambda\; f(x) + \lambda^\top h(x) - \frac{1}{2\rho}\,\|\lambda - \bar{\lambda}\|^2$$

The inner maximization over $\lambda$ is concave, so setting its gradient to zero solves it:

$$0 = h(x) - \frac{1}{\rho}\left(\lambda - \bar{\lambda}\right) \;\Longrightarrow\; \lambda^\star(\bar{\lambda}) = \bar{\lambda} + \rho\, h(x)$$

Substituting $\lambda^\star(\bar{\lambda})$ into the outer problem:

$$\begin{aligned} \min_x \max_\lambda\; & f(x) + \lambda^\top h(x) - \frac{1}{2\rho}\|\lambda - \bar{\lambda}\|^2 \\ = \min_x\; & f(x) + \left[\lambda^\star(\bar{\lambda})\right]^\top h(x) - \frac{1}{2\rho}\left\|\lambda^\star(\bar{\lambda}) - \bar{\lambda}\right\|^2 \\ = \min_x\; & f(x) + \left[\bar{\lambda} + \rho\, h(x)\right]^\top h(x) - \frac{\rho}{2}\|h(x)\|^2 \\ = \min_x\; & f(x) + \bar{\lambda}^\top h(x) + \frac{\rho}{2}\|h(x)\|^2 \end{aligned}$$

PHR Augmented Lagrangian Method (cont.)

To increase precision:

  • Reduce the penalty weight $1/\rho$.
  • Update the prior multiplier $\bar{\lambda} \leftarrow \lambda^\star(\bar{\lambda})$.

Uzawa's method for the augmented Lagrangian function is:

  1. $x \leftarrow \arg\min_x\; f(x) + \bar{\lambda}^\top h(x) + \frac{\rho}{2}\|h(x)\|^2$
  2. $\bar{\lambda} \leftarrow \bar{\lambda} + \rho\, h(x)$
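A minimal sketch of this two-step iteration with a fixed, moderate $\rho$ (the toy problem and solver are again assumptions of this note); note that the constraint is driven to zero by the multiplier update alone:

```python
# Minimal PHR-ALM sketch for an equality constraint, rho held fixed.
import numpy as np
from scipy.optimize import minimize

f = lambda x: x[0] + x[1]
h = lambda x: np.array([x[0]**2 + x[1]**2 - 2.0])

def alm_equality(x0, rho=10.0, iters=10):
    x = np.asarray(x0, dtype=float)
    lam = np.zeros(1)  # prior multiplier, initialized at zero
    for _ in range(iters):
        L = lambda z, l=lam: f(z) + l @ h(z) + 0.5 * rho * np.sum(h(z) ** 2)
        x = minimize(L, x, method="BFGS").x  # step 1: primal minimization
        lam = lam + rho * h(x)               # step 2: multiplier update
    return x, lam

print(alm_equality([0.0, 0.0]))  # h(x) -> 0 without rho -> infinity
```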

Penalty Method Perspective

The corresponding primal problem of the augmented Lagrangian function is:

$$\min_x\; f(x) + \frac{\rho}{2}\|h(x)\|^2 \quad \text{s.t.}\quad h(x) = 0$$

Advantages:

  • Even without $\rho \to \infty$, the constraints can be exactly satisfied in the limit through multiplier updates.
  • For large $\rho$, the penalty term $\frac{\rho}{2}\|h(x)\|^2$ dominates, ensuring $\min_x \mathcal{L}_\rho(x, \lambda)$ has a local solution.
  • The augmented dual function $q_\rho(\lambda)$ is smooth under suitable conditions, with $\nabla q_\rho(\lambda) = h(x^\star(\lambda))$.

Practical PHR-ALM

In practice, we use its equivalent form:

$$\mathcal{L}_\rho(x, \lambda) = f(x) + \frac{\rho}{2}\left\|h(x) + \frac{\lambda}{\rho}\right\|^2 - \underbrace{\frac{1}{2\rho}\|\lambda\|^2}_{x\text{-independent}}$$

A KKT point can be computed via the iterations:

$$\begin{cases} x^{k+1} = \arg\min_x\; \mathcal{L}_{\rho^k}(x, \lambda^k) \\[4pt] \lambda^{k+1} = \lambda^k + \rho^k\, h(x^{k+1}) \\[4pt] \rho^{k+1} = \min\left[(1+\gamma)\rho^k,\; \rho_{\max}\right] \end{cases}$$

where $\{\rho^k\}$ can be any nondecreasing positive sequence.
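Putting the shifted form and the penalty schedule together, a sketch under the same toy-problem assumptions ($\gamma$ and $\rho_{\max}$ are illustrative tuning constants):

```python
# Practical PHR-ALM sketch: shifted penalty form plus a nondecreasing rho.
import numpy as np
from scipy.optimize import minimize

f = lambda x: x[0] + x[1]
h = lambda x: np.array([x[0]**2 + x[1]**2 - 2.0])

def phr_alm(x0, rho=1.0, gamma=1.0, rho_max=1e4, iters=15):
    x = np.asarray(x0, dtype=float)
    lam = np.zeros(1)
    for _ in range(iters):
        # L_rho = f + (rho/2)||h + lam/rho||^2, dropping the x-independent term
        L = lambda z, l=lam, r=rho: f(z) + 0.5 * r * np.sum((h(z) + l / r) ** 2)
        x = minimize(L, x, method="BFGS").x
        lam = lam + rho * h(x)                   # dual ascent
        rho = min((1.0 + gamma) * rho, rho_max)  # nondecreasing penalty weight
    return x, lam

print(phr_alm([0.0, 0.0]))
```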

Inequality Constraint

Slack Variables Relaxation

Consider the optimization problem with inequality constraints:

$$\min_x\; f(x) \quad \text{s.t.}\quad g(x) \le 0$$

We use the equivalent formulation with slack variables:

$$\min_{x,s}\; f(x) \quad \text{s.t.}\quad g(x) + [s]^2 = 0$$

where $[\cdot]^2$ denotes element-wise squaring.

We can directly form the augmented Lagrangian as in the equality-constrained case and eliminate $s$ in closed form:

$$\begin{aligned} & \min_{x,s}\left\{ f(x) + \frac{\rho}{2}\left\| g(x) + [s]^2 + \frac{\lambda}{\rho}\right\|^2 \right\} \\ =\; & \min_x\; f(x) + \min_s\; \frac{\rho}{2}\left\| g(x) + [s]^2 + \frac{\lambda}{\rho}\right\|^2 \\ =\; & \min_x\; f(x) + \frac{\rho}{2}\left\| \max\left[g(x) + \frac{\lambda}{\rho},\, 0\right]\right\|^2 \end{aligned}$$

The last step is component-wise: writing $w_i = g_i(x) + \lambda_i/\rho$, the slack $[s_i]^2 \ge 0$ can cancel $w_i$ exactly when $w_i \le 0$, while the best choice for $w_i > 0$ is $s_i = 0$, leaving $\max[w_i, 0]^2$.

Simplified Form

Writing $\mu$ for the inequality multiplier and summing over all components gives the final form:

$$\mathcal{L}_\rho(x, \mu) = f(x) + \frac{\rho}{2}\left\| \max\left[g(x) + \frac{\mu}{\rho},\, 0\right]\right\|^2 - \underbrace{\frac{1}{2\rho}\|\mu\|^2}_{x\text{-independent}}$$

For the dual update, from the optimality condition:

$$\mu^{k+1} = \mu^k + \rho\left(g(x^{k+1}) + [s^{k+1}]^2\right) = \max\left[\mu^k + \rho\, g(x^{k+1}),\, 0\right]$$
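A minimal inequality-constrained sketch (the toy problem $\min \|x\|^2$ s.t. $x_1 + x_2 \ge 1$, with solution $x = (0.5, 0.5)$ and $\mu = 1$, is an assumption of this note):

```python
# Minimal PHR-ALM sketch for an inequality constraint g(x) <= 0.
import numpy as np
from scipy.optimize import minimize

f = lambda x: x[0]**2 + x[1]**2
g = lambda x: np.array([1.0 - x[0] - x[1]])  # x1 + x2 >= 1 as g(x) <= 0

def alm_inequality(x0, rho=10.0, iters=10):
    x = np.asarray(x0, dtype=float)
    mu = np.zeros(1)
    for _ in range(iters):
        L = lambda z, m=mu, r=rho: f(z) + 0.5 * r * np.sum(
            np.maximum(g(z) + m / r, 0.0) ** 2)
        x = minimize(L, x, method="BFGS").x
        mu = np.maximum(mu + rho * g(x), 0.0)  # projected dual ascent
    return x, mu

print(alm_inequality([0.0, 0.0]))  # x -> (0.5, 0.5), mu -> 1
```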

Summary

PHR Augmented Lagrangian Method for the general nonconvex case:

$$\min_x\; f(x) \quad \text{s.t.}\quad h(x) = 0,\;\; g(x) \le 0$$

Its PHR Augmented Lagrangian is defined as

$$\mathcal{L}_\rho = f(x) + \frac{\rho}{2}\left\{\left\|h(x) + \frac{\lambda}{\rho}\right\|^2 + \left\|\max\left[g(x) + \frac{\mu}{\rho},\, 0\right]\right\|^2\right\} - \frac{1}{2\rho}\left\{\|\lambda\|^2 + \|\mu\|^2\right\}$$

The PHR-ALM simply repeats the primal descent and dual ascent iterations:

$$\begin{cases} x^{k+1} = \arg\min_x\; \mathcal{L}_{\rho^k}(x, \lambda^k, \mu^k) \\[4pt] \lambda^{k+1} = \lambda^k + \rho^k\, h(x^{k+1}) \\[4pt] \mu^{k+1} = \max\left[\mu^k + \rho^k\, g(x^{k+1}),\, 0\right] \\[4pt] \rho^{k+1} = \min\left[(1+\gamma)\rho^k,\; \rho_{\max}\right] \end{cases}$$

Courtesy: Z. Wang
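The full scheme as a sketch combining the pieces above (a toy problem with one equality and one active inequality constraint is assumed; its KKT solution is $x \approx (-1.323, -0.5)$, $\lambda \approx 0.378$, $\mu \approx 0.622$):

```python
# Full PHR-ALM sketch: equality h(x) = 0 plus inequality g(x) <= 0.
import numpy as np
from scipy.optimize import minimize

f = lambda x: x[0] + x[1]
h = lambda x: np.array([x[0]**2 + x[1]**2 - 2.0])  # circle constraint
g = lambda x: np.array([-0.5 - x[1]])              # x2 >= -0.5 as g(x) <= 0

def phr_alm(x0, rho=1.0, gamma=1.0, rho_max=1e4, iters=20):
    x = np.asarray(x0, dtype=float)
    lam, mu = np.zeros(1), np.zeros(1)
    for _ in range(iters):
        L = lambda z, l=lam, m=mu, r=rho: (
            f(z)
            + 0.5 * r * np.sum((h(z) + l / r) ** 2)
            + 0.5 * r * np.sum(np.maximum(g(z) + m / r, 0.0) ** 2))
        x = minimize(L, x, method="BFGS").x      # primal descent
        lam = lam + rho * h(x)                   # equality dual ascent
        mu = np.maximum(mu + rho * g(x), 0.0)    # projected dual ascent
        rho = min((1.0 + gamma) * rho, rho_max)  # penalty schedule
    return x, lam, mu

print(phr_alm([0.0, 0.0]))
```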

Symmetric Cone Constraint

Extension to Symmetric Cone Constraints

Consider the symmetric cone constrained optimization problem:

$$\min_x\; f(x) \quad \text{s.t.}\quad h(x) = 0,\;\; g(x) \succeq_{\mathcal{K}} 0$$

where $\mathcal{K}$ is a symmetric cone, e.g., the second-order cone or the positive semidefinite cone.

Generalized Inequality Constraint

For the symmetric cone constraint $x \in \mathcal{K}$, we can equivalently express it as:

$$g(x) = x \succeq_{\mathcal{K}} 0$$

The standard inequality constraint $g(x) \succeq 0$ corresponds to the nonnegative orthant cone:

$$\mathcal{K} = \mathbb{R}^n_+ = \left\{x \in \mathbb{R}^n : x_i \ge 0,\; i = 1, \dots, n\right\}$$

Its projection operator is exactly the element-wise max function:

$$\Pi_{\mathbb{R}^n_+}(v) = \max[v,\, 0], \qquad \left(\mathbb{R}^n_+\right)^* = \mathbb{R}^n_+$$
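Closed-form projections exist for the common symmetric cones; here are sketches for the nonnegative orthant and the second-order cone (the formulas are standard, the function names are this note's own):

```python
# Projection operators for two symmetric cones.
import numpy as np

def proj_orthant(v):
    """Project v onto the nonnegative orthant R^n_+ (element-wise max)."""
    return np.maximum(v, 0.0)

def proj_soc(v):
    """Project v = (t, u) onto the second-order cone {(t, u): ||u|| <= t}."""
    t, u = v[0], v[1:]
    nu = np.linalg.norm(u)
    if nu <= t:                    # already inside the cone
        return v.copy()
    if nu <= -t:                   # inside the polar cone: projects to zero
        return np.zeros_like(v)
    coef = 0.5 * (t + nu)          # otherwise: project onto the boundary
    return coef * np.concatenate(([1.0], u / nu))
```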

Slack Variables Relaxation

Consider the optimization problem with a symmetric cone constraint:

$$\min_x\; f(x) \quad \text{s.t.}\quad g(x) \succeq_{\mathcal{K}} 0$$

By Euclidean Jordan algebra, in which the squares $s \circ s$ ($\circ$ denoting the Jordan product) are exactly the elements of $\mathcal{K}$, the conic program is equivalent to

$$\min_{x,s}\; f(x) \quad \text{s.t.}\quad g(x) = s \circ s$$

We can directly form the augmented Lagrangian as in the equality-constrained case and eliminate $s$:

$$\begin{aligned} & \min_{x,s}\left\{f(x) + \frac{\rho}{2}\left\|g(x) - s \circ s + \frac{\lambda}{\rho}\right\|^2\right\} \\ =\; & \min_x\; f(x) + \min_s\; \frac{\rho}{2}\left\|g(x) - s \circ s + \frac{\lambda}{\rho}\right\|^2 \\ =\; & \min_x\; f(x) + \frac{\rho}{2}\left\|\Pi_{\mathcal{K}}\!\left(-g(x) - \frac{\lambda}{\rho}\right)\right\|^2 \end{aligned}$$
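The elimination of $s$ uses $\min_{w \in \mathcal{K}} \|v - w\|^2 = \|\Pi_{\mathcal{K}}(-v)\|^2$ for a self-dual cone (Moreau decomposition). Below is a quick numerical sanity check of this identity for the second-order cone; the helper names are this note's own, and BFGS is used to search over $s$, so it may in general need restarts:

```python
# Sanity check: min_s ||v - s∘s||^2 equals ||Pi_K(-v)||^2 for the SOC.
import numpy as np
from scipy.optimize import minimize

def jordan_square(s):
    """SOC Jordan-algebra square: s∘s = (||s||^2, 2*s0*s_bar)."""
    return np.concatenate(([s @ s], 2.0 * s[0] * s[1:]))

def proj_soc(v):
    t, u = v[0], v[1:]
    nu = np.linalg.norm(u)
    if nu <= t:
        return v.copy()
    if nu <= -t:
        return np.zeros_like(v)
    return 0.5 * (t + nu) * np.concatenate(([1.0], u / nu))

v = np.array([-0.3, 1.2, -0.7])                      # plays the role of g + lam/rho
obj = lambda s: np.sum((v - jordan_square(s)) ** 2)  # squared distance to squares
best = minimize(obj, np.ones(3), method="BFGS").fun
print(best, np.sum(proj_soc(-v) ** 2))               # the two values agree
```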

Simplified Form

Letting $\mu = -\lambda$, we get the final form:

$$\mathcal{L}_\rho(x, \mu) = f(x) + \frac{\rho}{2}\left\|\Pi_{\mathcal{K}}\!\left(\frac{\mu}{\rho} - g(x)\right)\right\|^2 - \underbrace{\frac{1}{2\rho}\|\mu\|^2}_{x\text{-independent}}$$

For the dual update, from the optimality condition:

$$\mu^{k+1} = \mu^k - \rho\left[g(x^{k+1}) - s^{k+1} \circ s^{k+1}\right] = \Pi_{\mathcal{K}}\left[\mu^k - \rho\, g(x^{k+1})\right]$$

where the update keeps $\mu$ in $\mathcal{K}^*$, the dual cone of $\mathcal{K}$; a symmetric cone is self-dual, so $\mathcal{K}^* = \mathcal{K}$.

Summary

PHR Augmented Lagrangian Method for the general nonconvex case:

$$\min_x\; f(x) \quad \text{s.t.}\quad h(x) = 0,\;\; g(x) \succeq_{\mathcal{K}} 0$$

Its PHR Augmented Lagrangian is defined as

$$\mathcal{L}_\rho = f(x) + \frac{\rho}{2}\left\{\left\|h(x) + \frac{\lambda}{\rho}\right\|^2 + \left\|\Pi_{\mathcal{K}}\!\left(\frac{\mu}{\rho} - g(x)\right)\right\|^2\right\} - \frac{1}{2\rho}\left\{\|\lambda\|^2 + \|\mu\|^2\right\}$$

The PHR-ALM simply repeats the primal descent and dual ascent iterations:

$$\begin{cases} x^{k+1} = \arg\min_x\; \mathcal{L}_{\rho^k}(x, \lambda^k, \mu^k) \\[4pt] \lambda^{k+1} = \lambda^k + \rho^k\, h(x^{k+1}) \\[4pt] \mu^{k+1} = \Pi_{\mathcal{K}}\left(\mu^k - \rho^k\, g(x^{k+1})\right) \\[4pt] \rho^{k+1} = \min\left[(1+\gamma)\rho^k,\; \rho_{\max}\right] \end{cases}$$
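To close the loop, a minimal conic PHR-ALM sketch for the second-order cone (the toy problem $\min \|x - p\|^2$ s.t. $x \succeq_{\mathcal{K}} 0$ is an assumption of this note; its solution is simply $\Pi_{\mathcal{K}}(p)$, which gives a direct check):

```python
# Minimal conic PHR-ALM sketch: min ||x - p||^2 s.t. x in the SOC.
import numpy as np
from scipy.optimize import minimize

def proj_soc(v):
    t, u = v[0], v[1:]
    nu = np.linalg.norm(u)
    if nu <= t:
        return v.copy()
    if nu <= -t:
        return np.zeros_like(v)
    return 0.5 * (t + nu) * np.concatenate(([1.0], u / nu))

p = np.array([-0.5, 2.0, 1.0])  # a point outside the cone
f = lambda x: np.sum((x - p) ** 2)
g = lambda x: x                  # conic constraint g(x) = x in K

def conic_phr_alm(x0, rho=1.0, gamma=1.0, rho_max=1e4, iters=20):
    x = np.asarray(x0, dtype=float)
    mu = np.zeros_like(x)
    for _ in range(iters):
        L = lambda z, m=mu, r=rho: f(z) + 0.5 * r * np.sum(
            proj_soc(m / r - g(z)) ** 2)
        x = minimize(L, x, method="BFGS").x
        mu = proj_soc(mu - rho * g(x))           # projected dual ascent
        rho = min((1.0 + gamma) * rho, rho_max)  # penalty schedule
    return x

print(conic_phr_alm(np.zeros(3)))  # should match the direct projection:
print(proj_soc(p))
```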