Detailed derivation of On-Manifold IMU Preintegration

C. Forster, L. Carlone, F. Dellaert, and D. Scaramuzza, “On-Manifold Preintegration for Real-Time Visual--Inertial Odometry,” IEEE Trans. Robot., vol. 33, no. 1, pp. 1–21, Feb. 2017, doi: 10.1109/TRO.2016.2597321.
Z. Yang and S. Shen, “Monocular Visual–Inertial State Estimation With Online Initialization and Camera–IMU Extrinsic Calibration,” IEEE Trans. Automat. Sci. Eng., vol. 14, no. 1, pp. 39–51, Jan. 2017, doi: 10.1109/TASE.2016.2550621.

Inertial Measurement Unit (IMU) preintegration is a fundamental technique in visual-inertial odometry that efficiently combines high-frequency IMU measurements between keyframes. This approach, pioneered by Forster et al. and Yang et al., formulates the integration process on the manifold of rigid body motions $S E (3)$ , addressing the nonlinear nature of 3D rotations through Lie group theory.

The key innovation lies in separating the integration of IMU measurements from the global state, enabling computationally efficient optimization by precomputing relative motion constraints. This derivation details the mathematical foundations, including the special orthogonal group $S O (3)$ , perturbation models, uncertainty representation on manifolds, and the complete preintegration theory that handles sensor biases and noise characteristics while maintaining real-time performance. The resulting preintegrated terms serve as constraints in factor graph optimization frameworks for robust state estimation.

Special Orthology Group $S O (3)$

The special orthology group $S O (3)$ is defined as the set of all $R^{3 \times 3}$ rotation matrix:

S O (3) = {R \in R^{3 \times 3} : R^{T} R = I, det (R) = 1}

And its tagent space on the manifold $so (3)$ , consist of skew-symmetric matrices:

\begin{matrix} (1) & ω^{\land} = {[\begin{matrix} ω_{1} \\ ω_{2} \\ ω_{3} \end{matrix}]}^{\land} = [\begin{matrix} 0 & - ω_{3} & ω_{2} \\ ω_{3} & 0 & - ω_{1} \\ - ω_{2} & ω_{1} & 0 \end{matrix}] \in so (3) \end{matrix}

The hat operator $(\cdot)^{\land}$ maps a vector $ω \in R^{3}$ to $so (3)$ , while the vee operator $(\cdot)^{\lor}$ performs the inverse mapping.

Rodrigues' Rotation Formula

Let $v \in R^{3}$ be a 3D vector to be rotated by a unit vector axis $n = (\begin{matrix} n_{x} \\ n_{y} \\ n_{z} \end{matrix})$ by an angle $θ$ . The resulting rotated vector is denoted $v_{rot}$ .

$v$ can be decomposed to two orthogonal components: the parallel to $n$ part $v_{∥}$ and the perpendicular to $n$ part $v_{⊥}$ . Which satisfy,

{\begin{cases} v_{∥} = (v \cdot n) n \\ v_{⊥} = v - (v \cdot n) n = - n \times (n \times v) \end{cases}

The whole vector $v$ is then rotated by:

\begin{aligned} v_{rot} & = v_{∥ rot} + v_{⊥ rot} \\ = v_{∥} + \cos θ v_{⊥} + \sin θ n \times v_{⊥} \\ = (v \cdot n) n + \cos θ v_{⊥} + \sin θ n \times v \\ = \cos θ v + (1 - \cos θ) (n \cdot v) n + \sin θ n \times v \end{aligned}

We get Rodrigues' Rotation Formula. Write it on the manifold, we then have:

R = \cos (| | ϕ | |) I + [1 - \cos (| | ϕ | |)] \frac{ϕ ϕ^{T}}{| | ϕ | |^{2}} + \sin (| | ϕ | |) ϕ^{\land}

The exponential map $\exp : so (3) \to S O (3)$ converts a skew-symmetric matrix into a rotation matrix via the Rodrigues rotation formula:

\begin{matrix} (3) & R = \exp (ϕ^{\land}) = I + \frac{\sin ∥ ϕ ∥}{∥ ϕ ∥} ϕ^{\land} + \frac{1 - \cos ∥ ϕ ∥}{∥ ϕ ∥^{2}} (ϕ^{\land})^{2} \end{matrix}

For small angles, this simplifies to $R \approx I + ϕ^{\land}$ .

Exponential Mapping

\begin{aligned} R & = \exp (ϕ^{\land}) \\ = \sum_{n = 0}^{\infty} \frac{(ϕ^{\land})^{n}}{n!} \\ = I + (| | ϕ | | - \frac{| | ϕ | |^{3}}{3!} + \frac{| | ϕ | |^{5}}{5!} + \dots) ϕ^{\land} + (\frac{| | ϕ | |^{2}}{2!} - \frac{| | ϕ | |^{4}}{4!} + \frac{| | ϕ | |^{6}}{6!} - \dots) (ϕ^{\land})^{2} \\ = I + \frac{\sin | | ϕ | |}{| | ϕ | |} ϕ^{\land} + \frac{1 - \cos | | ϕ | |}{| | ϕ | |^{2}} (ϕ^{\land})^{2} \end{aligned}

The logarithmic map $ \log : SO(3) \to \mathfrak{so}(3) $ extracts the axis-angle representation from a rotation matrix:

\begin{matrix} (5) & ϕ = \log (R)^{\lor} = \frac{∥ ϕ ∥}{2 \sin ∥ ϕ ∥} [\begin{matrix} r_{32} - r_{23} \\ r_{13} - r_{31} \\ r_{21} - r_{12} \end{matrix}] \end{matrix}

Logarithmic Mapping

R = I + \frac{\sin | | ϕ | |}{| | ϕ | |} ϕ^{\land} + \frac{1 - \cos | | ϕ | |}{| | ϕ | |^{2}} (ϕ^{\land})^{2}

So the trace of the rotation matrix satisfied:

\begin{aligned} t r (R) & = t r (I) + \frac{\sin | | ϕ | |}{| | ϕ | |} t r (ϕ^{\land}) + \frac{1 - \cos | | ϕ | |}{| | ϕ | |^{2}} t r [(ϕ^{\land})^{2}] \\ = 3 + 0 + \frac{1 - \cos | | ϕ | |}{| | ϕ | |^{2}} (- | | ϕ^{\land} | |^{2}) \\ = 3 + 0 + \frac{1 - \cos | | ϕ | |}{| | ϕ | |^{2}} (- 2 | | ϕ | |^{2}) \\ = 3 + 2 \cos | | ϕ | | - 2 \end{aligned}

The angle $θ$ is calculated by

| | ϕ | | = \arccos (\frac{t r (R) - 1}{2}) + 2 k π

When $ϕ \neq 0 \Leftrightarrow R \neq I$ , we construct a skew part and a non-skew part.

R = \underset{\frac{1}{2} (R - R^{T})}{\underset{⏟}{\frac{\sin | | ϕ | |}{| | ϕ | |} ϕ^{\land}}} + \underset{\frac{1}{2} (R + R^{T})}{\underset{⏟}{I + \frac{1 - \cos | | ϕ | |}{| | ϕ | |^{2}} (ϕ^{\land})^{2}}}

\begin{matrix} (5) & \begin{aligned} ϕ & = \log (R)^{\lor} \\ = [\frac{| | ϕ | | (R - R^{T})}{2 \sin | | ϕ | |}]^{\lor} \\ = \frac{| | ϕ | |}{2 \sin | | ϕ | |} [\begin{array}{c} r_{32} - r_{23} \\ r_{13} - r_{31} \\ r_{21} - r_{12} \end{array}] \end{aligned} \end{matrix}

For simplicity of notation, $Exp$ and $Log$ are defined as mappings between vector space $R^{3}$ and Lie Group $S O (3)$ , while $\exp$ and $\log$ operate between Lie Algebra $so (3)$ and $S O (3)$

Perturbation Models and Jacobians

For small perturbations $δ ϕ$ of exponential and logarithm, we use first order approximation:

\begin{matrix} (7,9) & {\begin{cases} Exp (ϕ + δ ϕ) \approx Exp (ϕ) Exp (J_{r} (ϕ) δ ϕ) \\ Log (Exp (ϕ) Exp (δ ϕ)) \approx ϕ + J_{r}^{- 1} (ϕ) δ ϕ \end{cases} \end{matrix}

The right jacobian and its inverse are given by

\begin{matrix} (8) & \begin{aligned} J_{r} (ϕ) & = I - \frac{1 - \cos (| | ϕ | |)}{| | ϕ | |^{2}} ϕ^{\land} + \frac{| | ϕ | | - \sin (| | ϕ | |)}{| | ϕ | |^{3}} (ϕ^{\land})^{2} \\ J_{r}^{- 1} (ϕ) & = I + \frac{1}{2} ϕ^{\land} + (\frac{1}{| | ϕ | |^{2}} + \frac{1 + \cos (| | ϕ | |)}{2 | | ϕ | | \sin (| | ϕ | |)}) (ϕ^{\land})^{2} \end{aligned} \end{matrix}

Perturbation Jacobians

Since a general increment cannot be defined on the special orthogonal group $R_{1} + R_{2} \notin S O (3), R_{1}, R_{2} \in S O (3)$ , we use perturbation models defined above.

\begin{matrix} (7) & Exp (ϕ + Δ ϕ) = Exp (J_{l} Δ ϕ) Exp (ϕ) = Exp (ϕ) = Exp (J_{r} Δ ϕ) \end{matrix}

The Lie bracket (binary operator on Lie groups) is defined as:

[A, B] = A B - B A

Using the Baker-Campbell-Hausdorff (BCH) formula:

\begin{matrix} (9) & \begin{aligned} \log (Exp (α) Exp (β)) \\ = & \log (A B) \\ = & \sum_{n = 1}^{\infty} \frac{(- 1)^{n - 1}}{n} \sum_{r_{i} + s_{i} > 0, i \in [1, n]} \frac{(\sum_{i = 1}^{n} (r_{i} + s_{i}))^{- 1}}{Π_{i = 1}^{n} (r_{i}! s_{i}!)} [A^{r_{1}} B^{s_{1}} A^{r_{2}} B^{s_{2}} \dots A^{r_{n}} B^{s_{n}}] \\ = & A + B + \frac{1}{2} [A, B] + \frac{1}{12} [A, [A, B]] - \frac{1}{12} [B, [A, B]] + \dots \\ \approx & {\begin{cases} β + J_{l} (β)^{- 1} α, & α \to 0 \\ α + J_{r} (α)^{- 1} β, & β \to 0 \end{cases} \end{aligned} \end{matrix}

Additional Jacobians:

\begin{aligned} J_{l} (ϕ) & = I + \frac{1 - \cos (| | ϕ | |)}{| | ϕ | |^{2}} ϕ^{\land} + \frac{| | ϕ | | - \sin (| | ϕ | |)}{| | ϕ | |^{3}} (ϕ^{\land})^{2} \\ J_{r} (ϕ) & = I - \frac{1 - \cos (| | ϕ | |)}{| | ϕ | |^{2}} ϕ^{\land} + \frac{| | ϕ | | - \sin (| | ϕ | |)}{| | ϕ | |^{3}} (ϕ^{\land})^{2} \\ J_{l}^{- 1} (ϕ) & = I - \frac{1}{2} ϕ^{\land} + (\frac{1}{| | ϕ | |^{2}} + \frac{1 + \cos (| | ϕ | |)}{2 | | ϕ | | \sin (| | ϕ | |)}) (ϕ^{\land})^{2} \\ J_{r}^{- 1} (ϕ) & = I + \frac{1}{2} ϕ^{\land} + (\frac{1}{| | ϕ | |^{2}} + \frac{1 + \cos (| | ϕ | |)}{2 | | ϕ | | \sin (| | ϕ | |)}) (ϕ^{\land})^{2} \end{aligned}

For any vector $v \in R^{3}$ , using the properties of the cross product and the special orthogonal group, we have:

(R p)^{\land} v = (R p) \times v = (R p) \times (R R^{- 1} v) = R [p \times (R^{- 1} v)] = R p^{\land} R^{T} v

Since $R p^{\land} R^{T} = (R p)^{\land}$ holds for each term in the Taylor expansion of the exponential map, it follows that:

R \exp (ϕ^{\land}) R^{T} = \exp ((R ϕ)^{\land})

Equivalently:

\begin{matrix} (10) & R Exp (ϕ) R^{T} = Exp (R ϕ) \end{matrix}

Uncertainty Description in $S O (3)$

An intuitive way to define uncertainty on rotation matrices is to right-multiply the matrix by a small perturbation that follows a normal distribution:

\begin{matrix} (12) & \tilde{R} = R \cdot Exp (ϵ), ϵ \in N (0, Σ) \end{matrix}

For Gaussian distributions, we have the normalization condition:

\begin{matrix} (13) & \int_{R^{3}} p (ϵ) d ϵ = \int_{R^{3}} \frac{1}{\sqrt{(2 π)^{3} det (Σ)}} \cdot \exp (- \frac{1}{2} | | ϵ | |_{Σ}^{2}) d ϵ = 1 \end{matrix}

Substituting $ϵ = Log (R^{- 1} \tilde{R})$ , we obtain

\int_{S O (3)} \frac{1}{\sqrt{(2 π)^{3} det (Σ)}} \cdot \exp (- \frac{1}{2} | | Log (R^{- 1} \tilde{R}) | |_{Σ}^{2}) | \frac{d ϵ}{d \tilde{R}} | d \tilde{R} = 1

The scaling factor, known as the Jacobian determinant, is given by the right-perturbation model:

| \frac{d ϵ}{d \tilde{R}} | = | \frac{1}{J_{r} (Log (R^{- 1} \tilde{R}))} |

Rewriting gives:

\begin{matrix} (14) & \int_{S O (3)} \frac{1}{\sqrt{(2 π)^{3} det (Σ)}} \cdot | \frac{1}{J_{r} (Log (R^{- 1} \tilde{R}))} | \cdot \exp (- \frac{1}{2} | | Log (R^{- 1} \tilde{R}) | |_{Σ}^{2}) d \tilde{R} = 1 \end{matrix}

From this, the probability density function on $S O (3)$ is:

\begin{matrix} (15) & p (\tilde{R}) = \frac{1}{\sqrt{(2 π)^{3} det (Σ)}} \cdot | \frac{1}{J_{r} (Log (R^{- 1} \tilde{R}))} | \cdot \exp (- \frac{1}{2} | | Log (R^{- 1} \tilde{R}) | |_{Σ}^{2}) \end{matrix}

For small perturbations, the normalization term $\frac{1}{\sqrt{(2 π)^{3} det (Σ)}} \cdot | \frac{1}{J_{r} (Log (R^{- 1} \tilde{R}))} |$ can be approximated as constant, leading to the following expression for the negative log-likelihood:

\begin{matrix} (16) & \begin{aligned} L (R) & = \frac{1}{2} | | Log (R^{- 1} \tilde{R}) | |_{Σ}^{2} + c \\ = \frac{1}{2} | | Log ({\tilde{R}}^{- 1} R) | |_{Σ}^{2} + c \end{aligned} \end{matrix}

Gauss-Newton Method on Manifolds

For standard Gauss-Newton optimization:

x^{*} = \arg min_{x} f (x) \Rightarrow x^{*} = x + \arg min_{Δ x} f (x + Δ x)

On manifolds, this becomes:

\begin{matrix} (18) & x^{*} = \arg min_{x \in M} f (x) \Rightarrow x^{*} = R_{x} \cdot \arg min_{δ x \in R^{n}} f (R_{x} (δ x)) \end{matrix}

Where $R_{x} (\cdot)$ is a retraction mapping from the tangent space to the manifold.

In the case of the $S O (3)$ group, the retraction is defined as:

\begin{matrix} (20) & R_{R} (δ ϕ) = R \cdot Exp (δ ϕ), δ ϕ \in R^{3} \end{matrix}

For the $S E (3)$ group, it is:

\begin{matrix} (21) & R_{T} (δ ϕ, δ p) = [\begin{matrix} R \cdot Exp (δ ϕ) & p + R \cdot δ p \end{matrix}], [\begin{matrix} δ ϕ \\ δ p \end{matrix}] \in R^{6} \end{matrix}

IMU Preintegration

The state of the system at time $k$ is represented by the IMU's orientation, position, velocity, and sensor biases

\begin{matrix} (22) & x_{k} = [R_{w b_{k}} (q_{w b_{k}}), p_{w b_{k}}, v_{k}^{w}, b_{g}^{b_{k}}, b_{a}^{b_{k}}] \end{matrix}

Let ${\hat{a}}^{b} (t)$ and ${\hat{ω}}^{b} (t)$ denote the measurements from the three-axis accelerometer and gyroscope, respectively. These are corrupted by noise and time-varying biases:

\begin{matrix} (27, 28) & \begin{aligned} {\hat{ω}}^{b} (t) & = ω^{b} (t) + b_{g}^{b} (t) + n_{g}^{b} (t) \\ {\hat{a}}^{b} (t) & = R_{b w} (t) [a^{w} (t) + g^{w}] + b_{a}^{b} (t) + n_{a}^{b} (t) \end{aligned} \end{matrix}

In this notation, the superscript $w$ refers to the world (inertial) frame, while $b$ denotes the body (sensor) frame. The subscripts $a$ and $g$ refer to the accelerometer and gyroscope, respectively.

The time dirivatives of $R, p, v$ are given as:

\begin{matrix} (29) & \begin{aligned} {\dot{p}}_{w b} (t) & = v^{w} (t) \\ {\dot{v}}^{w} (t) & = a^{w} (t) \\ {\dot{R}}_{w b} (t) & = R_{w b} (t) \cdot Exp [ω^{b} (t)] \\ {\dot{q}}_{w b} (t) & = q_{w b} (t) \otimes [\begin{array}{c} 0 \\ \frac{1}{2} ω^{b} (t) \end{array}] \end{aligned} \end{matrix}

Using these dynamics, we can express the system state at time $t + Δ t$ as follows:

\begin{aligned} p_{w b} (t + Δ t) & = p_{w b} (t) + v^{w} (t) \cdot Δ t + \iint_{t}^{t + Δ t} a^{w} (τ) d τ^{2} \\ = p_{w b} (t) + v^{w} (t) \cdot Δ t + \iint_{t}^{t + Δ t} [R_{w b} (τ) ({\hat{a}}^{b} (τ) - b_{a}^{b} (τ) - n_{a}^{b} (τ)) - g^{w}] d τ^{2} \\ = p_{w b} (t) + v^{w} (t) \cdot Δ t - \frac{1}{2} g^{w} \cdot (Δ t)^{2} + \iint_{t}^{t + Δ t} R_{w b} (τ) [{\hat{a}}^{b} (τ) - b_{a}^{b} (τ) - n_{a}^{b} (τ)] d τ^{2} \\ v^{w} (t + Δ t) & = v^{w} (t) + \int_{t}^{t + Δ t} a^{w} (τ) d τ \\ = v^{w} (t) + \int_{t}^{t + Δ t} [R_{w b} (τ) ({\hat{a}}^{b} (τ) - b_{a}^{b} (τ) - n_{a}^{b} (τ)) - g^{w}] d τ^{2} \\ = v^{w} (t) - g^{w} \cdot Δ t + \int_{t}^{t + Δ t} R_{w b} (τ) [{\hat{a}}^{b} (τ) - b_{a}^{b} (τ) - n_{a}^{b} (τ)] d τ^{2} \\ R_{w b} (t + Δ t) & = \int_{t}^{t + Δ t} R_{w b} (τ) \cdot Exp (ω^{b} (τ)) d τ \\ = \int_{t}^{t + Δ t} R_{w b} (τ) \cdot Exp ({\hat{ω}}^{b} (τ) - b_{g}^{b} (τ) - n_{g}^{b} (τ)) d τ \\ q_{w b} (t + Δ t) & = \int_{t}^{t + Δ t} q_{w b} (τ) \otimes [\begin{array}{c} 0 \\ \frac{1}{2} ω^{b} (t) \end{array}] d τ \\ = \int_{t}^{t + Δ t} q_{w b} (τ) \otimes [\begin{array}{c} 0 \\ \frac{1}{2} ({\hat{ω}}^{b} (τ) - b_{g}^{b} (τ) - n_{g}^{b} (τ)) \end{array}] d τ \end{aligned}

However, recomputing $R_{w b} (τ)$ at each step leads to repeated integration and unnecessary computational cost. To mitigate this, we use the identity:

R_{w b} (τ) = R_{w b_{t}} \cdot R_{b_{t} b} (τ)

This allows us to factor out $R_{w b_{i}}$ from the integrals over the interval $[t_{i}, t_{j}]$ , resulting in:

\begin{aligned} R_{w b_{j}} & = R_{w b_{i}} \cdot \int_{t_{i}}^{t_{j}} R_{b_{i} b} (τ) \cdot Exp ({\hat{ω}}^{b} (τ) - b_{g}^{b} (τ) - n_{g}^{b} (τ)) d τ \\ q_{w b_{j}} & = q_{w b_{i}} \otimes \int_{t_{i}}^{t_{j}} q_{b_{i} b} (τ) \otimes [\begin{array}{c} 0 \\ \frac{1}{2} ({\hat{ω}}^{b} (τ) - b_{g}^{b} (τ) - n_{g}^{b} (τ)) \end{array}] d τ \\ p_{w b_{j}} & = p_{w b_{i}} + v_{i}^{w} \cdot Δ t - \frac{1}{2} g^{w} \cdot (Δ t)^{2} + R_{w b_{i}} \iint_{t_{i}}^{t_{j}} R_{b_{i} b} (τ) [{\hat{a}}^{b} (τ) - b_{a}^{b} (τ) - n_{a}^{b} (τ)] d τ^{2} \\ v_{j}^{w} & = v_{i}^{w} - g^{w} \cdot Δ t + R_{w b_{i}} \int_{t_{i}}^{t_{j}} R_{b_{i} b} (τ) [{\hat{a}}^{b} (τ) - b_{a}^{b} (τ) - n_{a}^{b} (τ)] d τ \end{aligned}

We define the following preintegrated terms:

\begin{aligned} α_{b_{i} b_{j}} & = \iint_{t_{i}}^{t_{j}} R_{b_{i} b} (τ) [{\hat{a}}^{b} (τ) - b_{a}^{b} (τ) - n_{a}^{b} (τ)] d τ^{2} \\ β_{b_{i} b_{j}} & = \int_{t_{i}}^{t_{j}} R_{b_{i} b} (τ) [{\hat{a}}^{b} (τ) - b_{a}^{b} (τ) - n_{a}^{b} (τ)] d τ \\ γ_{b_{i} b_{j}} & = \int_{t_{i}}^{t_{j}} R_{b_{i} b} (τ) \cdot Exp ({\hat{ω}}^{b} (τ) - b_{g}^{b} (τ) - n_{g}^{b} (τ)) d τ \end{aligned}

Thus, the final update equations become:

\begin{aligned} p_{w b_{j}} & = p_{w b_{i}} + v_{i}^{w} Δ t - \frac{1}{2} g^{w} Δ t^{2} + R_{w b_{i}} α_{b_{i} b_{j}} \\ v_{j}^{w} & = v_{i}^{w} - g^{w} Δ t + R_{w b_{i}} β_{b_{i} b_{j}} \\ R_{w b_{j}} & = R_{w b_{i}} \cdot γ_{b_{i} b_{j}} \end{aligned}

Finally, since the gyroscope and accelerometer biases are modeled as Gaussian white noise processes with zero mean:

\dot{b} \sim N (0, Σ)

we assume that the bias remains approximately constant between two consecutive time steps:

b_{g}^{b_{i}} = b_{g}^{b_{j}}, b_{a}^{b_{i}} = b_{a}^{b_{j}}, \forall i, j

Detailed derivation of On-Manifold IMU Preintegration ​

Special Orthology Group SO(3) ​

Perturbation Models and Jacobians ​

Uncertainty Description in SO(3) ​