Control: PID, LQR & Geometric Control

Chapter 0: The Hovering Problem

Imagine your quadrotor is hovering 2 meters above the ground. A gust of wind pushes it 0.5 m to the left. The state estimator — doing its Kalman filter job — reports: "position error: −0.5 m in x." Now what? Someone has to translate that error number into four rotor speeds. Get it wrong and the drone either drifts away indefinitely, oscillates like a pendulum until it crashes, or snaps so aggressively it flips.

This is the control problem. It has nothing to do with estimation (knowing where you are) or planning (knowing where to go). Control is the bridge: given an error signal, compute actuator commands that drive the error to zero, quickly and smoothly.

The three questions every controller must answer. (1) How hard do I push back? (2) How do I handle errors that persist for seconds? (3) How do I avoid overshoot and oscillation? The three terms of a PID controller answer exactly these three questions — P, I, and D respectively.

Why naive solutions fail

The simplest idea: set rotor thrust proportional to position error. If you're 0.5 m too low, add thrust. This is the P (proportional) term — and it works, sort of. But a constant P gain causes the system to oscillate around the setpoint: thrust too much, overshoot, reverse, undershoot, repeat. You need damping.

Add damping via the velocity: reduce thrust as you approach the target. Now it settles, but there's a new problem: a small constant disturbance (like a steady breeze) creates a permanent position offset. The P term only pushes back proportional to error; a small steady error produces a small steady counterforce that exactly balances the disturbance — but never eliminates it. You need memory.

The integral term accumulates past errors. Over time, the integral grows until the integral action is strong enough to overcome the steady disturbance. Now you have P (stiffness) + I (memory) + D (damping) = a PID controller. Each term exists to solve a specific failure mode of the others.
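
The three failure modes described above can be reproduced in a few lines. This is a minimal sketch, not a quadrotor model: it assumes a unit mass on one axis, a constant wind of −1 N, and the illustrative gains K_P=4, K_D=2, K_I=1. The function name `simulate` and all numbers are assumptions made for the example.

```python
import numpy as np

def simulate(Kp, Ki, Kd, wind=-1.0, T=40.0, dt=1e-3):
    """Unit mass on one axis, x'' = u + wind, holding setpoint r = 0."""
    x, v, integ = 0.0, 0.0, 0.0
    xs = np.empty(int(round(T / dt)))
    for i in range(xs.size):
        e = 0.0 - x                          # error = setpoint - position
        integ += e * dt                      # I: running integral of the error
        u = Kp * e + Ki * integ - Kd * v     # D uses de/dt = -v (fixed setpoint)
        v += (u + wind) * dt                 # semi-implicit Euler step
        x += v * dt
        xs[i] = x
    return xs

runs = {
    "P only": simulate(4.0, 0.0, 0.0),
    "PD": simulate(4.0, 0.0, 2.0),
    "PID": simulate(4.0, 1.0, 2.0),
}
for name, xs in runs.items():
    tail = xs[-10000:]                       # last 10 s of the run
    swing = tail.max() - tail.min()
    print(f"{name:7s} final x = {xs[-1]:+.3f} m, swing over last 10 s = {swing:.3f} m")
```

Under these assumptions the P-only run keeps swinging, the PD run settles at the offset wind/K_P = −0.25 m, and the PID run returns to the setpoint.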

Open-loop vs. closed-loop: watch the drift

A disturbance (wind gust) hits at t=2 s. Open-loop has no feedback — the drone drifts away. Closed-loop reads the error and corrects. Toggle between modes and vary the disturbance magnitude.

Slider default: disturbance = 1.0

A quadrotor hovers with a constant proportional (P) controller. A steady 1 N wind force pushes it left. What happens in the long run?

The drone returns exactly to the setpoint because P is always pushing back
The drone oscillates indefinitely because P creates overshoot
The drone settles at a permanent offset where K_P·e_steady exactly balances the wind force
The drone accelerates away because the proportional term adds to the wind

Chapter 1: State-Space & Stability

Before designing controllers, we need a precise language for describing how systems evolve. The language is state-space form. Rather than writing out Newton's equations from scratch each time, we pack everything into two compact equations:

ẋ = f(x, u) (dynamics)
y = g(x, u) (output)

Here x ∈ ℝⁿ is the state vector — everything you need to predict the future. u ∈ ℝ^m is the control input — the actuator commands. y is the output you observe. The dot notation ẋ means dx/dt — the time derivative of the state.

Quadrotor state vector

For a quadrotor, the state is: position p ∈ ℝ³, velocity v ∈ ℝ³, rotation R ∈ SO(3), angular velocity ω ∈ ℝ³. We can list this as x = [p, v, R, ω]. The full state has 12 real-valued degrees of freedom (3+3+3+3, since SO(3) is a 3-DOF manifold even though R is a 3×3 matrix). The control inputs are the four rotor speeds w = [w₁, w₂, w₃, w₄].

Linear state-space: ẋ = Ax + Bu

When f is linear in x and u, the dynamics simplify to:

ẋ = Ax + Bu

where A ∈ ℝ^{n×n} is the system matrix (captures how the state drives its own derivative) and B ∈ ℝ^{n×m} is the input matrix (captures how control inputs affect the derivative). This linear form is rarely exact, but near an equilibrium point any smooth nonlinear system can be approximated by it via a first-order Taylor expansion.
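
As a sketch of what linearization means in practice, the snippet below approximates A and B numerically for a damped pendulum using central differences. The pendulum model, its parameters (g, l, b), and the helper name `linearize` are assumptions chosen for illustration; they are not part of the quadrotor model.

```python
import numpy as np

def pendulum(x, u, g=9.81, l=1.0, b=0.5):
    """Nonlinear dynamics; x = [theta, theta_dot], theta = 0 hangs straight down."""
    theta, theta_dot = x
    return np.array([theta_dot, -(g / l) * np.sin(theta) - b * theta_dot + u])

def linearize(f, x_eq, u_eq, eps=1e-6):
    """Central-difference Jacobians A = df/dx and B = df/du at (x_eq, u_eq)."""
    n = len(x_eq)
    A = np.zeros((n, n))
    for i in range(n):
        dx = np.zeros(n)
        dx[i] = eps
        A[:, i] = (f(x_eq + dx, u_eq) - f(x_eq - dx, u_eq)) / (2 * eps)
    B = ((f(x_eq, u_eq + eps) - f(x_eq, u_eq - eps)) / (2 * eps)).reshape(n, 1)
    return A, B

# Two equilibria of the same nonlinear system give two different linear models
A_down, B_down = linearize(pendulum, np.array([0.0, 0.0]), 0.0)     # hanging
A_up, B_up = linearize(pendulum, np.array([np.pi, 0.0]), 0.0)       # inverted
print("A (hanging):\n", np.round(A_down, 3))
print("A (inverted):\n", np.round(A_up, 3))
```

The only difference between the two A matrices is the sign of the gravity entry (−g/l when hanging, +g/l when inverted), which shows that the linear model depends on the equilibrium it is taken around.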

Stability: eigenvalues decide everything

The long-term behavior of ẋ = Ax (with u=0) is entirely determined by the eigenvalues of A. Let λ be an eigenvalue. If Re(λ) < 0 for all eigenvalues, perturbations decay exponentially — the system is asymptotically stable. If any Re(λ) > 0, perturbations grow exponentially — the system is unstable.

Eigenvalue location	Re(λ) sign	Behavior	Example
Left half-plane	< 0	Decays to zero	Pendulum with friction
Imaginary axis	= 0	Oscillates forever	Undamped spring
Right half-plane	> 0	Grows without bound	Inverted pendulum
Complex pair left	< 0	Damped oscillation	Pendulum with light friction

A hovering quadrotor without control is unstable. Its linearized A matrix is a chain of integrators with repeated eigenvalues at the origin, so nothing pulls the state back: a position or attitude perturbation drifts and keeps growing over time. Control must place all closed-loop eigenvalues in the left half-plane.

Worked numbers: a simple 1D system

Consider a mass on a spring: ẍ = −kx/m − bẋ/m + u/m. In state-space form with x = [position, velocity]:

ẋ = [ẋ₁, ẋ₂]ᵀ = [[0, 1],[ −k/m, −b/m]] · [x₁, x₂]ᵀ + [[0],[1/m]] · u

With m=1 kg, k=4 N/m, b=0: A = [[0,1],[−4,0]]. Eigenvalues: λ = ±2j (purely imaginary: undamped oscillation). Add b=4: the characteristic equation λ² + 4λ + 4 = 0 gives a repeated eigenvalue λ = −2 (real and negative: critically damped, stable, no oscillation). Damping b is the key to stability here.
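
The eigenvalues in this example can be checked numerically. A minimal sketch using `numpy.linalg.eigvals`; the helper name `spring_A` is an assumption for the example.

```python
import numpy as np

def spring_A(m, k, b):
    """System matrix of the mass-spring-damper with state [position, velocity]."""
    return np.array([[0.0, 1.0],
                     [-k / m, -b / m]])

for b in (0.0, 4.0):
    lam = np.linalg.eigvals(spring_A(m=1.0, k=4.0, b=b))
    print(f"b = {b}: eigenvalues = {np.round(lam, 3)}")
```

With b=0 the eigenvalues are ±2j; with b=4 both eigenvalues sit at −2 (up to floating-point rounding).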

Pole-plane stability visualizer — drag the poles

The complex plane shows two conjugate poles. Drag them left (stable) or right (unstable). The right panel shows the resulting time response. Poles in the left half = decay. Right half = explosion. On imaginary axis = oscillation.

A 2D system ẋ = Ax has A = [[−1, 2],[0, −3]]. The eigenvalues are −1 and −3 (both negative real). What is the system's long-term behavior from any initial condition?

The state oscillates because A has off-diagonal entries
The state decays to zero because all eigenvalues have negative real parts
The state grows because A has a positive entry (the +2 off-diagonal)
Cannot determine: must simulate to find out

Chapter 2: Open vs Closed Loop

There are two fundamentally different ways to control a system. In open-loop control, you compute your commands from the desired trajectory alone, ignoring what the system actually does. In closed-loop (feedback) control, you continuously measure the actual state, compare it to the desired state, and compute commands from the error.

Reference r(t)

Desired setpoint or trajectory

↓

Controller C

Computes command u from error e = r − y

↓

Plant P

The physical system (quadrotor, motor, arm)

↻ measured output y fed back to compute e

Why open-loop fails in robotics

Open-loop control works beautifully if your model is perfect and there are no disturbances. In a real robot, neither is true. Motors have friction that varies with temperature. Wind applies unknown forces. Sensor calibration drifts. An open-loop controller that was tuned on a calm day will fail on a windy day — it never "knows" it's drifting.

Feedback is error-driven actuation. The key insight: a feedback controller doesn't need a perfect model. It just needs to know the error. Even with a rough model, feedback will push the system toward zero error. This robustness to model mismatch is why virtually all real-world controllers use feedback.

The feedback loop equation

For a linear system, the closed-loop dynamics look like this. Without control: ẋ = Ax. With a linear feedback law u = −Kx (we'll derive K later):

ẋ = Ax + B(−Kx) = (A − BK)x = A_cl x

The closed-loop system matrix is A_cl = A − BK. The controller K reshapes the eigenvalues of A into whatever we want. An unstable A (right-half eigenvalues) can become a stable A_cl with the right K. This is the fundamental promise of feedback control: you can move the poles to stable locations by choosing K.
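
To make the pole-moving claim concrete, here is a small sketch on an assumed plant, the double integrator (position and velocity driven by a force). The gain K = [6, 5] is picked by hand so that the closed-loop characteristic polynomial s² + 5s + 6 has roots −2 and −3; the plant and the pole locations are illustrative choices.

```python
import numpy as np

# Double integrator: x = [position, velocity], u = acceleration command
A = np.array([[0.0, 1.0],
              [0.0, 0.0]])
B = np.array([[0.0],
              [1.0]])

# With u = -K x, K = [k1, k2], the closed loop is s^2 + k2 s + k1.
# Matching (s + 2)(s + 3) = s^2 + 5 s + 6 gives k1 = 6, k2 = 5.
K = np.array([[6.0, 5.0]])
A_cl = A - B @ K

open_loop_poles = np.linalg.eigvals(A)
closed_loop_poles = np.linalg.eigvals(A_cl)
print("open-loop poles:  ", open_loop_poles)
print("closed-loop poles:", closed_loop_poles)
```

The open-loop poles both sit at the origin; with feedback they move to −2 and −3.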

A concrete 1D example

Scalar system: ẋ = ax + bu. Without control (u=0), if a > 0, x(t) = x₀e^{at} → ∞. Now apply u = −kx. Then ẋ = ax − bkx = (a−bk)x. If we choose k > a/b (taking b > 0), the exponent (a−bk) becomes negative and x decays to zero. Worked numbers: a=2, b=1. For stability we need k > 2. Take k=5: closed-loop pole = 2−5 = −3. Time constant τ = 1/3 ≈ 0.33 s. Starting at x₀=1, after 1 s: x = e⁻³ ≈ 0.05. The feedback gain k=5 stabilized an unstable system (a=+2) and brought the state down to 5% of its initial value within one second.

A linear system has A = 3 (scalar, unstable). You apply feedback u = −kx with b = 1. What is the minimum k needed to make the closed-loop system stable?

k > 0 (any positive gain works)
k > 1 (the gain must exceed unity)
k > 3 (the gain must exceed the unstable pole magnitude)
k > 9 (k must exceed A²)

Chapter 3: PID: Derive Each Term

PID stands for Proportional-Integral-Derivative. Each term responds to a different aspect of the error signal e(t) = r(t) − y(t), where r is the reference (setpoint) and y is the measured output. The control law is:

u(t) = K_P·e(t) + K_I·∫e(τ)dτ + K_D·(de/dt)

Let's derive why each term is necessary, and what happens if you remove any one of them.

P term: stiffness (push proportional to displacement)

The proportional term u_P = K_P·e is the simplest feedback law. Think of it as a spring: the further you are from the setpoint, the harder you push back. It gives the system stiffness. Increasing K_P makes the response faster but also more prone to overshoot and oscillation. For the drone 0.5 m below setpoint, K_P=4 gives u_P = 4×0.5 = 2 N of upward thrust correction.

D term: damping (resist the velocity of change)

The derivative term u_D = K_D·ė = K_D·(de/dt) acts like a damper. It opposes the rate of change of the error. When the drone is closing on the setpoint fast (error decreasing rapidly), ė < 0, so u_D < 0: it brakes the approach. This prevents overshoot. Without D, a high P gain causes oscillation; the D term is the "anti-oscillation" term. If the drone is 0.5 m below and rising toward the setpoint at 0.3 m/s: ė = −0.3 m/s (error decreasing). K_D=2 gives u_D = 2×(−0.3) = −0.6 N (braking).

I term: memory (eliminate steady-state error)

The integral term u_I = K_I·∫e(τ)dτ accumulates all past errors. If there is a constant disturbance (say, a steady 1 N wind), the P term alone settles at a steady-state error e_ss = disturbance/K_P. No matter how long you wait, this offset persists. The integral builds up over time until K_I·∫e·dt = 1 N, fully canceling the disturbance. The I term gives the controller infinite DC gain — it will keep pushing until the error is exactly zero.

Worked numbers: step response at t=0

Initial conditions: error e₀=1 m, ė₀=0, ∫e₀=0. Gains: K_P=4, K_D=2, K_I=1.

u_P(0) = 4 × 1.0 = 4.0 N
u_D(0) = 2 × 0.0 = 0.0 N (ė=0 at start)
u_I(0) = 1 × 0.0 = 0.0 N (no history yet)
u(0) = 4.0 + 0.0 + 0.0 = 4.0 N

At t=0.5 s (suppose error has dropped to 0.4 m, ė = −1.2 m/s, ∫e=0.3 m·s):

u_P(0.5) = 4 × 0.4 = 1.6 N
u_D(0.5) = 2 × (−1.2) = −2.4 N (braking!)
u_I(0.5) = 1 × 0.3 = 0.3 N
u(0.5) = 1.6 − 2.4 + 0.3 = −0.5 N

The D term has gone negative (braking) because the system is approaching the setpoint fast. Without D, u(0.5) = 1.9 N, still pushing forward — causing overshoot.
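
The arithmetic above can be reproduced with a small helper. This is a sketch only; the function name `pid_terms` is an assumption, and the inputs are the same supposed values used in the worked numbers.

```python
def pid_terms(e, e_dot, e_int, Kp=4.0, Ki=1.0, Kd=2.0):
    """Return the individual P, I, D contributions and their sum."""
    u_p = Kp * e
    u_i = Ki * e_int
    u_d = Kd * e_dot
    return u_p, u_i, u_d, u_p + u_i + u_d

start = pid_terms(e=1.0, e_dot=0.0, e_int=0.0)     # t = 0
later = pid_terms(e=0.4, e_dot=-1.2, e_int=0.3)    # t = 0.5 s
print("t = 0.0 s: P, I, D, total =", start)
print("t = 0.5 s: P, I, D, total =", later)
print("t = 0.5 s without D:", later[0] + later[1])
```

The totals match the hand calculation: 4.0 N at t=0, about −0.5 N at t=0.5 s, and about 1.9 N at t=0.5 s if the D term is dropped.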

A PID controller has K_P=5, K_I=0, K_D=0 (P-only). The system reaches a steady state with error e_ss=0.2 m against a constant wind disturbance. What happens if you add K_I=1?

Nothing: the steady-state error is already as small as the gains allow
The error doubles because integral amplifies the existing error
The integral accumulates e_ss=0.2 over time, building output until the wind is fully canceled, driving steady-state error to zero
The P term is cancelled because P and I fight each other

Chapter 4: PID Tuning & Anti-Windup

Having a PID controller and having a working PID controller are two different things. The gains K_P, K_I, K_D must be tuned. Too low: sluggish. Too high: oscillation or instability. The interaction between the three terms makes tuning non-trivial.

Tuning intuition: the Ziegler-Nichols mental model

Start with K_I=0, K_D=0. Increase K_P until the system oscillates continuously — this is the ultimate gain K_u at the ultimate period T_u. Ziegler-Nichols suggests K_P=0.6K_u, K_I=1.2K_u/T_u, K_D=0.075K_uT_u. This is a starting point, not a final answer — always retune on the real hardware.
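
The Ziegler-Nichols rule quoted above is a direct formula, so it can be wrapped in a helper. A minimal sketch; the function name and the example values K_u=10, T_u=2 s are hypothetical, not measurements from any real vehicle.

```python
def ziegler_nichols_pid(Ku, Tu):
    """Classic Ziegler-Nichols PID starting gains from ultimate gain and period."""
    Kp = 0.6 * Ku
    Ki = 1.2 * Ku / Tu
    Kd = 0.075 * Ku * Tu
    return Kp, Ki, Kd

# Hypothetical experiment result: sustained oscillation at Ku = 10 with period 2 s
Kp, Ki, Kd = ziegler_nichols_pid(Ku=10.0, Tu=2.0)
print(f"Kp = {Kp}, Ki = {Ki}, Kd = {Kd}")
```

For these example inputs the starting gains are K_P=6, K_I=6, K_D=1.5, to be retuned on hardware.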

More P gain is not always better. Increasing K_P makes the system faster but also more oscillatory. The D term damps oscillation, but too much D amplifies sensor noise. There is always a tradeoff: high bandwidth (fast response) vs. noise sensitivity. Real tuning is balancing these tradeoffs for the specific task.

Integrator windup

The I term has a dangerous failure mode called integrator windup. Suppose the drone is trying to lift off from the ground but is held down, so the thrust command saturates at its limit while the error stays large and constant: e = +2 m. The integral keeps accumulating: ∫e grows to 10, 100, 1000 m·s. When the constraint is released, the massive integral term causes the drone to shoot up far beyond the setpoint, producing severe overshoot or even a crash.

The fix is anti-windup: when the actuator saturates (command clips to max or min), stop integrating. Concretely: don't accumulate the integral when u is at its limit.

python
import numpy as np

class PIDController:
    def __init__(self, Kp, Ki, Kd, u_min, u_max, dt):
        self.Kp, self.Ki, self.Kd = Kp, Ki, Kd
        self.u_min, self.u_max = u_min, u_max
        self.dt = dt
        self.integral = 0.0
        self.prev_e   = 0.0

    def step(self, e):
        # Proportional term
        u_p = self.Kp * e
        # Derivative term (backward difference on the error)
        u_d = self.Kd * (e - self.prev_e) / self.dt

        # Anti-windup: only integrate when the command is NOT saturated.
        # self.integral already includes dt, so it is scaled by Ki alone.
        u_test = u_p + self.Ki * self.integral + u_d
        if self.u_min < u_test < self.u_max:
            self.integral += e * self.dt   # accumulate

        u_i = self.Ki * self.integral
        u = np.clip(u_p + u_i + u_d, self.u_min, self.u_max)
        self.prev_e = e
        return u

PID step-response tuning playground

Unit step: reference jumps from 0 to 1 at t=0. Tune K_P, K_I, K_D and watch the step response. Observe overshoot, settling time, and steady-state error in real time. The system is a double integrator (mass with friction — like a quadrotor altitude axis).

Slider defaults: K_P = 6.0, K_I = 1.0, K_D = 2.0

Integrator windup is most dangerous when:

K_I is very small, because a small integral builds up slowly
The actuator is saturated for a long time while the error remains large
K_D is large, because the derivative amplifies the integral
The setpoint changes rapidly, confusing the integral calculation

Chapter 5: LQR: Optimal Control

PID works well when you can tune three numbers by hand. But a quadrotor has 12 state dimensions and 4 inputs, so a full state-feedback gain K is a 4×12 matrix with 48 entries. Tuning that many coupled gains manually is impractical. Linear Quadratic Regulator (LQR) automates the gain design by solving an optimization problem: find the feedback law u = −Kx that minimizes a weighted cost over all future time.

The cost function

LQR minimizes the infinite-horizon quadratic cost:

J = ∫₀^∞ (x^TQx + u^TRu) dt

where Q ∈ ℝ^{n×n} (positive semidefinite) penalizes state error and R ∈ ℝ^{m×m} (positive definite) penalizes control effort. The term x^TQx says "I care about states x_i being near zero, with weight Q_ii." The term u^TRu says "I care about keeping inputs small: don't waste actuator authority."

Q/R tradeoff intuition

The ratio Q/R is the fundamental design knob. Large Q/R: the optimizer is told "state errors are very expensive, control effort is cheap", so it pushes the state hard toward zero, using lots of control. Small Q/R: "control effort is expensive, state error is acceptable", which gives gentle corrections, slower convergence, and smaller actuator commands. This is loosely analogous to PID gain tuning: a large Q/R behaves like high feedback gains, a small Q/R like low gains.

Q penalizes where you are; R penalizes how hard you push. Diagonal Q means each state is penalized independently. Q₁₁=100 means "keep x₁ near zero very tightly." R=0.01 means "control effort is cheap — use the actuators freely." The solution automatically balances these competing objectives.

The optimal feedback law

The remarkable result of LQR: the optimal control is always a linear state feedback law:

u^* = −Kx where K = R⁻¹B^TP

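Here P ∈ ℝ^{n×n} is the stabilizing solution to the Algebraic Riccati Equation (ARE). It is unique and positive semidefinite when (A, B) is stabilizable and (A, Q^{1/2}) is detectable, and positive definite when (A, Q^{1/2}) is observable: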

A^TP + PA − PBR⁻¹B^TP + Q = 0

You do not need to solve the ARE by hand. In practice, scipy.linalg.solve_continuous_are(A, B, Q, R) computes P in milliseconds. The resulting K is the globally optimal linear feedback gain for the given Q and R.
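
The Q/R tradeoff can be observed directly by sweeping the state weight and solving the ARE each time. This is a sketch under assumptions: the plant is a double integrator, Q is taken as q·I with R = 1, and the helper name `lqr_gain` is made up for the example. For this particular plant the gain has the closed form K = [√q, √(q + 2√q)], which the code can be checked against.

```python
import numpy as np
from scipy.linalg import solve_continuous_are

A = np.array([[0.0, 1.0],
              [0.0, 0.0]])      # double integrator (assumed plant)
B = np.array([[0.0],
              [1.0]])
R = np.array([[1.0]])

def lqr_gain(q):
    """LQR gain for Q = q * I and R = 1; returns K with shape (1, 2)."""
    Q = q * np.eye(2)
    P = solve_continuous_are(A, B, Q, R)
    return np.linalg.solve(R, B.T @ P)      # K = R^{-1} B^T P

gains = {q: lqr_gain(q) for q in (0.1, 1.0, 10.0, 100.0)}
for q, K in gains.items():
    poles = np.linalg.eigvals(A - B @ K)
    print(f"Q/R = {q:6.1f}  K = {np.round(K.ravel(), 3)}  "
          f"slowest pole real part = {poles.real.max():.3f}")
```

As q grows, both gains grow and the closed-loop poles move further left, which is the "aggressive" end of the tradeoff.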

Worked numbers: LQR cost evaluation for two K choices

Scalar system: A=0, B=1, Q=1, R=1. The ARE simplifies to: −P² + 1 = 0, so P=1, K=P=1. Optimal gain K=1. Compare two controllers from x₀=1:

Controller	K	Closed-loop pole	x(1 s)	Infinite-horizon cost J
Under-gain	0.5	−0.5	e^−0.5 ≈ 0.61	∫₀^∞ (x² + 0.25x²) dt = 1.25 ∫₀^∞ e^−t dt = 1.25
Optimal LQR	1.0	−1.0	e⁻¹ ≈ 0.37	∫₀^∞ (e^−2t + e^−2t) dt = 1.0 ✓
Over-gain	3.0	−3.0	e⁻³ ≈ 0.05	∫₀^∞ (e^−6t + 9e^−6t) dt = 10/6 ≈ 1.67

The optimal K=1 achieves the minimum cost J=1.0. The over-gain (K=3) drives the state to zero faster but pays a high control cost (u² = 9x²), giving a higher total J. The under-gain settles slowly, paying a high state cost. LQR finds the sweet spot automatically.
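
For this scalar system the cost has a closed form, which makes the table easy to check. With x(t) = x₀e^{−Kt} and u = −Kx, the integrand is (Q + RK²)x₀²e^{−2Kt}, so J(K) = (Q + RK²)x₀²/(2K) over an infinite horizon. The sketch below evaluates that expression; the function name `scalar_cost` is an assumption for the example.

```python
import numpy as np

def scalar_cost(K, q=1.0, r=1.0, x0=1.0):
    """Infinite-horizon LQR cost for x' = u (A=0, B=1) under u = -K x, K > 0."""
    return (q + r * K**2) * x0**2 / (2.0 * K)

for K in (0.5, 1.0, 3.0):
    print(f"K = {K}: J = {scalar_cost(K):.3f}")

# Scan a grid of gains to confirm where the minimum sits
grid = np.linspace(0.1, 5.0, 491)          # step of 0.01
K_best = grid[np.argmin(scalar_cost(grid))]
print(f"grid minimum near K = {K_best:.2f}")
```

The three printed costs are 1.25, 1.0, and about 1.667, and the grid minimum lands at K = 1.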

python
import numpy as np
from scipy.linalg import solve_continuous_are

# System: quadrotor altitude axis (double integrator + drag)
# State x = [z, z_dot], input u = thrust deviation from hover
m = 1.0   # kg
b = 0.1   # drag coefficient
A = np.array([[0, 1],
              [0, -b/m]])
B = np.array([[0],
              [1/m]])

# LQR weights: penalize altitude error 10x more than thrust effort
Q = np.diag([10.0, 1.0])   # [z, z_dot] weights
R = np.array([[1.0]])         # thrust cost

# Solve Riccati equation for P
P = solve_continuous_are(A, B, Q, R)
# Optimal gain: K = R^{-1} B^T P
K = np.linalg.inv(R) @ B.T @ P

# K = [Kz, Kzdot] — feedback on altitude error and velocity
# Control law: u = -K @ (x - x_desired)
print(f"K = {K.flatten()}")  # approximately K = [3.16, 2.61]

# LQR cost evaluator: simulate and compute J
def lqr_cost(K, A, B, Q, R, x0, T=5.0, dt=0.01):
    x = x0.copy(); J = 0.0
    for _ in np.arange(0, T, dt):
        u = -K @ x
        J += (x @ Q @ x + u @ R @ u) * dt
        x = x + (A @ x + B.flatten() * u[0]) * dt
    return J

LQR Q/R tradeoff — trajectory and cost live

Slider sets log(Q/R). Left = gentle control (low Q/R). Right = aggressive control (high Q/R). Watch the trajectory change and the cost breakdown: state cost vs control cost.

Slider default: log(Q/R) = 0.5

In LQR, if you double Q (the state-error penalty) while keeping R fixed, what happens to the optimal gain K and the resulting closed-loop behavior?

K decreases: the controller is less aggressive to preserve the newly expensive control effort
K stays the same: Q only affects the cost, not the gain
K increases: the controller pushes harder to reduce state errors that are now penalized more
The system becomes unstable: large Q creates oscillation

Chapter 6: Quadrotor Dynamics

Everything so far has been generic control theory. Now we apply it to the quadrotor — the canonical VNAV platform. The quadrotor's dynamics are richer and stranger than a simple mass-spring because it is underactuated: four scalar inputs (rotor speeds) must control six degrees of freedom.

The Newton-Euler equations

The quadrotor obeys Newton's second law in both translation and rotation. In compact form:

m p̈^w = −mge₃ + R^w_B f^B
Jω̇^B = −ω^B × Jω^B + τ^B

The translational equation says: mass × acceleration = gravity (downward) + thrust force rotated to world frame. The rotational equation says: inertia × angular acceleration = Euler term (gyroscopic coupling) + applied torques. Here R^w_B is the rotation from body to world frame, f^B is force in body frame (only the z component is nonzero for a standard quadrotor), τ^B is torque in body frame, and J is the 3×3 inertia matrix.
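
The two equations translate almost line for line into code. The sketch below evaluates the state derivatives for given force and torque inputs. The function name, the inertia values, and the attitude kinematics Ṙ = R[ω]× (standard rigid-body kinematics, added here so the state derivative is complete) are assumptions beyond what the text states.

```python
import numpy as np

def quadrotor_derivatives(v, R, omega, f_body, tau_body, m, J, g=9.81):
    """Newton-Euler right-hand side: returns (p_dot, v_dot, R_dot, omega_dot)."""
    e3 = np.array([0.0, 0.0, 1.0])
    p_dot = v
    # Translation: m * acceleration = gravity + thrust rotated into the world frame
    v_dot = -g * e3 + (R @ f_body) / m
    # Rotation: J * omega_dot = -omega x (J omega) + tau
    omega_dot = np.linalg.solve(J, -np.cross(omega, J @ omega) + tau_body)
    # Attitude kinematics (assumed standard form): R_dot = R * hat(omega)
    omega_hat = np.array([[0.0, -omega[2], omega[1]],
                          [omega[2], 0.0, -omega[0]],
                          [-omega[1], omega[0], 0.0]])
    R_dot = R @ omega_hat
    return p_dot, v_dot, R_dot, omega_dot

# Hover check with hypothetical parameters: thrust equals weight, no torque
m = 1.0
J = np.diag([0.01, 0.01, 0.02])
hover = quadrotor_derivatives(v=np.zeros(3), R=np.eye(3), omega=np.zeros(3),
                              f_body=np.array([0.0, 0.0, m * 9.81]),
                              tau_body=np.zeros(3), m=m, J=J)
print("hover acceleration:", hover[1], "angular acceleration:", hover[3])
```

At hover every derivative is zero, which confirms that thrust equal to mg along body z is an equilibrium of the model.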

Rotor physics: thrust and torque from spin speed

Each rotor i spinning at angular velocity w_i produces:

Thrust: T_i = c_f w_i|w_i| (thrust coefficient c_f)
Drag torque: τ_drag,i = (−1)ⁱ⁺¹ c_d w_i|w_i| (alternating sign: CW/CCW rotors)

The total thrust is the sum of all four: f^B_z = c_f(w₁|w₁| + w₂|w₂| + w₃|w₃| + w₄|w₄|). The torques come from both the drag and from the geometry: off-center thrust at arm position ρ^B_i creates a moment ρ^B_i × T_ie₃.

The mixing matrix: rotor speeds to thrust/torque

We can pack the input mapping into a 4×4 matrix F̄ (the "mixing matrix") that maps the signed-square rotor speeds w = [w₁|w₁|, w₂|w₂|, w₃|w₃|, w₄|w₄|]^T to [f^B_z, τ^B_x, τ^B_y, τ^B_z]:

[f^B_z, τ^B_x, τ^B_y, τ^B_z]^T = F̄ · w

For a + configuration quad with arm length L and rotor positions at [±L,0] and [0,±L]:

F̄ = [[c_f, c_f, c_f, c_f],
     [0, −c_fL, 0, c_fL],
     [c_fL, 0, −c_fL, 0],
     [−c_d, c_d, −c_d, c_d]]

Because F̄ is invertible, you can always go from desired [f^B_z, τ^B] to required rotor speeds via w = F̄⁻¹[f^B_z, τ^B]. This inverse mapping is called the thrust-torque mixer or control allocation.

Worked thrust-torque mixing example

Parameters: m=1 kg, g=9.81 m/s², c_f=1e−5 N/(rad/s)², c_d=1e−6 Nm/(rad/s)², L=0.25 m. At hover: total thrust = mg = 9.81 N, all torques = 0. Each rotor: T_i = 9.81/4 = 2.4525 N. Rotor speed: w_i = √(T_i/c_f) = √(2.4525/1e−5) ≈ 495 rad/s ≈ 4730 rpm. Now apply a roll torque τ_x=0.5 Nm. From F̄ row 2: c_fL(w₄² − w₂²) = 0.5, so w₄² − w₂² = 0.5/(2.5e−6) = 200,000 (rad/s)². Splitting this evenly keeps total thrust and yaw torque unchanged: w₂² = 245,250 − 100,000 = 145,250, so w₂ drops to ≈ 381 rad/s, and w₄² = 345,250, so w₄ rises to ≈ 588 rad/s.

python
import numpy as np

# Quadrotor thrust-torque mixer
def mixer(cf=1e-5, cd=1e-6, L=0.25):
    """Returns 4x4 mixing matrix F_bar and its inverse."""
    F = np.array([
        [cf,      cf,      cf,      cf     ],
        [0,      -cf*L,   0,       cf*L  ],
        [cf*L,    0,      -cf*L,   0     ],
        [-cd,     cd,     -cd,      cd     ]
    ])
    return F, np.linalg.inv(F)

F, Finv = mixer()

# Hover: total thrust = mg = 9.81 N, zero torques
m, g = 1.0, 9.81
desired = np.array([m*g, 0.0, 0.0, 0.0])  # [fz, tx, ty, tz]
w_sq = Finv @ desired   # signed-square rotor speeds
w_rpm = np.sqrt(np.abs(w_sq)) / (2*np.pi/60)
print(f"Hover RPM: {w_rpm}")  # ~4730 rpm each

# Roll maneuver: add 0.5 Nm roll torque
desired_roll = np.array([m*g, 0.5, 0.0, 0.0])
w_sq_roll = Finv @ desired_roll
print(f"Roll w_sq: {w_sq_roll}")   # rotor 2 decreases, rotor 4 increases
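
As a follow-up sketch, the roll case can be converted from signed-square values back to rotor speeds and checked against the mixing matrix. The matrix and parameters are the ones used in this chapter; solving with `numpy.linalg.solve` instead of forming the inverse is a stylistic choice made here, not something the text prescribes.

```python
import numpy as np

cf, cd, L = 1e-5, 1e-6, 0.25
F = np.array([
    [cf,      cf,      cf,      cf],
    [0.0,    -cf * L,  0.0,     cf * L],
    [cf * L,  0.0,    -cf * L,  0.0],
    [-cd,     cd,     -cd,      cd],
])

desired_roll = np.array([9.81, 0.5, 0.0, 0.0])   # [fz, tx, ty, tz]
w_sq = np.linalg.solve(F, desired_roll)          # signed-square rotor speeds
w = np.sign(w_sq) * np.sqrt(np.abs(w_sq))        # rotor speeds in rad/s

print("signed squares:", np.round(w_sq))
print("rotor speeds (rad/s):", np.round(w, 1))
print("reconstructed [fz, tx, ty, tz]:", np.round(F @ w_sq, 6))
```

Rotors 1 and 3 stay at the hover value (w² = 245,250), rotor 2 drops to w² = 145,250 (about 381 rad/s), and rotor 4 rises to w² = 345,250 (about 588 rad/s).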

The quadrotor is underactuated. It has 6 DOF (3 position + 3 orientation) but only 4 independent inputs. This means it CANNOT independently control all 6 DOF simultaneously. Specifically: it cannot move sideways without tilting. To go left, it must roll left (tilt), redirecting the total thrust vector. Position and attitude are coupled by physics. This is why position control must compute a desired attitude and then attitude control tracks that attitude — the two cannot be decoupled.

A quadrotor wants to accelerate to the left (+y direction in the world frame). Assuming it can only thrust along its body z-axis, which physical maneuver is required?

Increase total thrust while keeping level: the extra thrust has a sideways component
Apply a yaw torque to spin the drone left
Decrease thrust on the right rotors to create a roll moment rightward
Roll the drone to the left so the thrust vector has a leftward component in the world frame

Chapter 7: Cascaded Control Loops & Geometric Control

A quadrotor's full controller isn't one monolithic block — it's a hierarchy of nested loops. The outer loop handles position; the inner loop handles attitude. This cascaded control structure exploits the timescale separation: attitude dynamics (rotating the drone) are much faster than position dynamics (moving the drone through space).

Position Reference p_d, ψ_d

Desired 3D position + yaw angle

↓

Position Controller

Computes desired thrust magnitude f_z and desired body z-axis z_d^w

↓

Attitude Reference R_d ∈ SO(3)

Desired rotation matrix from position loop output

↓

Attitude Controller

Computes torques τ to track R_d, operating on SO(3)

↓

Mixer F̄⁻¹

Converts [f_z, τ] to individual rotor speeds

The position loop: from error to desired attitude

The position controller receives the position error e_p = p^w − p^w_d and velocity error e_v = v^w − v^w_d. It computes an "ideal force" (what force, in any direction, would PD-correct the position):

f_ideal = −k_pe_p − k_ve_v + mge₃ + mp̈^w_d

The gravity term mge₃ and feedforward term mp̈_d pre-cancel gravity and track the desired acceleration. But the quadrotor can only thrust along its own body z-axis. So the controller: (1) sets the desired body z direction z_d^w = f_ideal/‖f_ideal‖, pointing the drone to produce the ideal force; (2) sets the thrust magnitude f_z = f_ideal · R^w_Be₃ (projected onto actual body z).
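
The position-loop steps above can be sketched as a single function. The function name, the scalar gains k_p=4 and k_v=2, and the example state are hypothetical; only the formulas for f_ideal, z_d^w, and f_z come from the text.

```python
import numpy as np

def position_loop(p, v, p_d, v_d, a_d, R, m, k_p, k_v, g=9.81):
    """Outer loop: returns (f_ideal, desired body z-axis, thrust magnitude)."""
    e3 = np.array([0.0, 0.0, 1.0])
    e_p = p - p_d                                  # position error
    e_v = v - v_d                                  # velocity error
    f_ideal = -k_p * e_p - k_v * e_v + m * g * e3 + m * a_d
    z_d = f_ideal / np.linalg.norm(f_ideal)        # where body z should point
    f_z = f_ideal @ (R @ e3)                       # projection on the actual body z
    return f_ideal, z_d, f_z

# Example: level drone, 0.5 m short of the target along world y, at rest
f_ideal, z_d, f_z = position_loop(
    p=np.array([0.0, -0.5, 2.0]), v=np.zeros(3),
    p_d=np.array([0.0, 0.0, 2.0]), v_d=np.zeros(3), a_d=np.zeros(3),
    R=np.eye(3), m=1.0, k_p=4.0, k_v=2.0)
print("f_ideal =", f_ideal)
print("z_d =", np.round(z_d, 3), " f_z =", f_z)
```

Here f_ideal = [0, 2, 9.81] N: the desired body z-axis leans toward +y, while the commanded thrust on the still-level vehicle is 9.81 N because only the component along the current body z is used.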

Geometric control: why SO(3) matters for attitude

The attitude controller must track the desired rotation R_d ∈ SO(3). A naive approach would represent attitude as Euler angles and apply PID to each angle. This fails at singularities (gimbal lock at pitch=±90°) and doesn't respect the geometry of SO(3).

Geometric control (Lee, Leok, McClamroch 2010) works directly on SO(3) by defining the rotation error using the Lie group structure (exactly as in L2). The rotation error vector is:

e_R = ½(R_d^TR_B − R_B^TR_d)^∨

This is the vee (∨) of the skew-symmetric part of R_d^TR_B. For a relative rotation of angle θ about a unit axis its magnitude is sin θ, so for small errors it matches the axis-angle error between current and desired rotation, expressed in the body frame. The angular velocity error is e_ω = ω^B − R_B^TR_dω_d.
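
A short sketch of the rotation error in code. The helper names `vee`, `rotation_error`, and `rot_z` are assumptions for the example; the formula is the e_R definition given above.

```python
import numpy as np

def vee(S):
    """Map a skew-symmetric 3x3 matrix to its 3-vector."""
    return np.array([S[2, 1], S[0, 2], S[1, 0]])

def rotation_error(R_d, R_B):
    """e_R = 1/2 * vee(R_d^T R_B - R_B^T R_d)."""
    return 0.5 * vee(R_d.T @ R_B - R_B.T @ R_d)

def rot_z(theta):
    """Rotation by theta about the z-axis."""
    c, s = np.cos(theta), np.sin(theta)
    return np.array([[c, -s, 0.0],
                     [s,  c, 0.0],
                     [0.0, 0.0, 1.0]])

for theta in (0.1, np.pi / 2, np.pi):
    e_R = rotation_error(np.eye(3), rot_z(theta))
    print(f"theta = {theta:.3f} rad  e_R = {np.round(e_R, 4)}")
```

For a pure yaw error θ the result is e_R = [0, 0, sin θ]: close to θ for small errors, largest at 90°, and vanishing again at exactly 180°.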

The geometric control law

The full geometric controller torque:

τ^B = −k_Re_R − k_ωe_ω + ω^B × Jω^B − J([ω^B]×R_B^TR_dω_d − R_B^TR_dω̇_d)

The first two terms are PD on the rotation error (in SO(3)). The remaining terms cancel the gyroscopic coupling (ω×Jω) and feedforward the desired angular acceleration. This controller is proven to be almost globally stable on SO(3): it converges from almost any initial orientation, excluding only attitude errors of exactly 180° (the "upside-down" configurations, a measure-zero set).
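
The torque law can be written out term by term. This is a sketch that follows the formula above; the function and helper names, the inertia matrix, and the gains k_R=8, k_ω=2 are hypothetical values chosen only to exercise the code.

```python
import numpy as np

def hat(w):
    """Skew-symmetric matrix such that hat(w) @ x == np.cross(w, x)."""
    return np.array([[0.0, -w[2], w[1]],
                     [w[2], 0.0, -w[0]],
                     [-w[1], w[0], 0.0]])

def vee(S):
    return np.array([S[2, 1], S[0, 2], S[1, 0]])

def geometric_torque(R_B, omega, R_d, omega_d, omega_d_dot, J, k_R, k_w):
    """Body torque from the geometric attitude control law."""
    e_R = 0.5 * vee(R_d.T @ R_B - R_B.T @ R_d)       # rotation error
    Rt_Rd = R_B.T @ R_d
    e_w = omega - Rt_Rd @ omega_d                    # angular velocity error
    feedforward = J @ (hat(omega) @ Rt_Rd @ omega_d - Rt_Rd @ omega_d_dot)
    return -k_R * e_R - k_w * e_w + np.cross(omega, J @ omega) - feedforward

J = np.diag([0.01, 0.01, 0.02])                      # hypothetical inertia
c, s = np.cos(0.1), np.sin(0.1)
R_yawed = np.array([[c, -s, 0.0], [s, c, 0.0], [0.0, 0.0, 1.0]])
zero = np.zeros(3)

tau_aligned = geometric_torque(np.eye(3), zero, np.eye(3), zero, zero, J, 8.0, 2.0)
tau_yawed = geometric_torque(R_yawed, zero, np.eye(3), zero, zero, J, 8.0, 2.0)
print("aligned:", tau_aligned)
print("0.1 rad yaw error:", np.round(tau_yawed, 4))
```

With no error the torque is zero; with a 0.1 rad yaw error at rest the law commands a restoring torque of −k_R·sin(0.1) about body z.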

Geometric control is PD control on the rotation manifold. The k_Re_R term is the proportional part: proportional to how far the current rotation is from desired. The k_ωe_ω term is the derivative part: proportional to relative angular velocity error. The Lie group structure ensures the error metric respects the non-Euclidean geometry of SO(3) — the same insight from L2.

Why does the geometric controller define the rotation error as e_R = ½(R_d^TR_B − R_B^TR_d)^∨ instead of simply subtracting Euler angles φ_current − φ_desired?

Because Euler angles require more computation than matrix operations
Because Euler angles produce larger numbers that cause numerical overflow
Because Euler angles have gimbal-lock singularities at pitch=±90°, and angle subtraction doesn't respect the non-commutative geometry of SO(3); the matrix error works globally
Because the desired Euler angles φ_desired are not available from the position controller

Chapter 8: Showcase: Quadrotor Hover Simulator

This chapter is the payoff. A 2D quadrotor (two rotors, planar motion) uses a PID position controller feeding into a PD attitude controller. Click anywhere to set a new target. Use sliders to tune gains and add wind disturbance. Watch how the drone tilts to move, corrects attitude, and hovers at the target.

This is the cascaded control stack in action. Position error → desired tilt angle → attitude error → differential rotor thrust → motion. The tilt is not commanded directly — it emerges from the position controller demanding a sideways acceleration.
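
The chain from position error to motion can be sketched in a few dozen lines. This is not the simulator on this page: it is a simplified planar model under stated assumptions (PD rather than PID in the outer loop, no wind, a small-angle map from desired sideways acceleration to tilt, thrust and torque as direct inputs instead of two rotor speeds). The function name, inertia, and attitude gains are hypothetical; the position gains 3.0 and 2.0 are the slider defaults shown for the demo.

```python
import numpy as np

def simulate_planar_quad(target, T=10.0, dt=0.002, m=1.0, I=0.01, g=9.81,
                         kp_pos=3.0, kd_pos=2.0, kp_att=100.0, kd_att=20.0):
    """Planar quadrotor: position (y, z), tilt phi; inputs thrust f and torque tau."""
    y = z = phi = vy = vz = w = 0.0
    for _ in range(int(round(T / dt))):
        # Outer loop: PD on position gives desired accelerations
        ay = -kp_pos * (y - target[0]) - kd_pos * vy
        az = -kp_pos * (z - target[1]) - kd_pos * vz
        # Desired tilt (small-angle approximation, clipped) and thrust magnitude
        phi_d = np.clip(-ay / g, -0.5, 0.5)
        f = m * (g + az) / np.cos(phi)
        # Inner loop: PD on attitude produces the torque
        tau = I * (-kp_att * (phi - phi_d) - kd_att * w)
        # Planar dynamics: thrust acts along the tilted body z-axis
        vy += (-(f / m) * np.sin(phi)) * dt
        vz += ((f / m) * np.cos(phi) - g) * dt
        w += (tau / I) * dt
        y += vy * dt
        z += vz * dt
        phi += w * dt
    return y, z, phi

y_end, z_end, phi_end = simulate_planar_quad(target=(1.0, 1.0))
print(f"final y = {y_end:.3f} m, z = {z_end:.3f} m, tilt = {phi_end:.4f} rad")
```

The tilt is never commanded by hand: it appears because the outer loop asks for sideways acceleration, and it returns to zero once the vehicle reaches the target. The inner-loop gains are deliberately several times faster than the outer loop, which is the timescale separation the cascade relies on.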

2D quadrotor hover — click to set target

Click on the canvas to set the target position (orange star). The quadrotor uses cascaded PID+PD control. Increase wind to stress the I term. Reduce K_P to see sluggishness; increase to see overshoot. The tilt angle is produced by the position controller, not set manually.

Slider defaults: K_{P pos} = 3.0, K_{D pos} = 2.0, Wind = 0.0 N

Chapter 9: Connections & Cheat Sheet

You have now built the complete control stack: from state-space fundamentals through PID and LQR to the geometric controller running on SO(3). Here's how everything connects.

Control cheat sheet

Concept	Key formula	Key intuition
State-space	ẋ = Ax + Bu	Eigenvalues of A determine stability
P term	u_P = K_P·e	Stiffness: push back proportionally
I term	u_I = K_I·∫e dt	Memory: eliminates steady-state error
D term	u_D = K_D·ė	Damping: prevents overshoot
LQR cost	J = ∫(x^TQx + u^TRu)dt	Q penalizes error, R penalizes effort
LQR gain	K = R⁻¹B^TP	P from Riccati equation (automated)
Quad thrust	[f_z,τ] = F̄·w	Invertible mixer: desired torques → rotor speeds
Cascade	pos loop → attitude ref → att loop → mixer	Outer loop (slow) → inner loop (fast)
Geometric eR	e_R = ½(R_d^TR_B−R_B^TR_d)^∨	Axis-angle error on SO(3) manifold

What comes next: Trajectory Optimization (L4)

The control stack assumes a desired position p_d(t) and velocity v_d(t) are given at each instant. Where do those come from? That's the job of the trajectory optimizer (Lectures 9–11), which computes smooth, dynamically feasible paths from start to goal. The interface is clean: trajectory optimizer outputs [p_d(t), v_d(t), p̈_d(t), ψ_d(t)] → geometric controller turns it into rotor commands → drone flies the path.

Connection to Lie groups (L2)

The geometric controller is the application of L2 ideas. The rotation error e_R uses the vee map (∨) to extract the axis-angle vector from the relative rotation matrix R_d^TR_B — this is exactly the log map from L2 (at small angles). The proof of almost-global stability relies on the Lyapunov function constructed on the SO(3) manifold, not on a linearized Euler-angle representation.

Related Gleams

VNAV L1: 3D Geometry & Transforms — rotation matrices, SE(3), prerequisite
VNAV L2: Lie Groups & exp Map — SO(3) Lie algebra, retraction, log map
Kalman Filter — the estimator that feeds pose error into this control stack
MDP & Optimal Control — how LQR relates to the broader optimal control / RL framework
Actor-Critic RL — learning-based control that replaces hand-tuned gains

"What I cannot create, I do not understand." You can now: write state-space equations for a quadrotor; implement a PID controller with anti-windup; set up an LQR problem and interpret Q and R; understand why a quadrotor must tilt to translate; derive the rotation error in geometric control; and trace an error signal from sensor reading to rotor command through the full cascaded loop.

In the cascaded control architecture, why does the position controller output a desired rotation matrix R_d rather than directly outputting torque commands?

Because the position controller doesn't have access to the inertia matrix J
Because torques would violate the underactuated constraint
Because the position controller computes the direction the drone must face to produce the desired force, and a separate faster attitude controller then tracks that orientation using the full rotational dynamics
Because the position controller operates in the world frame and torques are only defined in the body frame