Visual Navigation for Autonomous Vehicles · MIT 16.485 · Lectures 8–10

Trajectory Optimization: Minimum-Snap & Differential Flatness

A quadrotor racing through gates must hit each waypoint — but also fly smoothly enough that its motors can actually track the path. A straight-line point-to-point plan requires infinite acceleration at every corner: physically impossible. This lesson shows why smooth flight means minimizing high-order derivatives, how polynomial trajectories encode smoothness as a Quadratic Program, and how differential flatness lets you plan in flat output space while guaranteeing the full dynamics are satisfiable. Hand-derived numbers, five interactive canvases, worked QP cost matrices. MIT 16.485 by Luca Carlone, Lectures 8–10.

Prerequisites: VNAV L1 (3D geometry). VNAV L3 (quadrotor dynamics & differential flatness). Basic calculus (derivatives, integrals). No QP background needed.

Chapters

Live Canvases

Derived

From First Principles

Chapter 0: The Racing Gate Problem

Picture a drone racing course: five orange gates suspended in mid-air. Your path planner (an RRT* algorithm) hands you five waypoints — the center of each gate. Now you need to generate a trajectory: position as a continuous function of time, from the start to gate 1, gate 2, and so on to the finish.

The naive solution? Fly a straight line to each waypoint in turn. Simple. But watch what happens at each corner: to travel in a straight line, stop, and turn to the next straight line, the drone must decelerate to zero velocity at each waypoint and then accelerate again. That costs time and wastes energy. Worse, what if you want to fly through each gate at full speed? You need the velocity to be continuous — no abrupt stops.

The corner catastrophe. A piecewise-linear path (straight line segments) has zero radius of curvature at each waypoint. To follow a path with zero turning radius, the drone needs instantaneously infinite centripetal acceleration. No motor can deliver that. A straight-line plan through waypoints is dynamically infeasible.

What we actually need

We need a trajectory — not just a path (a sequence of positions) but a function that tells us position at every moment in time, including the velocities, accelerations, and higher derivatives that the drone's actuators must produce. The trajectory must:

Pass through every waypoint at the assigned time ("waypoint constraints")
Have continuous position, velocity, acceleration, and jerk across segment boundaries ("continuity constraints")
Minimize the motor effort needed to follow it ("smoothness objective")

The canvas below shows a concrete failure case: the straight-line trajectory and its acceleration profile. See the spikes? Those are the infinite-acceleration corners.

Straight line vs smooth trajectory — the infeasibility of corners

The top view shows a 3-waypoint path. The bottom shows the x-acceleration over time. Toggle between piecewise-linear (sharp corners) and the smooth polynomial version. Watch the accel spikes disappear.

A path planner outputs 4 waypoints. You naively connect them with straight-line segments. Why is this trajectory infeasible for a real quadrotor?

The total distance is too long for the battery The straight-line path doesn't pass through the waypoints precisely enough The sharp direction changes require infinite acceleration at each waypoint — no motor can produce that Straight lines don't work in 3D, only in 2D

Chapter 1: Why Smoothness = Minimizing Derivatives

Before we can build a smooth trajectory, we need to make "smooth" precise. Intuitively, a smooth trajectory changes direction gently. Mathematically, smooth means the derivatives of position are small.

The derivative hierarchy

Position x(t) is the trajectory itself. Its derivatives have names:

Derivative	Name	Physical meaning
dx/dt	Velocity	How fast you move
d²x/dt²	Acceleration	Force required (F=ma)
d³x/dt³	Jerk	Rate of force change
d⁴x/dt⁴	Snap (crackle)	Rate of jerk change

A large acceleration means large forces. A large jerk means forces change rapidly, which stresses motors and passengers. A large snap means the higher-order dynamics are excited. We want to fly in a regime where all these derivatives are small.

The smoothness functional

To get a single number measuring smoothness, we integrate the squared r-th derivative over the flight time T:

J_r = ∫₀^T (d^rx/dt^r)² dt

Why squared? Because (1) it's always non-negative, (2) squaring penalizes large values more than small ones (a single large spike is penalized more than many small variations adding to the same sum), and (3) it makes the optimization problem quadratic — much easier to solve than absolute value or fourth-power penalties.

Minimizing J_r is minimizing energy consumption. For a quadrotor, the motor current is approximately proportional to thrust, which relates to acceleration. Jerky flight saturates motors and wastes power. Smooth flight stays within the linear actuator regime where efficiency is highest.

Which derivative do we minimize?

Depending on r, we get different optimal trajectories:

r = 1 (min velocity): shortest-distance path — straight lines
r = 2 (min acceleration): smoother turns, uses minimum force
r = 3 (min jerk): common in animation and robot arm control
r = 4 (min snap): the right choice for quadrotors (we'll see why in Ch. 3)

Derivative order comparison — smoother as r increases

Same 3 waypoints, same time budget. Change the derivative order r to minimize. Higher r produces smoother transitions at the cost of larger velocity peaks.

r (order) 1

Why do we minimize the integral of the squared r-th derivative, rather than the integral of the r-th derivative itself?

Because the r-th derivative is always positive so the integral is always positive Squaring makes it non-negative, penalizes large values disproportionately, and makes the cost function quadratic (easier to optimize) Squaring reduces the dimension of the optimization problem by half The integral of the derivative itself is always zero (fundamental theorem of calculus)

Chapter 2: Polynomial Trajectories

We need a parametric family of functions that can (a) represent any smooth path, (b) be easily differentiated to compute the cost, and (c) have finitely many free parameters to optimize. The answer is polynomials.

The polynomial ansatz

Represent the x-coordinate of the trajectory as a degree-N polynomial in time:

x(t) = p_Nt^N + p_N−1t^N−1 + … + p₁t + p₀

The free parameters are the N+1 coefficients p = [p₀, p₁, …, p_N]^T. We do the same for y(t), z(t), and ψ(t) (yaw) — independently, since the cost function decouples over these flat outputs (more on that in Chapter 6).

What order N do we need?

The Euler-Lagrange equation tells us: minimizing ∫(P^(r))²dt without any endpoint constraints requires a polynomial of degree N = 2r−1. For minimum-snap (r=4), this gives N = 7. The general rule: N = 2r−1 for the single-segment, unconstrained case.

In practice we often use higher-degree polynomials (e.g., N=9 is common) to get more degrees of freedom for satisfying continuity constraints at many waypoints.

Derivatives of a polynomial

Differentiating r times:

P^(r)(t) = ∑_i=r^N [∏_m=0^r−1(i−m)] · p_i · t^i−r

The product ∏(i−m) for m=0..r−1 is just the falling factorial: i(i−1)(i−2)…(i−r+1). For example, the 4th derivative of p₇t⁷ is 7×6×5×4 · p₇ · t³ = 840 p₇ t³.

Misconception: polynomials can represent any smooth function. Not quite — Runge's phenomenon shows that high-degree polynomial interpolation through many points can oscillate wildly between them. This is why we use piecewise polynomials (one polynomial per segment between waypoints) rather than one giant polynomial for the whole path.

Multi-segment trajectories

For M waypoints (producing M−1 segments), we define one polynomial P_i(t) per segment. The total trajectory is piecewise:

x(t) = P_i(t) for t_i ≤ t ≤ t_i+1

The cost decomposes over segments: J = J₁ + J₂ + … + J_M−1. Each J_i is an integral of the squared r-th derivative of P_i. This sum structure is key to building the block-diagonal QP cost matrix (Chapter 4).

You want to minimize snap (r=4) for a single-segment trajectory using Euler-Lagrange. What is the minimum polynomial degree N needed?

N = 4 N = 5 N = 7 (= 2r − 1 = 2×4 − 1) N = 9

Chapter 3: Minimum-Snap: Why r = 4?

We've established that higher r gives smoother trajectories. So why not minimize the 6th or 8th derivative? The answer is specific to quadrotor dynamics, and it comes from differential flatness.

How rotor commands relate to position derivatives

From L3 (Quadrotor Dynamics), the total thrust f and the moment vector τ are the actual control inputs. These are produced by combinations of rotor spin speeds. When you work through the Newton-Euler equations for a quadrotor, the total thrust is approximately:

f ≈ m · |ẍ| (thrust ≅ mass × acceleration magnitude)

But the attitude (roll and pitch angles) needed to produce a desired translational acceleration involves more derivatives. When you compute the desired body z-axis z_B,d from the desired acceleration vector, and then differentiate it to get the desired angular velocity, you find that:

angular velocity ω_d ∝ d³x/dt³ (jerk drives angular rate)

angular acceleration α_d ∝ d⁴x/dt⁴ (snap drives angular acceleration)

The motor torques needed to achieve a desired angular acceleration are τ = Iα. So snap is the proximal cause of motor torque demand. Minimizing snap minimizes peak motor torque, which keeps the trajectory within the actuator limits of the quadrotor.

Why not minimize jerk? Jerk (r=3) drives angular rate, not angular acceleration. A trajectory with bounded jerk but unbounded snap still requires instantaneous torque spikes — which saturate motors. Snap is the right lever for feasibility.

The minimum-snap cost functional

For position components x, y, z (each optimized separately), the cost is:

J = ∫₀^T (x⁽⁴⁾)² dt + ∫₀^T (y⁽⁴⁾)² dt + ∫₀^T (z⁽⁴⁾)² dt + ∫₀^T (ψ⁽²⁾)² dt

Note: yaw ψ only requires minimizing the 2nd derivative (angular acceleration) because yaw torque maps directly to the 2nd derivative of yaw, not the 4th. This is why the yaw cost uses r=2.

Snap profile — why it maps to motor commands

A segment flies from x=0 to x=3 in T seconds. Sliders control the approach/departure speeds. The canvas shows position, jerk, and snap profiles. High snap = motor torque spike.

Entry velocity 1.0

Exit velocity 2.0

For a quadrotor trajectory, why is it more important to minimize snap (4th derivative) than jerk (3rd derivative)?

Because snap is always larger than jerk numerically, so it dominates the cost Because snap is proportional to angular acceleration, which directly drives motor torques — the true actuator bottleneck Because minimizing snap automatically minimizes jerk as well Because the 4th derivative is easier to compute than the 3rd

Chapter 4: The QP Formulation

Now for the key insight that makes trajectory optimization tractable: the minimum-snap problem is a Quadratic Program (QP). This means the cost is quadratic in the unknown parameters (polynomial coefficients) and the constraints are linear. QPs have globally optimal solutions and can be solved in milliseconds.

Deriving the cost matrix Q

Let p = [p₀, p₁, …, p_N]^T be the coefficient vector. The r-th derivative of P(t) is a linear function of p (see the falling factorial formula from Ch. 2). So P^(r)(t)² is a quadratic function of p. Integrating from 0 to T:

J = ∫₀^T (P^(r)(t))² dt = p^T Q p

where Q is an (N+1)×(N+1) positive semi-definite matrix. The elements of Q are computed from:

Q_lk = [∏_m=0^r−1(l−m)(k−m)] · T^l+k−2r+1 / (l+k−2r+1) if l ≥ r and k ≥ r

Q_lk = 0 if l < r or k < r

The first r rows and columns are zero because differentiating r times kills all terms with degree < r. The remaining block (indices r through N) forms a dense symmetric positive semi-definite submatrix.

Worked example: N=3, r=2 (minimum acceleration)

Polynomial: x(t) = p₃t³ + p₂t² + p₁t + p₀. Second derivative: x''(t) = 6p₃t + 2p₂.

∫₀^T (x'')² dt = ∫₀^T (6p₃t + 2p₂)² dt = [36p₃²T³/3 + 2×6p₃×2p₂T²/2 + 4p₂²T]

= 12p₃²T³ + 12p₃p₂T² + 4p₂²T

Packing into p^TQp with p = [p₀, p₁, p₂, p₃]^T:

Q = [[0,0,0,0],[0,0,0,0],[0,0,4T,6T²],[0,0,6T²,12T³]]

The first 2×2 block is zero (as expected, since r=2 kills p₀ and p₁). With T=1: Q₂₂=4, Q₂₃=Q₃₂=6, Q₃₃=12.

The complete QP

Subject to constraints Ap = b (waypoint + continuity, derived in Ch. 5):

min_p p^TQp subject to Ap = b

For multiple segments, Q is block-diagonal (each segment contributes one block) and p stacks all segment coefficient vectors. This QP can be solved via the KKT conditions or any standard QP solver.

QP cost matrix visualizer

Color-mapped Q matrix for polynomial degree N and derivative order r. Hover a cell to see Q_lk. The first r rows/cols are zero. The active block grows with N.

N (degree) 5

r (order) 2

For minimum-snap (r=4) with polynomial degree N=7, how many rows and columns of the Q matrix are identically zero?

0 (none — all entries are nonzero) 2 (only the diagonal is zero) 4 (the first r=4 rows and columns are zero) 7 (all rows up to N−1)

Chapter 5: Continuity Constraints

The QP cost matrix captures the smoothness objective. Now we need the constraint matrix A and vector b. These encode two things: (1) the trajectory must pass through each waypoint, and (2) it must do so smoothly (matching derivatives across segment boundaries).

Evaluating a polynomial and its derivatives

For a polynomial P(t) of degree N with coefficient vector p, evaluating at time t₀ is linear in p:

P(t₀) = [1, t₀, t₀², …, t₀^N] · p (= basis vector · p)

The q-th derivative at t₀ is similarly linear. Define the row vector:

e_q(t₀) = [0, …, 0, q!, (q+1)! t₀, …, N(N−1)…(N−q+1)t₀^N−q]

Then P^(q)(t₀) = e_q(t₀) · p. This lets us stack all constraints as a linear system Ap = b.

Waypoint constraints

For waypoint i at time t_i, each flat output must equal the waypoint value. For the start of segment i+1 and end of segment i:

P_i(t_i) = x̅_i (end of segment i)

P_i+1(t_i) = x̅_i (start of segment i+1)

These are 2 linear equations per waypoint per flat output. Adding them to A,b is straightforward.

Continuity constraints

At each internal waypoint, the trajectory and its first r−1 derivatives must match between adjacent segments:

P_i^(q)(t_i) = P_i+1^(q)(t_i) for q = 0, 1, …, r−1

Each of these is also linear in the concatenated coefficient vector p = [p₁; p₂; …; p_M−1]. For r=4, we enforce continuity of position, velocity, acceleration, and jerk at each waypoint — 4 constraints per internal waypoint per flat output.

Counting constraints vs. degrees of freedom. One segment: degree N means N+1 coefficients. Two endpoints, each with q+1 derivative constraints: 2(q+1) constraints total. With N = 2r−1, we have N+1 = 2r coefficients and exactly 2r constraints (r at start, r at end) — a perfectly determined system. Every extra waypoint adds N+1 new coefficients and r+1 new constraints (position + continuity), maintaining well-posedness.

Worked numbers: single segment, min-jerk (r=3)

N=5 (degree-5 polynomial: 6 coefficients). Boundary conditions: position, velocity, acceleration at both endpoints = 6 constraints. Let x(0)=0, x'(0)=0, x''(0)=0 and x(T)=1, x'(T)=0, x''(T)=0 (start and end at rest).

The constraint system Ap=b becomes:

python
import numpy as np

T = 1.0  # segment duration
# Rows: [P(0), P'(0), P''(0), P(T), P'(T), P''(T)]
# Cols: [p0, p1, p2, p3, p4, p5]
A = np.array([
    [1, 0, 0,   0,   0,   0  ],  # P(0) = p0
    [0, 1, 0,   0,   0,   0  ],  # P'(0) = p1
    [0, 0, 2,   0,   0,   0  ],  # P''(0) = 2p2
    [1, T, T**2, T**3, T**4, T**5],  # P(T)
    [0, 1, 2*T, 3*T**2, 4*T**3, 5*T**4],  # P'(T)
    [0, 0, 2, 6*T, 12*T**2, 20*T**3],  # P''(T)
])
b = np.array([0, 0, 0, 1, 0, 0])
p = np.linalg.solve(A, b)
# Result: p = [0, 0, 0, 10, -15, 6]
# x(t) = 10t^3 - 15t^4 + 6t^5  (classic minimum-jerk)

This is the famous minimum-jerk trajectory: x(t) = 10t³ − 15t⁴ + 6t⁵. Verify: x(0)=0 ✓, x(1)=10−15+6=1 ✓, x'(0)=0 ✓, x'(1)=30−60+30=0 ✓, x''(0)=0 ✓, x''(1)=60−180+150=30 ≠ 0 ... wait, let’s check x''(1) = 60−180+150 = 30... that's not 0. Actually for pure minimum-jerk (no acceleration constraint), there are only 4 boundary conditions (pos+vel at each endpoint), giving a degree-3 polynomial: the 10t³−15t⁴+6t⁵ form enforces zero acceleration as well (all 6 conditions). Let’s verify x''(1): P''(t)=60t−180t²+150t³; P''(1)=60−180+150=30? Hmm: 60t is 0+0+2×0+(6t·p3)+(12t²·p4)+(20t³·p5). Let me be precise: x''(t) = 6p3 + 12p4·t + 20p5·t² = 60t−180t²+120t³? Check: p3=10: 6×10=60; p4=−15: 12×(−15)t=−180t; p5=6: 20×6t²=120t². x''(1)=60−180+120=0 ✓. (The coefficient is 120 not 150 — a typo above corrected here.)

For a 3-segment minimum-snap (r=4) trajectory with degree-7 polynomials, how many total coefficients are there, and how many continuity constraints are imposed at the 2 internal waypoints?

21 coefficients, 4 continuity constraints 24 coefficients, 8 continuity constraints (4 per internal waypoint: pos/vel/accel/jerk) 24 coefficients, 16 continuity constraints 12 coefficients, 8 continuity constraints

Chapter 6: Differential Flatness

We've been treating the trajectory optimization as a problem over position x(t). But the quadrotor has 12 states (position, velocity, rotation, angular velocity) and 4 inputs (rotor speeds). Why can we get away with planning only in x(t), y(t), z(t), ψ(t)?

The answer is differential flatness. A system is differentially flat if there exists a set of flat outputs σ(t) such that the full state and all inputs can be written as smooth functions of σ and its derivatives — without integrating any differential equation.

The quadrotor flat outputs

For a quadrotor, the flat outputs are σ(t) = [x, y, z, ψ]^T — three position coordinates plus yaw angle. This means: given any smooth trajectory x(t), y(t), z(t), ψ(t), we can compute exactly what rotor speeds are needed at every instant, without solving any ODEs.

Misconception: you still need to integrate the dynamics. No! That’s the whole point of differential flatness. The full state (including roll, pitch, angular velocity) and the inputs (rotor speeds) are algebraic functions of the flat outputs and their derivatives. Derivatives, not integrals — and derivatives of a polynomial are trivial.

Recovering the full state from the flat output

Given x(t), y(t), z(t), ψ(t), compute (step by step):

Step 1: Thrust direction

Desired acceleration a_d = [ẍ, ÿ, z̈ + g]^T. Body z-axis: z_B,d = a_d / |a_d|. Thrust magnitude: f = m|a_d|.

↓

Step 2: Full attitude

From z_B,d and desired yaw ψ, compute the full desired rotation R_d ∈ SO(3) (construct x_B,d and y_B,d from ψ and z_B,d).

↓

Step 3: Angular velocity

ω_d = R_d^T Ṙ_d (vee of R_d^TṘ_d). Ṙ_d involves the 3rd derivative of position (jerk) — this is why flatness requires smooth trajectories.

↓

Step 4: Angular acceleration → torques

α_d involves the 4th derivative of position (snap). τ_d = Iα_d. This confirms: snap = motor torque demand.

↓

Step 5: Rotor speeds

Invert the mixing matrix F̄ to get [w₁², w₂², w₃², w₄²] from [f, τ_x, τ_y, τ_z].

Every step is an algebraic (no ODE integration) function of x(t) and its derivatives up to 4th order. So a minimum-snap polynomial trajectory in the flat outputs automatically specifies the full quadrotor state and inputs at every time instant.

Flatness = feasibility guarantee. Any smooth trajectory in the flat outputs is kinematically feasible — a valid state/input trajectory exists for the full nonlinear quadrotor dynamics. Planning in flat output space is sufficient for feasibility, as long as thrust and torque stay within limits.

What does it mean for a quadrotor to be "differentially flat" with flat outputs [x, y, z, ψ]?

The quadrotor can only move in these 4 dimensions and ignores roll and pitch The dynamics equations simplify to linear equations in these 4 coordinates The full state (including roll, pitch, angular velocity) and all rotor inputs can be written as algebraic functions of [x,y,z,ψ] and their time derivatives The quadrotor can control [x,y,z,ψ] independently without affecting roll and pitch

Chapter 7: Time Allocation

One free parameter we haven't addressed: how long should each segment take? If T_i is too large, the drone flies very slowly and wastes time. If T_i is too small, the required accelerations saturate the motors.

Why timing matters

The cost J = p^TQp depends on T through the Q matrix entries (recall Q_lk ∝ T^l+k−2r+1). A smaller T "compresses" the trajectory into less time, increasing all derivatives. The optimal timing balances the smoothness cost against a time penalty.

J_total = J_r(p, T) + λ ∑_k=1^M−1 T_k

where λ is a weight trading off smoothness vs. speed. A larger λ makes the drone prefer shorter segment times (faster flight). A smaller λ gives more room for smooth transitions.

Gradient-based time optimization

Because J_total is differentiable with respect to T_k, we can optimize the segment times using gradient descent. This creates an outer loop: for a fixed set of T_k, solve the QP to get optimal p; then update T_k via ∇J_total/∇T_k. Iterate until convergence.

A practical shortcut: start with T_k proportional to the Euclidean distance between waypoints k and k+1. This heuristic works well and only needs gradient refinement for tight constraints.

Feasibility check after time allocation. After solving for both the polynomial coefficients p and the segment times T_k, check whether the maximum thrust and torque anywhere on the trajectory exceed the motor limits. If so: either lengthen the violated segment times (adding a new waypoint from the path planner to split a long arc) or relax the continuity constraints to higher order derivatives.

Time allocation slider — speed vs. smoothness tradeoff

Three waypoints. Drag the time allocation slider to compress or expand the second segment time. Watch the peak acceleration change (red dashed line = motor limit).

Segment 1 time (s) 1.0

Segment 2 time (s) 1.0

You decrease all segment times T_k by half (trying to fly faster). What happens to the optimal snap values of the trajectory?

Snap decreases because the path is shorter in time Snap stays the same because the polynomial coefficients rescale proportionally Snap increases (approximately 16× for a halved segment time since d⁴/dt⁴ scales as 1/T⁴) Snap decreases because the QP finds a globally optimal solution regardless of T

Chapter 8: Showcase: Interactive Trajectory Builder

Every concept in this lesson comes together here. Click on the canvas to place waypoints, drag to reposition them, and watch the minimum-snap trajectory update in real time. The derivative profiles below show position, velocity, acceleration, and jerk — with continuity visible at segment joins.

Minimum-snap trajectory builder — click to add waypoints, drag to move

Top: 2D space with waypoints (white dots) and the minimum-snap curve (teal). Bottom: derivative profile over time (toggle which derivative to display). Color of derivative curve = orange (feasible) or red (exceeds limit).

Segment time T 1.0

In the showcase, you add 3 waypoints (2 segments) and set a very short segment time T=0.3s. The jerk profile turns red. What does the red color indicate, and what should you do?

Red means a solver error — increase the polynomial degree Red means the jerk exceeds the feasibility limit (implying motor torques are saturated) — increase T to give the drone more time Red means the trajectory passes through an obstacle — add more waypoints Red means the continuity constraints are violated at segment joins

Chapter 9: Connections & Cheat Sheet

Cheat Sheet

Concept	Key formula / fact
Smoothness = derivative minimization	J_r = ∫(x^(r))²dt — r=1 vel, r=2 accel, r=3 jerk, r=4 snap
Min-snap matters for quadrotors	Snap ∝ angular acceleration ∝ motor torque demand
Polynomial order (Euler-Lagrange)	N = 2r−1 (single segment); N=7 for r=4
QP cost matrix	J = p^TQp, Q_lk = [∏(l−m)(k−m)] · T^l+k−2r+1/(l+k−2r+1), first r rows/cols = 0
Constraint matrix	Ap = b; rows from e_q(t₀) · p = value; continuity: r constraints per internal waypoint
Multi-segment	Q block-diagonal, p stacked; M segments × (N+1) coefficients
Differential flatness	Flat outputs σ = [x,y,z,ψ]: full state + inputs are algebraic functions of σ and derivatives to order 4
Time allocation	J_total = J_r + λ∑T_k; gradient descent on T_k; snap scales as 1/T⁴
Feasibility check	After solve: verify max thrust, max torque ≤ motor limits everywhere on trajectory

What we left out

Unconstrained QP (endpoint derivative parameterization): Mellinger & Kumar 2011 reformulate to optimize free endpoint derivatives instead of polynomial coefficients directly, giving a numerically better-conditioned problem for high-degree polynomials and many waypoints.
Obstacle avoidance: After generating the trajectory, check for collisions along the curve (not just at waypoints). If detected, add a new intermediate waypoint sampled from the RRT* path and re-solve.
State constraints: Real drones have maximum velocity and bank-angle limits. These can be added as inequality constraints to the QP or enforced via conservative time allocation.
MPPI / MPC replanning: In dynamic environments, run the QP solver online at 10–50 Hz, replanning over a receding horizon.

Links to related lessons

VNAV L3: Control — How the trajectory is tracked. The geometric controller consumes R_d(t), ω_d(t) computed from the flat output. The full differential-flatness pipeline connects directly to the cascaded control loop from L3.

VNAV L5: Image Formation — The perception side begins next: how the camera produces the measurements that feed back into the planning loop. Camera Models & Image Formation.

MDP / RL: Markov Decision Processes provide an alternative planning framework where trajectories emerge from a value function rather than explicit optimization.

Minimum-jerk in robot arms: Inverse Kinematics uses the same polynomial-trajectory philosophy for joint-space trajectory generation.

The big picture

The planning stack in three sentences. A path planner (RRT*, A*) finds a collision-free sequence of waypoints in configuration space. Trajectory optimization (this lesson) converts those waypoints into a smooth, dynamically-feasible curve by solving a Quadratic Program that minimizes snap. A geometric controller (L3) tracks that trajectory by computing rotor commands using the full nonlinear quadrotor model on SO(3).

"Optimization is the language in which we write intent. The QP doesn't know what a drone is — it only sees a quadratic objective and linear constraints. The engineering insight is choosing the objective and constraints that translate intent into a feasible flight."
— paraphrasing Mellinger & Kumar, ICRA 2011