The privileged state is $\mathbf{s}_t = [p_x,\,p_y,\,\theta]$, where the car is controlled by a continuous angular velocity $a_t \in \mathcal{A} = [-a_{\max},\,a_{\max}]$, with $a_{\max} = 1.25\,\mathrm{rad/s}$. The forward speed is fixed at $v = 1\,\mathrm{m/s}$, and the system updates every $\Delta t = 0.05\,\mathrm{s}$. To capture the system's stochasticity, the sign of the control input may be flipped at each step. A red boundary in the right video highlights exactly when the action's sign is flipped, making each occurrence of the "naughty" intervention visually clear.
Specifically, $\delta_t = -1$ with probability $0.4$ and $\delta_t = +1$ with probability $0.6$. The ground-truth criterion that defines the failure set is given by
This yields a circular failure region of radius $0.25\,\mathrm{m}$ centered at the origin. We train a diffusion-based world model following the EDM framework using 4000 randomly sampled trajectories, and subsequently learn the criterion function atop the model's latent space. Steering utilizes the learned safety margin as the objectiveβmaximizing it for optimistic, or minimizing it for pessimistic steering. The exponential growth of possible outcomes from the sign-flip stochasticity poses a significant challenge for long-horizon steering.
Given a fixed initial state and a fixed action sequence, stochasticity in dynamics lead to multiple different outcomes in world model generations. Noise optimization aims to choose an initial noise of the world model to steer the world model means selecting the initial noise so that the generated trajectory moves toward a chosen extremum of the safety criterion. Use the Pessimistic / Optimistic toggle to compare two searched noise patterns that target opposite ends of that criterion. The red and blue points on the left mark those two candidate initial noises; other gray points are random samples for context.
Click Refresh to draw a new $\mathbf{s}_0$ and new actions $(a_t)$. Click any dot in the noise plot to play the corresponding stochastic trajectory on the right; playback loops until you press pause. Turn on Possible trajectories (forward reachable set) to see where rollouts can spread under the true stochastic dynamics.
(This is an analytical demo with exact dynamics, not WM generations.)
Now, letβs steer the world model! We optimize the initial noise of the naughty Dubins car world model to maximize or minimize the safety margin: pessimistic steering searches for worst-case outcomes, while optimistic steering seeks best-case trajectoriesβboth of which can be difficult to discover under uniform random initialization (gray points). For comparison, we also sample N = 20 random rollouts per scenario. Click Refresh scenario to load a new trajectory, or select any point in the noise scatter to visualize its rollout.
Note: The video on the right shows open-loop imagination from the learned world model, not ground-truth trajectories. As such, generations and steering results may be imperfect due to model bias or error.
Why is it hard? This is a long-horizon autoregressive rollout. The exponential branching from stochastic dynamics makes it practically impossible for Best-of-N sampling to discover the dangerous modes of the imagined distribution.
We steer the world model's imagination using only the initial observation and action sequence, then classify the trajectory as a predicted failure if the minimum predicted safety score along the rollout is below zero. Noise optimization with the typical-set regularizer achieves high TPR while keeping FPR near the nominal level. Both noise optimization without the regularizer and classifier guidance trivially drive the safety score downward, but with a high false-positive rate β indicating physically implausible imaginations.