Skip to content

Replay bundles

To prove a fix works, you have to re-run the exact situation that failed. A replay bundle is the self-contained snapshot that makes that possible.

Lumni assembles a bundle from a failing run:

  • Inputs — the original user request and any run parameters.
  • Tool schemas & responses — what each tool was called with and what it returned, so tools can be replayed or mocked faithfully.
  • Context snapshot — the prompt, retrieved documents, and history that reached the model.
  • Environment — model id and version, dependency versions, and the policy version in force.

The point is determinism: replay the bundle and you should reproduce the original failure before you change anything.

Agent typeReplay support
LangGraphNative, stateful replay of the graph
SDK-instrumented runnersBring-your-own runner via a replay adapter
Deterministic agentsAnything with fully captured inputs and tool I/O

Non-deterministic external calls are handled by replaying the captured tool responses from the bundle rather than hitting live services — so a replay is safe and repeatable.

Once you have a bundle that reproduces the failure, test a fix against it.