Interactive challenges
Production AI judgment you can inspect
These sanitized challenges reveal debugging judgment, unit economics, workflow dependency design, and artifact inspection without using private data or live execution.
Representative sanitized scenario. Customer data, private prompts, internal traces, and exact company costs omitted.
Trace diagnosis
Debug This Agent
Inspect a representative trace, identify the root cause, and compare the diagnosis with the production fix.
Cost architecture
Cost Anatomy
Toggle normalized cost models and see how routing, retries, sandbox reuse, and judge coverage change unit economics.
Workflow simulator
DAG Execution Simulator
Step through dependency state, judge failure, recovery decisions, and downstream readiness in a synthetic workflow.
Artifact boundary simulator
Deck IR Previewer
Edit synthetic Deck IR, inspect validation errors, and switch between preview, outline, and speaker notes.