Real Mooncake baseline bundled with Tracerator.
Parameters
Higher → more simulated prefix reuse / cache hits
Extra independent sessions added
Blend in additional modeled requests
Reshapes input-token buckets for prefill and KV reuse studies
For reproducible output
KV cache planning
Preview computes policy curves from the generated trace
Live estimates
Instant feedback — backend recomputes exactly on generate
Est. requests
12031
Total lines in trace.jsonl
Est. time span
3537s
Rough trace duration from first to last
Est. hit ratio
50%
Derived from reuse bias
Est. peak concurrency
28
Max concurrent in-flight (scaled)
Estimated ISL distribution
Input length buckets that will be written into trace.jsonl
Est. zip size ≈ 12 MB. trace.jsonl is JSON Lines (one record per line). Files >10–20 MB are normal at higher scales — most GUI editors (TextEdit, basic JSON viewers) will fail or hang. Use head, jq, pandas read_json(..., lines=True), or VS Code. The generated zip now includes a README.txt with examples.
zip contains trace.jsonl + manifest.json + isl_distribution.svg/txt + README.txt
The live numbers are client-side approximations that mirror the backend formulas. Generation always uses the authoritative server-side logic seeded for repeatability. Generated and augmented traces are simulations for planning and replay experiments; they preserve selected workload characteristics but do not necessarily represent an exact production workload profile. See README for details and the full parameter contract.