Agent & Protocol · Toolkit
Context & Memory Engineering
SIMULATEDVerified Jul 2, 2026The same multi-turn task, four ways to carry the context. Drag the turns up and watch full-dump's cost and overflow risk climb while the others trade a little fidelity for a lot of headroom.
Same instrument · three industries — pick a use-case to reconfigure the run
Full dump
Send the whole history every call.
$66/1k calls
Summarize
Rolling summary + last two turns.
$36/1k calls
Compress
Semantic compression of history.
$39/1k calls
Sub-agent handoff
Only the current sub-task's brief.
$21/1k calls
It's a dial, not a default
Steering-committee takeaway: Context strategy is a cost-fidelity dial. I set it per use case, not per platform.
How this is built
Per strategy, context/call grows differently with turns (full = linear, summarize = ~flat, compress = log, handoff = flat-small). Cost ≈ context × input price; fidelity and risk are modeled per strategy; overflow triggers when context exceeds a 24k working window. Memory retention applies each policy's eviction rule to a fixed fact set.
Stack: Next.js (static) + shared design system; deterministic client-side.
Limitations: token sizes and fidelity are illustrative; real numbers depend on the model, the summarizer, and the task. It shows the trade-offs' shape, not a benchmarked comparison.