Tokens, turns, and prompt caching

Every turn re-sends the prior conversation. The context-window widget makes that look free; it isn't. This timeline shows the bill actually growing, and how prompt caching keeps it tractable. The simulated session below is me (Pichaya “Micky” Puttekulangkura) poking at the garden over an evening. If you want the full classroom version of this material, I teach it at Mervia Academy.

The short version: every turn re-sends the full conversation, but prompt caching bills repeated tokens at roughly 10% of the regular rate, and cached entries expire after a 5-minute TTL. See anthropic.com/news/prompt-caching for details.
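To make the arithmetic concrete, here is a minimal sketch of that billing rule in Python. It is a toy model, not the real pricing formula: the function name `running_bill` and the sample session are made up for illustration, it counts input tokens only, it assumes writing to the cache costs the regular rate, and it assumes a gap longer than the TTL means the whole history is billed at full price again. Totals are in "full-price token equivalents" rather than dollars.

```python
from typing import List, Tuple

CACHED_RATE = 0.10   # from the text: repeated tokens bill at ~10% of the regular rate
TTL_MINUTES = 5.0    # from the text: cached entries expire after 5 minutes


def running_bill(turns: List[Tuple[float, int]], caching: bool) -> List[float]:
    """Running input bill after each turn, in full-price token equivalents.

    Each turn is (minutes_since_previous_turn, new_tokens).
    Hypothetical helper for illustration; see the assumptions in the text above.
    """
    totals: List[float] = []
    history = 0   # tokens sent in earlier turns, re-sent on every new turn
    bill = 0.0
    for gap, new_tokens in turns:
        # The history is discounted only if caching is on and the cache is still warm.
        cache_hit = caching and gap <= TTL_MINUTES
        repeat_rate = CACHED_RATE if cache_hit else 1.0
        bill += history * repeat_rate + new_tokens
        history += new_tokens
        totals.append(bill)
    return totals


# Four turns of 1,000 new tokens each, all within the TTL.
session = [(0, 1000), (1, 1000), (2, 1000), (1, 1000)]
print(running_bill(session, caching=False))
print(running_bill(session, caching=True))
```

For this four-turn session the uncached total comes to 10,000 token equivalents against 4,600 with caching. The uncached bill grows roughly with the square of the number of turns, while the cached one stays close to linear as long as turns arrive inside the TTL.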


$ claude

Advance one turn at a time and watch the billing total grow. Every turn re-sends what came before.

