How to Manage Claude Tokens
Token Basics

Context window explained

What "context window" means

The context window is the maximum number of tokens Claude can "see" at once — both your input and its output combined. Claude Sonnet supports 200K tokens. Claude cannot reference anything outside this window.

What fills the window

  • Your system prompt (runs every turn)
  • The full conversation history
  • Any documents or files you paste in
  • Tool definitions and results
  • Claude's response

Once you hit the limit, older messages get truncated or the API returns an error. Long multi-turn conversations are the #1 silent credit drain in production apps.

operator note

Build a conversation summarization step that fires before you hit 50% of the context window. This keeps costs predictable.

Changelog · 1
  • Initial release — 5 sections, 11 lessons.