Uses getModelContextWindow() to override CONTEXT_WINDOW in the env
passed to the Claude Code subprocess. Local/constrained models (claudecode,
cc/qwen72b, etc.) now self-limit their prompt budget to their actual VRAM
context ceiling instead of using the global 160k default.