Adds a MODEL_CONTEXT_OVERRIDES map and a getModelContextWindow() helper so
local/constrained models (claudecode, cc/qwen72b, ollama-local/*) can declare
their VRAM-limited context budget. claude-sdk.js will stamp this budget into
the subprocess env so Claude Code self-limits instead of overflowing the
model's context window.
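
A minimal sketch of how the map and helper could fit together, not the
actual implementation. The token budgets, the DEFAULT_CONTEXT_WINDOW
fallback, the buildSubprocessEnv() helper, and the MODEL_CONTEXT_WINDOW
env var name are all assumptions made for illustration; only the model
keys and the two exported names come from this change.

```javascript
// Token budgets per model key. The numbers are hypothetical placeholders;
// real values depend on the VRAM available to each local model.
const MODEL_CONTEXT_OVERRIDES = {
  'claudecode': 32768,
  'cc/qwen72b': 32768,
  'ollama-local/*': 8192, // trailing "/*" matches any model under the prefix
};

// Assumed fallback for models with no override (hypothetical value).
const DEFAULT_CONTEXT_WINDOW = 200000;

function getModelContextWindow(model) {
  if (typeof model !== 'string') return DEFAULT_CONTEXT_WINDOW;

  // An exact key match wins over any wildcard entry.
  if (Object.prototype.hasOwnProperty.call(MODEL_CONTEXT_OVERRIDES, model)) {
    return MODEL_CONTEXT_OVERRIDES[model];
  }

  // Otherwise try "prefix/*" entries, e.g. "ollama-local/*".
  for (const [pattern, tokens] of Object.entries(MODEL_CONTEXT_OVERRIDES)) {
    if (pattern.endsWith('/*') && model.startsWith(pattern.slice(0, -1))) {
      return tokens;
    }
  }
  return DEFAULT_CONTEXT_WINDOW;
}

// Hypothetical helper showing how a caller might stamp the budget into a
// subprocess environment. MODEL_CONTEXT_WINDOW is an illustrative name,
// not a documented Claude Code variable.
function buildSubprocessEnv(model, baseEnv = process.env) {
  return {
    ...baseEnv,
    MODEL_CONTEXT_WINDOW: String(getModelContextWindow(model)),
  };
}
```

Checking exact keys before wildcard prefixes lets a single model under
ollama-local/ receive its own budget later without changing the lookup.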