Claude Opus 4.8
About this model
Claude Opus 4.8 is Anthropic's flagship generally available model, released May 28, 2026, positioned for complex reasoning, long-horizon agentic coding, and high-autonomy enterprise work. It supports a 1M-token context window by default on the Claude API, Amazon Bedrock, and Vertex AI (200K on Microsoft Foundry), 128K max output tokens, adaptive thinking, and multimodal inputs including images and files. It ships alongside Claude Opus 4.8 Fast, offered as a research preview on the API.
Anthropic frames Opus 4.8 as a direct build on Claude Opus 4.7, keeping the same pricing and context window while focusing on quality. The company's documentation lists targeted improvements in long-horizon agentic coding: better long-context handling, fewer compactions, and improved compaction recovery. It also addresses a known Opus 4.7 issue—skipping required tool calls—with better tool triggering, and lowers the minimum cacheable prompt length to 1,024 tokens.
On reported evaluations, Anthropic states Opus 4.8 is roughly four times less likely than Opus 4.7 to let a code flaw pass unflagged, and emphasizes improved honesty—avoiding overconfident claims of progress.
The model extends a steady cadence over Claude Opus 4.6 and Claude Opus 4.5. A new mid-task instruction-update feature lets developers adjust permissions, token budgets, or environment context as an agent runs, without breaking the prompt cache.
This About section is AI-generated from public sources (Claude Opus 4.8), with no human editing. It may contain inaccuracies — verify critical details against the sources listed above.
Data sources: Venice API · HuggingFace · Wikipedia — enrichment updated 1d ago