Anthropic Launches Claude Sonnet 5, Closing the Gap With Opus

July 1, 2026Provided by Utku Ege Tuluk

Anthropic released Claude Sonnet 5 on June 30, 2026 — its “most agentic Sonnet model yet,” bringing performance close to the flagship Opus 4.8 while running at a fraction of the cost. The mid-tier model posts large gains over Sonnet 4.6 in coding, reasoning, tool use, and knowledge work, and ships with introductory pricing of $2 per million input tokens through August 31. It is now the default model for Free and Pro plans and is available across the Claude apps, Claude Code, and the API.

Intermediate

Claude Sonnet 5 announcement artwork: a botanical illustration of the numeral 5 formed from leaves and flowers on a pale green background — Image credit: Anthropic

Near-Opus Performance at Mid-Tier Cost

The headline of this release is the shrinking gap between the Sonnet and Opus tiers. On agentic coding (SWE-bench Pro), Sonnet 5 scores 63.2% — up sharply from Sonnet 4.6’s 58.1% and closing in on Opus 4.8’s 69.2%. On Terminal-Bench 2.1 it jumps to 80.4% (from 67.0%), landing just behind Opus 4.8’s 82.7%. On knowledge work (GDPval-AA v2), Sonnet 5 actually edges out the flagship, scoring 1,618 to Opus 4.8’s 1,615 and Sonnet 4.6’s 1,395.

Benchmark comparison table showing Claude Sonnet 5, Sonnet 4.6, and Opus 4.8 across agentic coding, reasoning, computer use, and knowledge work — Image credit: Anthropic

The gains extend to reasoning and computer control. On Humanity’s Last Exam, Sonnet 5 reaches 43.2% without tools and 57.4% with tools — nearly matching Opus 4.8’s 57.9% tool-assisted result and far ahead of Sonnet 4.6’s 46.8%. On OSWorld-Verified, a computer-use benchmark, it scores 81.2%, up from 78.5%.

Built for Agents That Run on Their Own

Anthropic frames Sonnet 5 as an execution engine for autonomous agents. “It can make plans, use tools like browsers and terminals, and run autonomously at a level that, just a few months ago, required larger and more expensive models,” the company wrote. The model completes complex multi-part tasks end-to-end and self-checks its own outputs without being explicitly prompted to do so.

Where Sonnet 5 stands out is the cost-performance curve. Because agentic workloads run the model many times over, price per task matters as much as raw capability. The charts below show Sonnet 5 delivering Opus-tier pass rates at a lower cost per task across effort levels — for many agentic search and computer-use workloads, it sits on or near the Pareto frontier.

Line chart of agentic search performance (BrowseComp) versus cost per task, comparing Sonnet 5, Opus 4.8, and Sonnet 4.6 across low to max effort levels — Image credit: Anthropic

Line chart of agentic computer-use performance (OSWorld-Verified) versus cost per task, comparing Sonnet 5, Opus 4.8, and Sonnet 4.6 — Image credit: Anthropic

Pricing and Safety

Sonnet 5 launches with introductory pricing of $2 per million input tokens and $10 per million output tokens through August 31, 2026, after which standard rates of $3 and $15 take effect. Even at standard pricing, it undercuts Opus 4.8 ($5 / $25) while roughly matching Sonnet 4.6’s old price point. The model supports a 1M-token context window.

On safety, Anthropic reports that Sonnet 5 exhibits lower rates of undesirable behaviors than Sonnet 4.6 — it refuses malicious requests more cleanly, resists prompt-injection attacks better, and hallucinates and sycophants less often. Notably, it is deliberately less capable than the Opus line at dangerous cybersecurity tasks, and ships with cyber safeguards enabled by default.

What This Means

Sonnet 5 continues the pattern established by Sonnet 4.5 and 4.6: the mid-tier model keeps absorbing capabilities that used to be exclusive to the flagship. For developers building agentic workflows, coding assistants, and long-running automation, the practical takeaway is that the default choice for most production traffic can now be the cheaper model without a meaningful quality tradeoff — reserving Opus 4.8 for the hardest tasks. With the introductory pricing running through the end of August, the cost gap is even wider in the near term.

Related Coverage

Introducing Claude Sonnet 4.6: Flagship Performance at Mid-Tier Cost — the previous Sonnet release, February 2026
Anthropic Releases Claude Opus 4.8 for Longer Agentic Coding — the flagship Sonnet 5 is benchmarked against
Introducing Claude Sonnet 4.5 — where the modern Sonnet line began