Claude Sonnet 4 has been upgraded, and it might probably now bear in mind as much as 1 million tokens of context, however solely when it is used through API. This might change sooner or later.
That is 5x greater than the earlier restrict. It additionally implies that Claude now helps remembering over 75,000 strains of code, and even tons of of paperwork in a single session.
Beforehand, you have been required to submit particulars to Claude in small chunks, however that additionally meant Claude would overlook the context because it hit the restrict. With as much as a 1 million context restrict, you may construct higher apps, and Claude can bear in mind extra of your code than ever.
It’s price noting that the 1 million context restrict is proscribed to Sonnet 4. Opus 4.1 nonetheless has the previous limitations as a result of it is an costly mannequin.
Solely API will get 1 million tokens context restrict
The brand new context restrict is rolling out through the Anthropic API for purchasers with Tier 4 and customized price limits, with broader availability rolling out over the approaching weeks.
“Lengthy context can be out there in Amazon Bedrock and is coming quickly to Google Cloud’s Vertex AI,” Anthropic famous.
“With 1M tokens you may: load total codebases with all dependencies, analyze tons of of paperwork directly, and construct brokers that keep context throughout tons of of software calls. Pricing adjusts for prompts over 200K tokens, however immediate caching can scale back prices and latency.”
Claude’s cellular and internet apps will likely be getting the 1 million token context restrict sooner or later sooner or later.