
Good morning.
An operator I advise messaged me an hour after the announcement with one question: should he move his business onto Fable 5.
Fair question, and the wrong one.
The most useful thing about today's release hides in plain sight. There are two releases, not one, and underneath both sits a single model.
That one detail says more about the next two years than any benchmark in the announcement. Below: what shipped, why the split matters, and the read for an operator at your scale.
— Sam
IN TODAY’S ISSUE 🤖

Why there are two models and only one brain
The benchmarks that broke 90% for the first time
The capability that changes the work, not just the score
Why Anthropic split the release down the middle
The $50 question and the retention clause nobody opted into
The advantage was never the engine
Let’s get into it.

What Anthropic Shipped
Strip the names away and there is one model here, released under two configurations (Anthropic).
The naming is the product decision, and it's worth getting straight first, because most coverage will treat Fable and Mythos as two products.

Fable 5 is the public version. The dual-use domains (cybersecurity, biology, chemistry) are walled off and rerouted to an older model.
Mythos 5 is the same model with those walls lowered, released only to vetted defenders, a thin slice of enterprises, and the US government (TechCrunch).
Claude Fable 5 | Claude Mythos 5 | |
|---|---|---|
Who gets it | Public: API, enterprise, paid plans | Glasswing partners, vetted institutions, US government |
Guardrails | Cyber, bio, chemistry restricted and rerouted | Those restrictions relaxed |
Positioned as | Most powerful public model | "Strongest cybersecurity capabilities of any model in the world" |
The model underneath | Same generation, same architecture | Same generation, same architecture |
The two names encode a decision about access and trust. The model is the same underneath.
Despite the separate name, Fable 5 is the full-strength model. By Anthropic's own description, the only thing narrowed is its operating range for dual-use work (NBC News). The intelligence is identical. What changes is what you are permitted to point it at.
That reframes the question every operator is about to ask. "Do I have the best model" is close to settled the moment you open the API. The live question is what you are allowed to do with it.
The Capability Behind Both Names
The lineage is the Mythos generation that surfaced through a leak in March, which Anthropic called "the most capable model we've built to date." Its preview posted numbers notable as much for their margins as their absolute values:
Benchmark | Score | Lead over prior models |
|---|---|---|
SWE-bench Verified | 93.9% | +13 to 24 points |
SWE-bench Pro | 77.8% | n/a |
Terminal-Bench 2.0 | 82% | n/a |
USAMO 2026 | 97.6% | n/a |
For the public model, the third-party read matters more than the vendor's. Analytics firm Hex reports Fable as the first model to break 90% on its benchmark of complex, long-running analytical tasks, a ten-point jump over Opus 4.8, with "strong judgment and attention to nuance" on the hardest questions (NBC News).
Forget the score for a second. The capability that changes the work is duration.
Fable 5 is built to run continuously for days on complex, multi-stage tasks: planning across stages, delegating to sub-agents, writing its own tests, checking its own work. Anthropic's framing is "days-long, complex, and asynchronous tasks previous models couldn't sustain" (Anthropic).
Felix Rieseberg, who leads Claude Cowork and Claude Code Desktop, called it the start of a "third era," past chatbots and past copilots, toward agents that take on work at the timescale of a team (Felix Rieseberg, X).
Mythos 5, the partner-only version, carries one more claim worth flagging even though you can't touch it. Anthropic describes it as its first model to reliably generate novel, compelling scientific hypotheses, with breakthroughs claimed in drug design and molecular biology (NBC News).
The caveat sits inside the claim: it's Anthropic's own, the model is restricted, and no independent validation exists yet.
If it holds, the interesting line is the move from AI that retrieves and recombines existing knowledge to AI that adds to the frontier. That is a different category of tool, and it would compound.
Why They Split It
The split is deliberate containment. The Mythos generation can autonomously discover zero-day vulnerabilities in production software used by billions of people.
In testing it found and exploited previously unknown critical flaws across major operating systems, browsers, and cryptography libraries, reproducing working exploits on the first attempt in over 83% of cases, including a 27-year-old flaw in OpenBSD (ArmorCode).
So Anthropic did two things at once. It put the public envelope (Fable) in everyone's hands, and it routed the unrestricted model (Mythos) to defenders first through Project Glasswing:
Launched in April with roughly 50 vetted organizations, among them AWS, Apple, Microsoft, Google, CrowdStrike, Palo Alto Networks, JPMorgan, and the Linux Foundation.
In their first 30 days, partners found more than 10,000 high- or critical-severity vulnerabilities, and Cloudflare alone reported a tenfold jump in its bug-discovery rate.
The logic is preemptive, and Anthropic states it plainly: within six to twelve months, other labs will likely have Mythos-class models, and some may release them without equivalent safeguards. Deploy defense before offense catches up.
The policy weather moved the same direction in the same week. A Politico piece framed frontier AI and cybersecurity as a "hurricane warning," and a new executive order set up a voluntary framework for labs to share advanced models with the government up to 30 days before public deployment (NBC News).
For an operator the takeaway is concrete. The same capability that finds zero-days for defenders is the capability others will eventually turn on your stack.
The window to harden a web-facing business, while the strongest offensive tools are still gated, is open now and closing on Anthropic's own clock.
What It Costs, and What It Commits You To
The release also carries the pricing and policy fine print that decides whether you can build on it.
Item | Detail |
|---|---|
Pricing | $10 / million input tokens, $50 / million output (2x Opus 4.8) |
Prompt caching | 90% input discount applies |
Paid plans (Pro / Max / Team) | Free until June 22, then usage credits |
Available now | Claude API ( |
Two lines in that fine print matter more than the headline number.
The price and the free window are a signal. Anthropic was direct that serving a model this expensive at consumer scale strains compute, which is why it leaves base plans after June 22 (Anthropic). Treat frontier inference as a metered, moving cost.
Uber learned the live version earlier this year, burning its entire annual AI budget in four months and then capping spend per employee (IT Pro / IDC). Route deliberately: the frontier model for work that genuinely needs days-long reasoning or vision, cheaper models for the rest.
Retention now applies to everyone. A mandatory 30-day data retention policy covers all Fable 5 traffic, including enterprises that previously held zero-retention agreements. Anthropic says the data defends against jailbreaks, not training.
Take them at their word and the implication holds: access to frontier capability now carries monitoring you can't switch off.
If you handle regulated, confidential, or client-sensitive data, your posture changed this morning. Decide what you will feed this model before someone on your team decides it by pasting the wrong thing into a chat window.
One more, in the plumbing. Anthropic requires API customers to configure a fallback path so domain-restricted prompts reroute cleanly. The model under your product is a permissioned component now, and you build around its boundaries.
The Advantage Was Never the Engine
When the engine is the same for everyone, the engine can't be your advantage. Owning the frontier model is table stakes the day it ships to the public API.
The advantage is the envelope you build around it:
Context that makes the model act like your business rather than a generic assistant.
Judgment about what to point it at, where to trust it, and where to stop it.
Proprietary data and process a vendor will never ship to your competitor, because it belongs to you.
Anthropic just demonstrated the principle at its own scale. One model, and the entire difference in what it can do comes from the configuration wrapped around it. Your version of that wrapper is your context files, your data, and the taste only you can supply.
The practical moves follow from there:
Fix your data posture for a world where what you send is retained.
Build for model-swappability, so a price change or a capability jump is a config edit instead of a rebuild.
Set a routing discipline so frontier inference goes only where it earns its cost.
Spend the saved hours on context and judgment, the part that decides whether the output is something only your business could have produced.
The pattern under all of it is the one researchers keep finding: the bottleneck on AI value is organizational design more than model access (Stanford AI Index 2026).

The benchmarks will be stale in a quarter. The structure won't.
Anthropic shipped one model under two names and said out loud that the capability is the commodity and the envelope is the product.
The operators who internalize that stop asking which model to buy and start building the thing no vendor can hand them:
The system around the model that makes its output theirs.
That system, the context and the judgment and the data underneath it, is most of what we get into in Cortex.
Until next time,
Sam Woods
The Editor
.

