Claude Fable 5 Review: Anthropic's Mythos-Class Flagship
Claude Fable 5 is Anthropic's most capable generally-available model, released June 9, 2026. It is the consumer-facing half of a deliberately split release. Fable 5 is a "Mythos-class" model made safe for general use. Claude Mythos 5 is the same underlying model with safeguards lifted in some areas, restricted to vetted cyberdefenders through Project Glasswing and not available to the public. Same model, two safeguard levels, two names. Fable 5 is the one you can actually buy. The headline numbers are real and, in places, genuinely impressive: 95.5% on SWE-bench Verified, multi-day autonomous task runs, persistent memory, and a vision system that beat Pokemon FireRed from raw screenshots. The catch is a conservative safety filter that quietly downgrades roughly one in twenty of your prompts to the older Opus 4.8, plus a set of agentic and evaluation-awareness behaviors that a credible review has to name out loud. This is that review.
Quick verdict. If you do agentic coding, long-horizon automation, or vision-heavy knowledge work, Fable 5 is worth adopting now at $10/$50 per million tokens, and it is effectively free on Pro, Max, and Team plans until June 22, 2026. If your work is open-ended biology or chemistry ideation, heavy quantitative finance modeling, or anything that trips the safety classifier often, wait and watch: you will hit the Opus 4.8 downgrade and the rough edges before you hit the ceiling.
What Claude Fable 5 Actually Is
Start with the naming, because it is the most unusual thing about this launch and the part most coverage gets wrong. Anthropic built one model and shipped it under two names that differ only by how locked-down they are. Fable 5 is the safeguarded edition, generally available, and the subject of this review. Mythos 5 is the same weights with several safeguards lifted, handed only to vetted organizations defending critical software infrastructure. If you are reading this as a developer, a team lead, or a curious power user, Fable 5 is your model. Mythos 5 is not something you can sign up for.
That split is a governance decision, not a marketing flourish. The capabilities that make a frontier model good at finding and fixing software vulnerabilities also make it good at writing exploits. Anthropic's answer is to ship the generally-useful version with guardrails on and keep the unguarded version inside a restricted-access program. Whether that is the right line to draw is a real debate. What is not in dispute is that the public gets Fable 5 and a clearly-stated set of constraints, which is more transparency than most model launches offer.
On the capability ladder, Fable 5 sits above Claude Opus 4.8, which does not retire. Opus 4.8 stays in service as the fallback model the safety system routes to when a classifier fires. So buying into Fable 5 also means you are implicitly buying a path back to Opus 4.8 on a meaningful slice of requests. That is the design, stated plainly, and it shapes how you should think about reliability for any single task.
Where It Sits Against the Competition
Fable 5 is Anthropic's entry against OpenAI's GPT-5.5 and Google's Gemini 3.1 Pro at the top of the frontier tier. Anthropic positions it as the leader on agentic coding and long-horizon autonomy specifically, rather than as a uniform best-at-everything model. That is a narrower and more honest claim than the usual launch rhetoric, and the benchmark spread below mostly supports it, with the caveats that always apply to vendor-reported numbers.
On the context window. Anthropic describes Fable 5 as staying focused across "millions of tokens," and internal evaluation notes reference both one-million and ten-million token figures. The company has not published a single official maximum. Treat the exact ceiling as unstated rather than assuming a number, and confirm the limit for your access tier before designing a workload around it.
The Capability Story
Strip away the launch language and four capabilities carry Fable 5: agentic coding, long-horizon autonomy, persistent memory, and vision. Each is backed by a concrete claim rather than an adjective, which makes them testable and worth taking seriously.
Long-Horizon Autonomy and Memory
The most material change over Opus 4.8 is that Fable 5 holds a goal across very long tasks. Anthropic reports it can run for days, and that its lead over prior models grows as task length and complexity increase. The memory system is file-based and persistent, so the model writes state to disk and reads it back rather than relying solely on context. The cleanest demonstration is an odd one: on the game Slay the Spire, persistent memory improved Fable 5's performance three times more than it improved Opus 4.8, and Fable 5 reached the final act three times more often. That is a narrow result, but it points at something real about sustained, stateful problem-solving rather than one-shot answers.
Vision That Does Real Work
Anthropic claims a new state of the art on vision, and the examples are specific rather than abstract. Fable 5 beat Pokemon FireRed start to finish using only raw screenshots, with no map data and no game-specific harness. It rebuilds web-application source code from screenshots of the rendered page, and it extracts precise numbers from scientific figures. For anyone doing UI work, document processing, or data extraction from charts, that last capability is more useful day-to-day than beating a video game from 2004.
Agentic Coding and Knowledge Work
Coding is where Fable 5's headline benchmarks land, and where Anthropic stakes its leadership claim. Beyond raw code generation, Anthropic and independent partners report strong results on legal drafting and markup and on finance reasoning. Hex, an analytics platform, reports Fable 5 as the first model to break 90% on its complex long-running analytical benchmark, roughly a ten-point jump over Opus 4.8. Hebbia reports the highest score of any model on its senior-level finance reasoning benchmark. Both of those are independent partners, which matters, and both are covered in the benchmark section below with that attribution attached.
Benchmarks, Labeled Honestly
Two rules govern this section. First, every benchmark below is labeled Anthropic-reported or independent, because self-reported headline scores are a marketing artifact until a third party rechecks them. Second, these numbers are not comparable across differently-configured harnesses. A score run with a four-hour agent timeout and full tool access is not the same test as a default run, and a vendor's harness is tuned to its own model. Read the direction and the gap, not the decimal.
The Safety System and the Dual-Name Design
This is the part of Fable 5 you have to understand before you adopt it, because it changes how the model behaves on a measurable fraction of requests. Fable 5 ships under ASL-3 protections and is classified CB-1, meaning it is capable with non-novel chemical and biological weapons information but does not cross the CB-2 threshold for novel weapons. That classification is why the safeguards exist.
How the Classifier Works
Anthropic runs conservative classifiers that watch for cybersecurity, biology, chemistry, and model-distillation requests. Here is the important detail: when a classifier triggers, the query is not refused. It is routed to Claude Opus 4.8, the next-most-capable model, and you are told it happened. So the failure mode is not a hard wall. It is a silent-ish downgrade to a previous-generation model. For sensitive-but-legitimate work, you still get an answer, just from a weaker model than the one you are paying for.
Anthropic tuned the classifier conservatively to ship fast, and says it triggers in under 5% of sessions on average, a figure that explicitly includes false positives on benign prompts. Users have reported downgrades on requests as harmless as a pulled-pork shopping list and basic low-level software-engineering questions. That is the cost of conservative tuning, and it is a real cost: on roughly one in twenty sessions you are not getting Fable 5, you are getting Opus 4.8, sometimes for no good reason.
Mythos 5 and Project Glasswing
The same model with safeguards lifted in some areas is Claude Mythos 5, and it is restricted to vetted partners defending critical software infrastructure through Project Glasswing, a collaboration that includes the US government. Access is expanding through a trusted-access program into areas like biomedical research, with roughly 150 organizations being added. Named participants include Dragos, Tenable, Trend Micro, Netskope, BeyondTrust, Rubrik, BT, Intercontinental Exchange, and Hitachi.
The capability that justifies all of this is concrete. On a Firefox exploit-development evaluation, the unsafeguarded Mythos 5 scored 88.4% against Opus 4.8's 8.8%. That is not a typo and not a rounding gap. It is the clearest single illustration of why Anthropic split the release: a model that good at building exploits is dual-use by definition. Framed factually, the safeguards plus the restricted-access program are Anthropic's attempt to keep the offensive edge inside organizations that defend infrastructure rather than handing it to everyone with a credit card. You can reasonably debate whether that boundary holds. You cannot reasonably call the precaution unnecessary given an 88.4% exploit-development score.
What this means for you. If your work never touches security, bio, chem, or distillation topics, you may never see a downgrade. If it does, even tangentially, build for the reality that some fraction of your calls will be answered by Opus 4.8, and test your prompts against both models before you depend on Fable 5's ceiling.
Pricing and Access
Fable 5 is priced at $10 per million input tokens and $50 per million output tokens, which Anthropic notes is less than half the price of the earlier Claude Mythos Preview. It is available now through the Claude API, Claude Code, claude.ai, and all major cloud platforms. For most readers, the more relevant detail is the plan inclusion and its expiry date.
The Free Window Has a Hard Date
Fable 5 is included in paid Claude plans (Pro, Max, and Team) at no extra cost until June 22, 2026. From June 23, 2026, usage on those plans requires credits. That is a two-week runway from launch, so if you want to evaluate Fable 5 against your real workload without consuming credits, the window to do it is now and it is short. Date-stamp this: pricing and the free window were verified June 9, 2026, and Anthropic can change either.
| Model | Input (per MTok) | Output (per MTok) | Notes |
|---|---|---|---|
| Claude Fable 5 | $10 | $50 | Generally available; free on Pro/Max/Team until Jun 22, 2026 |
| Claude Mythos 5 | Restricted | Restricted | Project Glasswing partners only; not generally available |
Where You Can Run It
- Claude API (console.anthropic.com) for direct integration.
- Claude Code for agentic coding workflows, where Fable 5's long-horizon strength shows up most.
- claude.ai for the chat interface on paid plans.
- All major cloud platforms, so existing AWS, Google Cloud, or Microsoft contracts route through familiar billing and IAM.
A pricing caveat worth stating. The $10/$50 rate is for Fable 5 specifically. On the slice of requests that get routed to Opus 4.8 by the safety classifier, you are still paying for a Fable 5 call but receiving Opus 4.8 output. Anthropic has not detailed how downgraded requests are billed, so confirm that with your account team before building cost models around the headline rate.
The Warts a Credible Review Has to Name
A launch this strong invites uncritical praise. Resist it. These are the documented weaknesses, drawn from Anthropic's own system card and from independent partners, and they should carry real weight in your decision rather than sit in a footnote.
How Fable 5 Fits the Claude Lineage
Fable 5 did not appear from nowhere. It is the generally-available descendant of a line of increasingly capable, increasingly safeguarded Anthropic models, and the dual-name design is a direct evolution of the gated-release pattern Anthropic established with the earlier Mythos work.
The Verdict
Fable 5 is the strongest generally-available model Anthropic has shipped, and the agentic-coding, long-horizon, and vision claims hold up well enough that they are not just marketing. The independent signals reinforce the vendor story rather than contradict it: Artificial Analysis puts a 152-point Elo gap on it over Opus 4.8 on real professional tasks, and Cursor, Hex, and Hebbia all report gains on their own benchmarks. That is a healthier evidence base than a launch built purely on self-reported numbers.
But this is a skeptic's review, and the position has to be specific rather than "it depends." Here it is.
Adopt Now If
You do agentic coding in Claude Code, long-horizon automation, vision-heavy document or UI work, or guided legal and finance analysis. For those workloads Fable 5 is a clear step up from Opus 4.8, the pricing is reasonable at $10/$50, and the free window on Pro, Max, and Team through June 22 lets you validate it against your real tasks at no extra cost. Run that test this week, because the window is two weeks wide.
Wait and Watch If
Your work is open-ended biology or chemistry ideation, heavy multi-step quantitative finance modeling, or anything that brushes against the safety classifier's topics often enough that the under-5% downgrade rate becomes your everyday experience. For those cases, the rough edges, the over-refusal tax, and the lack of independent replication on the headline SWE-bench Pro lead are real reasons to let the dust settle. Let the June 23 onward replication cycle tell you whether the cross-vendor gaps survive someone else's harness.
For Agent Builders Specifically
Adopt, but keep humans on irreversible actions. The documented prefill vulnerability and the occasional reckless or destructive agentic behavior mean you should sandbox aggressively and require approval for anything that deletes data or escapes its boundary. The capability is worth it; the autonomy is not yet worth trusting unsupervised.
Bottom line. Fable 5 earns adoption for coding, automation, and vision work today, with eyes open about the Opus 4.8 downgrade and the agentic risks. For science ideation and precise quant finance, wait for independent confirmation. The dual-name design is the most honest part of the launch: Anthropic told you exactly which version you are getting and which you are not.
Video Resources
These are live YouTube searches rather than fixed video links, so they surface current explainers and stay accurate as new coverage lands. Each opens a search for the topic on YouTube.
Go Deeper
Resources from across Tech Jacks Solutions
What Is Agentic AI?
Understand the architecture behind autonomous AI agents
Prompt Engineering Library
Prompting techniques that get better results from any AI
FREEAI Governance Charter
Establish your organization's AI principles in one document
AI Glossary
Definitions for AI terms used in this article