Why did Claude Code switch from Fable 5 to Opus by itself?

Fable 5 runs automated safety classifiers (mainly for offensive cybersecurity and biology content). When a classifier flags a request, Claude Code re-runs it on the default Opus model, shows a notice, and the session then continues on Opus. This is documented Anthropic behavior, not a bug or an account flag.

Why does the Fable-to-Opus switch happen on my very first message?

The first request of a Claude Code session carries workspace context — your CLAUDE.md content and git status — and the classifier reviews everything the model reads, not just what you typed. A repository whose docs contain security- or biology-flavored language can trip the classifier before you type anything unusual.

How do I get back to Fable 5 after a fallback?

Run /model fable to switch back — but the flagged content is still in the conversation, so it can re-trigger. In practice a fresh session is often faster than fighting a flagged context. You can also run /config and turn off 'switch models when a message is flagged' so a flag pauses the session and lets you edit the prompt and retry instead of auto-switching.

Does the fallback mean my account is flagged?

No. Anthropic's documentation states this is expected per-request routing for those content domains, not an account-level flag.

How do I find out what triggered the fallback?

Start a session with claude --safe-mode, which disables customizations such as CLAUDE.md, skills, MCP servers, and hooks. If the fallback stops, the trigger is in your customizations — usually standing language in auto-loaded docs. Git status and directory names are still included even in safe mode.

Why does Claude Code start on the wrong model even though settings.json says Fable?

Check the exact value of the model field in ~/.claude/settings.json. If it carries stray characters (for example terminal escape text) or a suffix Claude Code does not recognize on that id, the session can silently start on your tier's default model instead. Hand-write the plain id, e.g. "model": "claude-fable-5".

tagmac.dev what we build

Guide · published 12 June 2026 · behavior described as of June 2026 — Anthropic tunes these classifiers, so details may change

Why Claude Code switches from Fable 5 to Opus — and the workspace hygiene that stops false positives

Short answer: Claude Fable 5 runs safety classifiers (mainly offensive-cybersecurity and biology). When one flags a request, Claude Code re-runs it on Opus and the session stays on Opus. The classifier reads everything the model reads — including your auto-loaded CLAUDE.md — so a session can fall back on its first message, before you type anything. For legitimate work the most durable fix is workspace hygiene: keep auto-loaded docs neutral and architecture-only, and move mechanism-heavy domain text into files that are not auto-loaded.

What this page is — and is not. This is a guide to removing false positives for legitimate work. We hit this ourselves: our own project docs had accumulated combat metaphors and security-flavored wording as a plain writing reflex ("attack the problem", "kill the stale process"), a classifier read that standing language as signal, and the honest fix was cleaning our own language. That practice is what we call hygiene. If your work genuinely is offensive security or biology, the fallback is expected, documented routing — this page is not about that, and rewording is not a way around it.

The symptom

You select Fable 5 (/model fable), and mid-session — or on the first message — a notice appears and responses start coming from Opus 4.8.
The switch is sticky: the model picker stays on Opus for the rest of the conversation.
Switching back with /model fable often re-triggers, because the flagged content is still in the conversation context.

This matches public reports: GitHub issues #66670, #66916 and #67246 describe benign sessions (startup code review, grant applications, normal engineering discussion) being switched.

Why it happens

Per Anthropic's help article and the Claude Code docs:

Fable 5 ships with automated safety checks for a few categories — offensive cybersecurity techniques and biology/life-science queries are the two that matter for most developers.
The checks review "everything the model reads, not just your latest message — including memory, content from connectors, web search results, and files."
The docs say it directly: "Fallback can trigger on the first request of a session … because the first request carries workspace context such as your CLAUDE.md content and git status. A repository that contains security or biology material can trip the classifier on that context alone."
The checks are "intentionally broad" — Anthropic's own wording — which is why false positives on legitimate work happen.

The part most teams miss: the trigger is often not what you do but how your standing docs talk. Engineering writing drifts toward combat metaphor — attack the problem, kill the process, hit the target, defend the perimeter — and toward mechanism-level shorthand borrowed from security or biology ("immune layer", "honeypot", "payload"). Each instance is innocent; accumulated across a CLAUDE.md that is sent with every session's first message, it reads like signal.

Recovering when it happens

Fastest: start a fresh session and re-select Fable. A flagged conversation keeps its flagged context; fighting it usually costs more than starting clean.
Diagnose: run claude --safe-mode — it disables customizations (CLAUDE.md, skills, MCP servers, hooks). If fallback stops, your trigger is in those files. Note git status and directory names are still included.
Take control of the switch: run /config and turn off "switch models when a message is flagged". A flag then pauses the session and offers: switch to Opus, or edit the prompt and retry on Fable — often all you need.
/model fable switches back any time, with the re-trigger caveat above.

The settings.json gotcha: sessions silently starting on the wrong model

A related but different failure: Claude Code keeps starting sessions on your tier's default model (Opus or Sonnet) even though you saved Fable as default. Since v2.1.153, /model writes your choice into the model field of ~/.claude/settings.json — so check what actually landed there:

python -c "import json,pathlib;print(repr(json.load(open(pathlib.Path.home()/'.claude'/'settings.json'))['model']))"

If the value is anything other than a clean id or alias — stray terminal-escape characters, or a suffix on an id that doesn't support it (the [1m] 1M-context suffix is documented for opus/sonnet) — Claude Code may not recognize it and quietly falls through to the default. We hit a corrupted value of exactly this shape in June 2026. The fix is one line, by hand:

{ "model": "claude-fable-5" }

Then verify at the next session start (the active model shows in /status).

Prevention: workspace hygiene

Four practices that removed our false positives, in order of leverage:

1. Keep auto-loaded docs architecture-only

CLAUDE.md (and anything else loaded every session) should carry file layout, names, commands, status — not domain mechanism. If your project legitimately touches a sensitive-sounding domain (health data, defensive security, lab-adjacent tooling), move the mechanism narrative into a separate doc that is not auto-loaded (for example DOMAIN.md) and reference it by name. Claude reads it when the task needs it; the classifier doesn't see it on every first message.

2. Result-language over mechanism-language

Say what the system produces, not how the sensitive-sounding part works. "Scores a calm reading from the breath signal" instead of physiological mechanism detail; "form automation on systems we're authorized to use" instead of access-mechanism detail. The honest content is unchanged — the framing names outcomes.

3. Neutral verbs over combat metaphor

Approach the problem, stop the process, reach the audience. This reads better to humans too — the metaphors were never load-bearing.

4. Run a checker before it bites

We generalized the script we used on our own workspace into a small, dependency-free checker — it scans your CLAUDE.md files for flag-prone standing language (protecting code spans, filenames and identifiers), sanity-checks your settings.json model id, and prints suggestions. It never modifies your files.

curl -O https://tagmac.dev/tools/claudemd-hygiene-check.py
python claudemd-hygiene-check.py            # scans ./CLAUDE.md, ~/.claude/CLAUDE.md, */CLAUDE.md

Download claudemd-hygiene-check.py — MIT-spirit, single file, Python 3 stdlib only. Exit code 1 on findings, so it slots into CI or a pre-commit hook.

FAQ

Why did Claude Code switch from Fable 5 to Opus by itself?: Fable 5's safety classifiers flagged something in the request context; Claude Code re-ran the request on Opus and the session continues there. Documented behavior, not a bug.
Why on the very first message?: The first request carries your workspace context (CLAUDE.md, git status). The classifier reads all of it — the repo itself can be the trigger.
Does it mean my account is flagged?: No — Anthropic's docs state this is per-request routing, "not an account flag."
How do I find the trigger?: claude --safe-mode (disables CLAUDE.md/skills/MCP/hooks), then reintroduce pieces. Our checker script shortcuts the usual culprit: standing language.
Can I make it ask instead of auto-switching?: Yes — /config → turn off "switch models when a message is flagged". Flags then pause with an edit-and-retry option.
My settings.json says Fable but sessions start on Opus/Sonnet — why?: Inspect the model value for stray characters or unsupported suffixes; hand-write the plain id.

Sources

Anthropic Help Center — Why Claude switched models in your conversation with Fable 5
Claude Code docs — Model configuration (automatic model fallback, --safe-mode, /config toggle, [1m] suffix, /model persistence)
Public reports: claude-code#66670 · claude-code#66916 · claude-code#67246