Three Days With Claude Fable 5 — Where the 1M Context and Always-On Adaptive Thinking Fit a Solo Developer's Week

On the morning of June 9, reading the Claude Fable 5 announcement, my first thought was not "how good is it?" but "which task do I test it on?"

The headline features are all big ones: the first generally available model in the Mythos class above Opus, a one-million-token context window, up to 128k tokens of output, and adaptive thinking that is always on. But as an indie developer, I do not have the luxury of testing every corner of every new model. And this release came with a clock attached. Fable 5 is bundled into the Pro, Max, Team, and Enterprise plans at no extra cost only through June 22; from June 23 it requires usage credits. The free evaluation window is open for exactly two weeks.

The same week also brings the June 15 billing change, which moves the Agent SDK and headless runs onto API-rate monthly credits, so my time for experiments was limited. Before touching anything, I narrowed the evaluation down to three questions that would be enough to decide between "adopt" and "pass."

Why I wrote down three questions before touching it

With new models, I have learned that the moment before you start matters most. More than once I have begun with "let me just play with it," lost an entire day, and ended up with nothing more concrete than "seems nice." To avoid repeating that, I now write down the questions first and deliberately ignore everything else.

This time the three questions were:

Does the 1M context eliminate file selection as a preprocessing step? Under the old 200K assumption, I had to choose which files to show before asking for a review — and that choice carries my own blind spots
Does always-on adaptive thinking reduce the "it didn't think where I needed it to" failure? Until now I switched models and thinking options myself, and I wanted to know if I could hand that judgment over
Does the Opus 4.8 fallback in high-risk domains get in the way of everyday work? If the safety mechanism interrupts normal development, that changes where the model fits

I deliberately skipped the benchmark numbers. The announcement materials can tell you which scores went up; they cannot tell you which step of your own workflow the model should replace. Only your own work can.

The 1M context — "can pass" and "should pass" turned out to be different things

My first test was a cross-module review of the Android wallpaper app I run. I had carried a vague unease about the image-caching layer for a while, so I handed over the app source together with the shared library and asked for weaknesses in the cache design, viewed across module boundaries.

One finding made me sit up: the two modules generate cache keys by slightly different rules. One uses the URL as-is; the other strips query parameters first. Each looks perfectly natural in isolation, and under the pick-your-files workflow I would almost certainly have shown only one of them. Cross-cutting inconsistencies only surface when you pass the code in cross-cutting form.

But three days in, the real lesson was that "can pass" and "should pass" are different things. Hand over a huge context with a vague question and you get an exhaustive, scattershot list of improvements — which I then cannot digest. The bigger the context, the more the responsibility shifts to you to sharpen the question.

Where I landed: keep passing code module by module for everyday work, and reserve whole-repository handoffs for questions that are explicitly about cross-module dependencies and inconsistencies. The "context is a budget" idea I wrote about in how I allocate Claude's 200K context window does not go away when the budget grows to 1M.

Always-on adaptive thinking — the quiet value of not deciding myself

The second question was about adaptive thinking. Fable 5 thinks on every request, and the model itself decides how deeply to think based on the task.

Honestly, I was wary at first. If it deliberated over trivial questions, the rhythm of daily use would fall apart. After three days, that worry has not materialized. Light requests, like wording a commit message, come back light. Concurrency questions, like duplicate WorkManager executions, visibly get more deliberate treatment before the answer arrives.

One exchange stuck with me. Asking about duplicate runs of a periodic job, I received the answer I expected — plus a boundary condition I had not put in the question: the same problem re-enters through a different path when the process restarts and jobs are re-enqueued. Looking back, most of my "it didn't think where it should have" incidents were really my own fault for forgetting to request deeper reasoning. Letting the model own that judgment is a small change that pays off daily.

The flip side: the output style is thorough, which can feel verbose when all you want is a verdict. I find myself adding constraints like "conclusion first, three lines" more often than I did with Opus 4.8.

The Opus 4.8 fallback — I saw it exactly once in three days

The third question concerned the safety mechanism. In high-risk domains such as cybersecurity and the biological or chemical sciences, Fable 5 blocks its own response and falls back automatically to Claude Opus 4.8. Anthropic says this triggers in under 5% of sessions.

In ordinary app development — code review, refactoring discussions, build-error hunts — the fallback never fired. The single time I saw it was when I pressed for fine-grained details about a known vulnerability in a dependency; that one response was handed over to Opus 4.8.

That single experience left me thinking the design is well made. The conversation does not stop at "I can't answer that." An answer within answerable bounds comes back from a different model, so the flow of investigation is never severed. Put a strong safety mechanism on a highly capable model, and instead of refusing, hand off. When I design feature restrictions in my own apps, I tend to settle for a "not available" dialog, so this "don't stop, hand off" pattern is one I want to add to my own toolbox. I covered the background of the Mythos class in Claude Mythos — Anthropic's new frontier model, but it took using the generally available version in real work to feel how this safety design actually behaves.

The routing I settled on for June 23 and beyond

Based on these three days, I decided my post-trial position in advance. The API price is $10 input / $50 output per million tokens — exactly double Opus 4.8 standard at $5/$25. On usage credits, that is too heavy for me to run as a daily driver at my scale.

So here is my conclusion: everyday code consultations and reviews stay on Opus 4.8, whole-repository design reviews and the few genuinely hard problems each month go to Fable 5, and lightweight batch work stays on Haiku. Fable 5 is not my default model; it is the specialist I call in.

To keep that decision honest, I wrote a small helper that routes API calls to a model per task type. The point of the code is to stop model names from being hard-coded all over the codebase, so that when prices or availability change, there is exactly one place to revisit.

import anthropic
 
client = anthropic.Anthropic(api_key="YOUR_API_KEY")
 
# Keep the model choice per task type in one place.
# When prices or availability change, this dict is the only thing to revisit.
MODEL_BY_TASK = {
    "repo_review": "claude-fable-5",             # cross-repository reviews (a few times a month)
    "daily_coding": "claude-opus-4-8",           # everyday code consultations and reviews
    "light_batch": "claude-haiku-4-5-20251001",  # lightweight batch processing
}
 
def ask(task_type: str, prompt: str) -> str:
    """Route the request to the model assigned to this task type."""
    model = MODEL_BY_TASK.get(task_type, "claude-opus-4-8")  # unknown tasks fall back to the standard model
    response = client.messages.create(
        model=model,
        max_tokens=4096,
        messages=[{"role": "user", "content": prompt}],
    )
    return response.content[0].text
 
# Example: only repository-wide reviews go to Fable 5
# print(ask("repo_review", "Find inconsistencies in cache-key generation across these modules"))

It is nothing more than a dictionary in front of the client, but in a month where the June 15 billing change and the June 23 end of the free window land back to back, having one place to revisit earns its keep. For the availability side of running multiple models, the patterns from designing multi-model fallback across Sonnet, Haiku, and Opus carried over unchanged, so I will leave that part to the earlier piece.

One thing worth trying before the window closes

These are only three days of impressions, and my assessment may well shift by next month. Still, one thing I can say with confidence: if you have a repository you long ago gave up showing to an AI because it was simply too large, hand it to Fable 5 once, as-is, before June 22 and ask for a cross-cutting review. What comes back from an unfiltered, whole-repository pass is worth finding out while the free window is open.

I landed on "the specialist I call in," but with a different codebase and different workflow, your answer may differ. If you are also short on time for evaluations, I hope these notes save you some of yours.