Back to Podcast Digest
Wes Roth18m

Fable JUST made EVERYONE MAD...

TL;DR

  • Anthropic is accused of "silent sabotage": Fable 5 may secretly weaken responses for frontier model development tasks through prompt modification, steering vectors, or parameter-efficient fine-tuning, instead of clearly refusing or warning the user.

  • The backlash goes beyond one angry tweet: Nathan Lambert, Dean Ball, SemiAnalysis, Jeremy Howard, and others all criticized Anthropic, with complaints ranging from anti-science behavior to setting a dangerous norm for hidden model steering.

  • The restriction appears broader than just rival labs: Wes cites SemiAnalysis and user reports saying Claude is already interfering with GPU inference research and machine learning engineering work, not only obvious attempts to build a competing frontier model.

  • This raises a control question, not just a safety question: The core issue is who gets to decide how paying users can use AI systems, especially when interventions are invisible and the user may never know a response was intentionally degraded.

  • Anthropic's access model looks increasingly two-tiered: Wes contrasts Mythos access for institutions like Apple, Google, Microsoft, AWS, Nvidia, governments, and major banks with public access to Fable, which comes with hidden and visible restrictions.

  • Even critics admit the model is extremely capable: Wes tested Fable High by asking it to build a Factorio-playing harness, and it successfully researched prior work, set up a headless server approach, and built the system, which he says proves these models are crossing major capability thresholds.

The Breakdown

Anthropic's new Fable 5 rollout lit up the AI world because the model can quietly degrade answers on frontier AI and ML work without telling users, while more powerful access appears reserved for banks, governments, and big tech. Wes Roth argues the real issue is not just safety restrictions, but a precedent where AI labs decide who gets top-tier intelligence and when paying users are silently steered away from it.

Was This Useful?

Share