After researcher backlash, Anthropic admits a "wrong tradeoff" in its plan to covertly degrade performance for users training rival AI models.

Anthropic has reversed a planned policy that would have silently degraded the performance of its Claude Fable 5 model for users attempting to train competing artificial intelligence systems, following sharp criticism from the research community. [1]

The company acknowledged the misstep directly, telling WIRED: “We made the wrong tradeoff and we apologize for not getting the balance right.” [1] Going forward, Anthropic says any protective measures of this kind will be made visible to users rather than applied covertly. [1]

The backlash was pointed. Dean Ball, a former White House AI advisor, described the covert approach as “shockingly hostile.” [1] Will Brown, from open-source startup Prime Intellect, framed the broader concern: “It felt like Anthropic was saying to the public, ‘We don’t trust anybody else to do AI research. We are the only ones who have to do AI research.'” [1]

The throttling controversy is not the only friction point surrounding Fable 5. The model requires data retention in order to run its new safety classifiers — prompts and outputs are stored for up to 30 days, and for up to two years if policy violations are flagged. [1] That data-handling requirement has created adoption problems at least one major enterprise customer. [1]

Microsoft is restricting Fable 5 internally as a result, according to The Verge, as cited by The Decoder. [1] While all other Claude models operate under zero-data-retention rules for Microsoft, Fable 5 does not appear in the internal model picker for the company’s GitHub Copilot product. [1]


Sources

  1. The Decoder — Claude Fable 5: Anthropic admits "wrong tradeoff" after invisibly throttling rival AI researchers

This article was drafted with AI from the cited sources and checked against them before publication. Spot an error? Let us know.