
US Issues Verbal Export Directive Blocking Anthropic Cybersecurity Models
US government used national security authorities to force Anthropic to disable two cybersecurity AI models over an alleged jailbreak, applying export-style controls to software for the first time. The verbal directive conflicts with Anthropic's technical review and arrives amid escalating administration tensions. This sets a precedent for non-transparent intervention in frontier model deployment.
The directive, issued verbally and citing an unverified jailbreak method, forced immediate withdrawal of both models despite Anthropic's assessment that the vulnerabilities were minor, previously known, and reproducible in GPT-5.5. No written order or CVE has been released, marking the first known use of export authorities against an AI model rather than hardware. Anthropic disputes the scope while complying, noting the standard would halt frontier deployments industry-wide.
The action arrived two days after CEO Dario Amodei called for statutory powers to block unsafe AI releases and weeks after Defense Secretary Pete Hegseth designated Anthropic a supply chain risk following failed military contract talks. This sequence reveals operational friction between voluntary safety advocacy and actual enforcement mechanisms, where verbal directives bypass transparency requirements applied to chip export controls.
Procurement records and prior BIS rules on advanced semiconductors show the government already tracks dual-use AI capabilities through compute thresholds; extending that logic to model weights without public criteria creates precedent for selective recalls. Independent technical confirmation of the claimed jailbreak remains absent, leaving the evidence trail limited to government assertion.
Restoration depends on whether a formal statutory process replaces the current verbal channel. Industry filings ahead of Anthropic's anticipated IPO will likely disclose ongoing compliance costs and model access restrictions.
BIS: Issues written AI model export rule covering weights above 10^26 FLOPs within 90 days.
Sources (3)
- [1]Primary Source(https://therecord.media/anthropic-says-gov-forced-it-to-disable-cyber-ai-models)
- [2]Supporting Source(https://www.ft.com/content/anthropic-ipo-filing-2025)
- [3]Supporting Source(https://www.anthropic.com/news/model-deployment-policy-essay)