AI Safety vs. Cybersecurity: Is America Making Its Own Defenders Weaker?

++ EU Parliament approves simplification measures and “nudifier” app ban; Israel approves a national AI plan..& more

Jun 21, 2026

This week’s highlights:

On June 12, the US government ordered Anthropic to suspend foreign-national access to its powerful Fable 5 and Mythos 5 AI models over national-security concerns. Because Anthropic could not filter users by nationality in real-time, the company had to abruptly disable the models globally for everyone

Dozens of cybersecurity experts are now urging the US to lift restrictions on these models, arguing that powerful AI cyber capabilities should help defenders find and fix vulnerabilities before criminals or rival states exploit them. Their argument is not that the models carry no risk: they accept that AI can make hacking easier. But they say Anthropic had built safeguards into Fable, similar cyber capabilities already exist in other leading and open models, and abruptly removing these tools may hurt security teams more than attackers. They are therefore calling for clear, science-based, and transparent rules- not an opaque shutdown based on a claimed jailbreak risk that the government has not publicly detailed.

Key aspects highlighted:

AI is having significant impacts on cybersecurity, including by greatly reducing the difficulty of finding flaws in software and writing exploits for those flaws.
Anthropic’s Mythos-class models are quite good at finding flaws and weaponizing exploits.
However, they are not uniquely good at these tasks, and many of the undersigned individuals regularly use other foundation and open-source models for security audits and red-teaming every day.
Anthropic has built multiple protections into the Fable model to prevent its use for cyber offensive uses. These protections were so aggressive as to be the source of humor in the cyber community on launch day.
It is essential to provide AI to coders and security teams so they can find and fix flaws in their own newly-written as well as decades of legacy code faster than our adversaries.
The Chinese open-weight models are only months behind the best American models, and those are the models we know about. It seems likely that the PRC government has access to private capabilities beyond what has been published.
To pull the best capabilities away from defenders without a good reason when our adversaries are rapidly advancing is dangerous.

The Responsible AI Digest by School of Responsible AI- SoRAI

AI Safety vs. Cybersecurity: Is America Making Its Own Defenders Weaker?

++ EU Parliament approves simplification measures and “nudifier” app ban; Israel approves a national AI plan..& more

This week’s highlights:

⚖️ AI Ethics

Israel Approves National AI Plan to Boost Technological Self-Reliance and Global Competitiveness

European Parliament Approves AI Act Simplification, Delays Key Deadlines, and Bans AI Nudifier Apps

Europe 2031 Initiative Warns AI Inaction Could Leave Europe Economically and Politically Dependent

LifeSciBench Sets New Standard for Evaluating AI on Real-World Life Science Research Tasks

Deployment Simulation Helps Predict Model Behavior and Risks Before Release Through Realistic Traffic Replays

DOJ Backs xAI Turbines, Citing National Security in Memphis Data Center Pollution Lawsuit

Survey Finds 60% of US Consumers Are Turned Off by AI in Brand Messaging

Pew Study Finds Only 16% of Americans Expect AI to Benefit Society Long Term

FERC Orders Grid Operators to Fast-Track AI Data Center Connections Amid Power Capacity Strains

Match Survey Finds Nearly Half of US Singles Hold Negative Views on AI Dating

At G7 Summit, PM Modi Warns AI Misuse Could Fuel Deepfakes, Misinformation, Child Exploitation

🚀 AI Breakthroughs

Anthropic Becomes First AI Startup to Join Frontier’s $915 Million Carbon Removal Funding Round

Microsoft Makes Copilot Cowork Generally Available Worldwide With Usage-Based Pricing and New Cost Controls

Google Releases Android 17 With Multitasking Upgrades and Expanded Gemini Features Across Pixel Devices

Meta Rolls Out Facebook AI Mode Using Public Posts to Power Search and Engagement

NASA Trains AI on Billions of Earth Observations to Speed Climate Research and Analysis

OpenAI Launches Partner Network With $150 Million Investment to Accelerate Enterprise AI Adoption

ChatGPT Health Upgrades Improve Medical Guidance, Urgent Care Detection, and Response Accuracy for Millions

ChatGPT Enterprise Adds Credit Usage Analytics and Updated Spend Controls for Enterprise Administrators

🎓AI Academia

Study Details Barriers Marginalized Grassroots Groups Face in Shaping AI Policy and Governance

Fujitsu Research Study Defines AI Sandbox Threat Model, Taxonomy, and Measurement Framework for Assurance

Study Proposes Framework to Detect and Measure AI Risks to Democratic Institutions

Study Proposes Commons-Governed AI Taxonomy for Collective Oversight of Data, Compute, Models, and Energy

Study Examines Open Source AI Contributor Policies Amid Rising Governance and Compliance Gaps

Study Warns Failed AI Systems Leave Lasting Risks Beyond Decommissioning and Model Withdrawal

Discussion about this post

Ready for more?