Anthropic rejects Pentagon’s final offer to remove AI safeguards

Anthropic stands firm on two red lines for his company's AI technology: it must not be used for mass domestic surveillance or fully autonomous weapons systems...

Feb 27, 2026

++ OpenAI tightens safety checks and sets direct police contact after Canada scrutiny; New York weighs a three-year AI data center permitting moratorium; Pew: 12% of US teens use AI chatbots for emotional support; US tells diplomats to oppose foreign data sovereignty laws over AI growth; “Humanity’s Last Exam” benchmark shows current AI systems still fail; Anthropic valuation hits $380B; markets slide after viral 2028 job-loss loop report; EU delays high-risk AI guidance again; OpenAI details malicious AI use with traditional tools; “vibe researching” debate grows around AI research agents

Today’s highlights:

Anthropic CEO Dario Amodei said Thursday he would not grant the Pentagon unrestricted access to the company’s AI systems, arguing some military uses could undermine democratic values or exceed what current technology can do safely. He said Anthropic is seeking two guardrails: no mass surveillance of Americans and no fully autonomous weapons without a human in the loop, while the Defense Department maintains it should be free to use the model for any lawful purpose. Amodei’s comments came less than a day before a stated Friday 5:01 p.m. deadline, after the Pentagon reportedly warned it could label the firm a supply-chain risk or push action under the Defense Production Act. Amodei called the threats contradictory and said the company would prefer to keep working with the military under safeguards but would support a smooth transition if the Defense Department ends the relationship.

The Responsible AI Digest by School of Responsible AI- SoRAI

Anthropic rejects Pentagon’s final offer to remove AI safeguards

Anthropic stands firm on two red lines for his company's AI technology: it must not be used for mass domestic surveillance or fully autonomous weapons systems...

Today’s highlights:

⚖️ AI Ethics

OpenAI to Tighten Safety Checks, Set Direct Police Contact After Canada Scrutiny

AI Data Center Backlash Grows as New York Weighs Three-Year Statewide Permitting Moratorium

Pew Survey Finds 12% of US Teens Use AI Chatbots for Emotional Support

US Orders Diplomats to Oppose Foreign Data Sovereignty Laws, Citing Risks to AI Growth

Researchers Detail ‘Humanity’s Last Exam’ Benchmark That Current AI Systems Consistently Fail

Anthropic valuation hits $380 billion, surpassing combined market cap of India’s listed IT firms

Markets Slide After Viral AI Report Warns of 2028 Job Loss Loop and Recession

European Commission Delays High-Risk AI Guidance Again as EU AI Act Timelines Slip

OpenAI Report Details How Malicious Actors Combine AI Models With Traditional Tools

🚀 AI Breakthroughs

Google Launches Nano Banana 2 Image Model as Faster Default Across Gemini Apps

Google Adds Gemini 3 Flash Agent to Opal for Automated Workflow Mini-Apps

Google Translate Adds Gemini-Powered Context, Idiom Alternatives, and “Ask” Follow-Ups in Updates

Google Brings Gemini Task Automation, Enhanced Circle to Search, Scam Detection to Galaxy S26

Bumble Adds AI Photo Feedback and Profile Guidance Tools to Improve Dating Matches Globally

Atlassian Adds AI Agents in Jira Dashboard to Manage Work Alongside Humans

Adobe Firefly Adds Quick Cut AI Tool to Auto-Edit Footage Into First-Draft Videos

Amazon Adds Brief, Chill, and Sweet Personality Styles to AI-Powered Alexa+ Assistant

Anthropic expands enterprise agents with finance, engineering, and design plug-ins, plus new connectors

Oura launches proprietary AI model to power women’s health insights in Oura Advisor

Perplexity Launches Computer System to Orchestrate Multi-Model AI Workflows Across Tools

🎓AI Academia

AI Agents With Research Skills Spur ‘Vibe Researching’ Debate on Social Science Roles

AGI Economics Study Says Human Verification Bandwidth, Not Intelligence, Will Constrain Growth

Study Finds Usefulness, Trust, Enjoyment, and Social Norms Drive Students’ AI Chatbot Adoption

OpenPort Protocol Sets Governance Rules for AI Agent Tool Access With Auditing and Risk-Gated Writes

Preprint Red-Teams Autonomous AI Agents, Finding Tool-Use Failures and System Takeover Risks

Position Paper Urges Machine Learning Community to Practise Data Frugality for Responsible AI Development

Study Finds Community Norms Outweigh Platform Policies in Open AI Model Marketplaces

Discussion about this post

Ready for more?