NewsWorld

Anthropic ditches its core safety promise

Hacker News · Feb 26, 2026 · Collected from RSS

Summary

Article URL: https://www.cnn.com/2026/02/25/tech/anthropic-safety-policy-change
Comments URL: https://news.ycombinator.com/item?id=47165397
Points: 59 · Comments: 31

Full Article

Anthropic, a company founded by OpenAI exiles worried about the dangers of AI, is loosening its core safety principle in response to competition. Instead of self-imposed guardrails constraining its development of AI models, Anthropic is adopting a nonbinding safety framework that it says can and will change. In a blog post Tuesday outlining its new policy, Anthropic said shortcomings in its two-year-old Responsible Scaling Policy could hinder its ability to compete in a rapidly growing AI market.

The announcement is surprising because Anthropic has described itself as the AI company with a “soul.” It also comes the same week that Anthropic is fighting a significant battle with the Pentagon over AI red lines. The policy change is separate from, and unrelated to, Anthropic’s discussions with the Pentagon, according to a source familiar with the matter.

Defense Secretary Pete Hegseth gave Anthropic CEO Dario Amodei an ultimatum on Tuesday to roll back the company’s AI safeguards or risk losing a $200 million Pentagon contract. The Pentagon threatened to put Anthropic on what is effectively a government blacklist.

But the company said in its blog post that its previous safety policy was designed to build industry consensus around mitigating AI risks – guardrails that the industry blew through. Anthropic also noted its safety policy was out of step with Washington’s current anti-regulatory political climate.

Anthropic’s previous policy stipulated that it should pause training more powerful models if their capabilities outstripped the company’s ability to control them and ensure their safety — a measure that’s been removed in the new policy. Anthropic argued that responsible AI developers pausing growth while less careful actors plowed ahead could “result in a world that is less safe.”

As part of the new policy, Anthropic said it will separate its own safety plans from its recommendations for the AI industry.
Anthropic wrote that it had hoped its original safety principles “would encourage other AI companies to introduce similar policies. This is the idea of a ‘race to the top’ (the converse of a ‘race to the bottom’), in which different industry players are incentivized to improve, rather than weaken, their models’ safeguards and their overall safety posture.” The company now suggests that hasn’t played out.

In a statement to CNN, an Anthropic spokesperson described the updated policy as “the strongest to date on the level of public accountability and transparency.”

“We’ve gone a significant step further from our prior policies by committing to publicly publish detailed reports at regular intervals on our plans to strengthen our risk mitigations, as well as the threat models and capabilities of all our models,” the statement said. “From the beginning, we’ve said the pace of AI and uncertainties in the field would require us to rapidly iterate and improve the policy.”

Anthropic’s new safety policy includes a “Frontier Safety Roadmap” that outlines the company’s self-imposed guidelines and safeguards. But the company acknowledged the new framework is more flexible than its past policy. “Rather than being hard commitments, these are public goals that we will openly grade our progress towards,” the company said in its blog post.

The change comes a day after Defense Secretary Pete Hegseth gave Anthropic CEO Dario Amodei a Friday deadline to roll back the company’s AI safeguards, or risk losing a $200 million Pentagon contract and being put on what is effectively a government blacklist.

Anthropic has concerns over two issues that it isn’t willing to drop, according to a source familiar with the company’s meeting with Hegseth: AI-controlled weapons and mass domestic surveillance of American citizens. Anthropic believes AI is not reliable enough to operate weapons, and there are no laws or regulations yet that cover how AI could be used in mass surveillance, the source said.
AI researchers on social media applauded Anthropic’s stance on Tuesday and expressed concerns about the idea of AI being used for government surveillance.

The company has long positioned itself as the AI business that prioritizes safety. Anthropic has published research showing how its own AI models could be capable of blackmail under certain conditions. The company recently donated $20 million to Public First Action, a political group pushing for AI safeguards and education.

But the company has faced increasing pressure and competition from both the government and its rivals. Hegseth, for example, plans to invoke the Defense Production Act on Anthropic and designate the company a supply chain risk if it does not comply with the Pentagon’s demands, CNN reported on Tuesday. OpenAI and Anthropic have also been locked in a race to launch new enterprise AI tools in a bid to win the workplace.

Jared Kaplan, Anthropic’s chief science officer, suggested in an interview with Time that the change was made in the name of safety more than increased competition. “We felt that it wouldn’t actually help anyone for us to stop training AI models,” Kaplan told the magazine. “We didn’t really feel, with the rapid advance of AI, that it made sense for us to make unilateral commitments … if competitors are blazing ahead.”

CNN’s Hadas Gold contributed to this story. This story has been updated with additional information.



Read Original at Hacker News

Related Articles

Bloomberg · 2 days ago
Anthropic Drops Hallmark Safety Pledge in Race With AI Peers

Anthropic, which for years billed itself as a safer alternative to its artificial intelligence rivals, has loosened its commitment to maintaining its guardrails in one of the most dramatic policy shifts yet in the AI industry, as startups once focused on helping humanity turn their attention to profit and success. In 2023, the company said in its Responsible Scaling Policy that it would delay AI development that might be dangerous. In a Tuesday blog post, Anthropic said it was updating its rules to say it would no longer do so if it believes it lacks a significant lead over a competitor. Bloomberg News Tech & National Security Reporter Katrina Manson joins Bloomberg Businessweek Daily to discuss. She speaks with Carol Massar and Tim Stenovec. (Source: Bloomberg)

Engadget · 2 days ago
Anthropic weakens its safety pledge in the wake of the Pentagon's pressure campaign

Two stories about the Claude maker Anthropic broke on Tuesday that, taken together, arguably paint a chilling picture. First, US Defense Secretary Pete Hegseth is reportedly pressuring Anthropic to give up its AI safeguards and give the military unrestrained access to its Claude AI chatbot. The company then chose the same day that the Hegseth news broke to drop its centerpiece safety pledge.

On Tuesday, Anthropic said it was modifying its Responsible Scaling Policy (RSP) to lower safety guardrails. Up until now, the company's core pledge has been to stop training new AI models unless specific safety guidelines can be guaranteed in advance. This policy, which set hard tripwires to halt development, was a big part of Anthropic's pitch to businesses and consumers. “Two and a half years later, our honest assessment is that some parts of this theory of change have played out as we hoped, but others have not,” Anthropic wrote. Now, its updated policy approaches safety relatively, rather than with strict red lines.

Anthropic's quotes in an interview with Time sound reasonable enough in a vacuum. "We felt that it wouldn't actually help anyone for us to stop training AI models," Jared Kaplan, Anthropic's chief science officer, told Time. "We didn't really feel, with the rapid advance of AI, that it made sense for us to make unilateral commitments… if competitors are blazing ahead."

But you could also read those quotes as the latest example of a hot startup’s ethics becoming grayer as its valuation rises. (Remember Google’s old “Don’t be evil” mantra that it later removed from its code of conduct?) The latest versions of Claude have drawn widespread praise, especially in coding. In February, Anthropic raised $30 billion in new investments. It now has a valuation of $380 billion. (Speaking of the competition Kaplan referred to, rival OpenAI is currently…

Hacker News · about 1 hour ago
Statement on the comments from Secretary of War Pete Hegseth

Article URL: https://www.anthropic.com/news/statement-comments-secretary-war Comments URL: https://news.ycombinator.com/item?id=47188697 Points: 88 # Comments: 11

Hacker News · about 1 hour ago
We Will Not Be Divided

Article URL: https://notdivided.org Comments URL: https://news.ycombinator.com/item?id=47188473 Points: 284 # Comments: 68

Hacker News · about 3 hours ago
Qt45: A small polymerase ribozyme that can synthesize itself

Article URL: https://www.science.org/doi/10.1126/science.adt2760 Comments URL: https://news.ycombinator.com/item?id=47187649 Points: 37 # Comments: 4

Hacker News · about 3 hours ago
Show HN: I built a site where you hire yourself instead of applying for jobs

Article URL: https://hired.wtf Comments URL: https://news.ycombinator.com/item?id=47187450 Points: 3 # Comments: 1