In short
Anthropic has reportedly embedded roughly half a dozen engineers on the NSA to deploy its Mythos AI mannequin for offensive cyber operations—doubtlessly together with assaults on networks in China and Iran.
Anthropic additionally warned that AI is approaching recursive self-improvement and referred to as for a coordinated world pause mechanism.
Each landed as Anthropic recordsdata for an IPO that would worth it above $1 trillion.
Anthropic has positioned about six engineers contained in the Nationwide Safety Company to assist deploy Mythos—its most succesful AI mannequin—for offensive cyber operations, the Monetary Instances reported Thursday.
The engineers are forward-deployed employees, customizing the mannequin for particular functions. One supply advised the FT it could possibly be helpful for infiltrating networks in nations like China and Iran.
Whether or not these engineers are concerned in lively operations is not confirmed. What’s: Mythos is similar mannequin Anthropic has declined to launch publicly, citing misuse threat. The corporate restricted it to vetted companions by way of Mission Glasswing—a restricted coalition that features Microsoft, Apple, and Amazon.
Anthropic can be suing the Pentagon. In late February, Protection Secretary Pete Hegseth designated the corporate a supply-chain threat—a label traditionally reserved for international adversaries like Huawei—after a $200 million contract collapsed. The sticking level: Anthropic refused to let the DoD use Claude for totally autonomous weapons or home mass surveillance. The NSA contract was exempt from that ban.

A California choose blocked the blacklisting as an obvious First Modification retaliation. A D.C. appeals courtroom denied Anthropic’s bid to halt it whereas litigation performs out. The NSA stored utilizing Mythos the entire time, in keeping with the FT’s reporting.
Find out how to cease AI that builds AI
On the identical day the NSA story broke, Anthropic’s inside analysis institute printed “When AI Builds Itself,” a take a look at how far Claude has come at automating its personal growth. In it, the corporate argues for primarily a world moratorium within the AI arms race—and even likened it to Chilly Battle-era nuclear treaties struck between the USA and Russia.
To know why, the corporate offered this little bit of context:
Claude now writes greater than 80% of the code merged into Anthropic’s manufacturing codebase—up from low single digits earlier than Claude Code launched in early 2025. Engineers ship roughly eight instances as a lot code per day as they did in 2024.
The report’s authors—Anthropic Institute lead Marina Favaro and co-founder Jack Clark—argue this trajectory is heading towards what they name recursive self-improvement: AI methods that autonomously design, construct, and practice their very own successors, with people taking part in a diminishing function at each step.
In a visible illustration, the researchers present a timeline by which the primary means to make use of AI at work as people prompting the pc to get a end result, with growing automations ending in AI Brokers prompting subagents till the result’s achieved, no people concerned.

The sharpest knowledge level they cite: In April, Claude brokers have been handed an open AI security downside—whether or not a weaker mannequin can reliably supervise a stronger one—and left to run it. Two human researchers over a couple of week recovered 23% of the efficiency hole between the fashions. The brokers recovered 97%, over 800 cumulative compute hours. People set the query. The brokers designed each experiment. It is the primary printed case of Claude exercising analysis judgment, not simply executing duties another person specified.
That is the road Anthropic is apprehensive about crossing. As soon as AI chooses which experiments are price operating—not simply runs them—people lose the final significant function within the growth loop. Small misalignments seen in immediately’s fashions might compound throughout self-improving generations till no one can appropriate them.
Their proposed repair is a verifiable world pause—a number of frontier labs halting concurrently, with unbiased verification that everybody truly stopped. Anthropic mentioned it will be a part of one. A unilateral slowdown, they acknowledge, simply palms the result in whoever stored going.
We’ve seen this film earlier than. The labs constructing AI are the identical ones warning how harmful AI is. Nevertheless, AI is probably the most worthwhile enterprise of the last decade, so no one desires to cease—not even those warning about AI.
Again in 2023, over 100 large names within the AI researching neighborhood signed an open letter asking for a world effort to mitigate the chance of extinction that AI growth intrinsically has. A number of months earlier than that, one other open letter demanded OpenAI pause advances on ChatGPT because of its harmful nature.
No person stopped after the 2023 open letter. OpenAI did not. Anthropic did not. The Pentagon’s deadline to drop Claude from its methods falls in August, across the identical time Anthropic’s IPO is predicted to maneuver its funds into public view.
Every day Debrief E-newsletter
Begin on daily basis with the highest information tales proper now, plus authentic options, a podcast, movies and extra.









