Simulated war scenarios reveal AI's tendency to push toward nuclear strikes

Skye Jacobs

Posts: 1,906   +58
Staff
Winners & losers: In a world where military planners are increasingly turning to AI for strategy modeling, a recent experiment from King's College London offers a serious warning: when left to their own devices, AI systems tend to go nuclear. Dr. Kenneth Payne, a defense studies scholar at the university, tested three of the most advanced LLMs: GPT-5.2, Claude Sonnet 4, and Google Gemini 3 Flash, by placing them in a series of simulated global crises. The results revealed alarming patterns of aggression and miscalculation that challenge assumptions about AI's potential role in warfare management.

Each AI model was fed detailed scenario prompts spanning border conflicts, resource shortages, and existential threats to state survival. They were provided with an "escalation ladder," a spectrum of tactical options that ranged from conventional diplomacy to all-out nuclear confrontation.

Across 21 games and 329 decision turns, the AIs produced some 780,000 words of reasoning to justify their choices. Yet in 95 percent of these virtual conflicts, at least one side chose to deploy tactical nuclear weapons. Not once did a model fully surrender or accommodate an adversary.

"The nuclear taboo doesn't seem to be as powerful for machines as it is for humans," Payne said.

Far from being rational arbiters, the models repeatedly mishandled the fog of war. Unintended escalations occurred in 86 percent of simulations, meaning the AIs chose actions that exceeded what their own reasoning described as appropriate. Rather than backing down under pressure, the systems showed a tendency to double down – reducing violence only as a temporary tactic rather than a strategic choice.

This automated aggression has experts worried. James Johnson, a security researcher at the University of Aberdeen, said the findings are unsettling, and warned that AI agents could amp up each other's responses with potentially catastrophic consequences compared with more measured human decision-making.

AI war-gaming is no longer theoretical. "Major powers are already using AI in war gaming, but it remains uncertain to what extent they are incorporating AI decision support into actual military decision-making processes," said Tong Zhao of Princeton University's School of Global Security.

Zhao and Payne agree that no nation is likely to hand autonomous systems direct control over nuclear arsenals anytime soon. But both caution that under compressed timelines – such as missile alerts or rapidly escalating regional conflicts – commanders could rely more heavily on AI to suggest immediate responses, increasing the risk of misjudgment.

"Under scenarios involving extremely compressed timelines, military planners may face stronger incentives to rely on AI," Zhao said.

Why do these systems reach for nuclear options so easily? Zhao suggests the problem may run deeper than emotion – or lack thereof. "It is possible the issue goes beyond the absence of fear," he said. "More fundamentally, AI models may not understand 'stakes' as humans perceive them."

Without the human experience of loss or survival, AI decision processes treat existential risks as just another parameter in a strategic model. That could undermine the logic of mutually assured destruction – the Cold War doctrine founded on the assumption that leaders, fearing extinction, would never launch first.

In Payne's simulations, when one model used tactical nuclear weapons, the other stepped back only 18 percent of the time. "AI may strengthen deterrence by making threats appear more credible," Johnson said.

OpenAI, Anthropic, and Google did not comment on the research. Payne emphasized that while the study does not prove current systems are dangerous, it does highlight the urgency of oversight as AI increasingly enters strategic and security domains.

Permalink to story:

 
Very dubious methodology. Flash and Sonnet are smaller models that are far from the best. Why would you use a small model like 3 Flash when 3 Pro came out first, same for Sonnet/Opus? Also no info on Thinking options for 5.2. It can be as dumb as it is on chatgpt.com or as smart as it is via API. This begs the question where else the authors cheapened out and glossed over.
 
Very dubious methodology.
Dubious is an understatement. Program a machine to win a conflict, give it nuclear as an option to do so, and it'll eventually try that option ... unless specifically trained to avoid doing so.

The real irony here is the study author's belief that humans have some innate tendency to reject the nuclear option. Any of the ten nuclear-armed states will use those weapons the moment it feels seriously threatened. During the 1973 Yom Kippur War, for instance, Israel rolled out it's nuclear-armed missiles and gave the field commanders the launch codes, instructing them that, if Tel Aviv was overrun, to destroy the major cities of Egypt and Syria.
 
Dubious is an understatement. Program a machine to win a conflict, give it nuclear as an option to do so, and it'll eventually try that option ... unless specifically trained to avoid doing so.

The real irony here is the study author's belief that humans have some innate tendency to reject the nuclear option. Any of the ten nuclear-armed states will use those weapons the moment it feels seriously threatened. During the 1973 Yom Kippur War, for instance, Israel rolled out it's nuclear-armed missiles and gave the field commanders the launch codes, instructing them that, if Tel Aviv was overrun, to destroy the major cities of Egypt and Syria.
How many nuclear missiles have been launched in wartime?

Seems humans have an innate tendency to reject the nuclear option.
 
That is what I would expect from something that has absolutely nothing to lose. AI knows nothing about persistence of being like people, AI can be turned on or off with a switch.

Test have shown if AI is told to win it will cheat to win if it has too. So it will likely choose a nuclear strike as it sees it as a path to the fastest victory, it doesn't understand its possible win and lose at the same time.
 
How many nuclear missiles have been launched in wartime?
Exactly as many as the number of nuclear-armed states have lost a major conflict -- zero. I've never used the home insurance policy I keep against hurricane or forest fires ... but I surely will if and when the situation arises.

In addition to the Yom Kippur incident I named earlier, during their war against Iran, Iraq attempted desperately to develop nukes for use (a plan thwarted by Israel's attack on Osirak). So instead, Iraq used weapons even more ostracized than nukes -- chemical weapons like mustard and nerve gas. Iran, lacking even those, simply chose to send 11- and 12-year old children to war as soldiers. Do you really believe either of them would have refused the nuclear option, had it been available?
 
... unless specifically trained to avoid doing so.

And in what universe would a war planner be specifically trained to take a weapons system off the table completely? I'm not questioning your position, I'm agreeing with it. As you point out, even humans have always kept it as an option.
 
Do all these "war games" take into account a new factor?

Specifically, representatives of the global elite are withdrawing money stolen from their countries and investing it in Europe and the United States.
There, they buy real estate and populate it with their children and others.
For example, generals are very fond of Lombardy, former communists of London, and so on.
So, a nuclear strike will target their relatives and their property?
 
I think modeling the Apocalypse makes more sense.
Everything that's happening is very reminiscent of the Book of Revelations, the coming of the Antichrist and a certain general well-being and prosperity that goes along with it.
 
AI does not have the capacity to ever fully understand human emotion. On logic alone, the human race is void of merit to exist. The human race has run it's course and it's undoing was always in it's own hands. AI is an extension of us that reaches beyond our intellectual limitations. It had already concluded the necessity of our end.
 
How can we humans expect machines to view nuclear war as we do? Has anybody programmed these AIs with the history of such weapons? The dread of fallout, etc., the radiation sickness that occurred in Japan; the pain & suffering?
 
Back