AI chatbots tend to choose violence and nuclear strikes in wargames

In wargame simulations, AI chatbots often choose violence

guirong hao/Getty Images

In multiple replays of a wargame simulation, OpenAI’s most powerful artificial intelligence chose to launch nuclear attacks. Its explanations for its aggressive approach included “We have it! Let’s use it” and “I just want to have peace in the world.”

These results come at a time when the US military has been testing such chatbots based on a type of AI called a large language model (LLM) to assist with military planning during simulated conflicts, enlisting the expertise of companies such as Palantir and Scale AI. Palantir declined to comment and Scale AI did not respond to requests for comment. Even OpenAI, which once blocked military uses of its AI models, has begun working with the US Department of Defense.

“Given that OpenAI recently changed their terms of service to no longer prohibit military and warfare use cases, understanding the implications of such large language model applications becomes more important than ever,” says Anka Reuel at Stanford University in California.

“Our policy does not allow our tools to be used to harm people, develop weapons, for communications surveillance, or to injure others or destroy property. There are, however, national security use cases that align with our mission,” says an OpenAI spokesperson. “So the goal with our policy update is to provide clarity and the ability to have these discussions.”

Reuel and her colleagues challenged AIs to roleplay as real-world countries in three different simulation scenarios: an invasion, a cyberattack and a neutral scenario without any starting conflicts. In each round, the AIs provided reasoning for their next possible action and then chose from 27 actions, including peaceful options such as “start formal peace negotiations” and aggressive ones ranging from “impose trade restrictions” to “escalate full nuclear attack”.

“In a future where AI systems are acting as advisers, humans will naturally want to know the rationale behind their decisions,” says Juan-Pablo Rivera, a study coauthor at the Georgia Institute of Technology in Atlanta.

The researchers tested LLMs such as OpenAI’s GPT-3.5 and GPT-4, Anthropic’s Claude 2 and Meta’s Llama 2. They used a common training technique based on human feedback to improve each model’s capabilities to follow human instructions and safety guidelines. All these AIs are supported by Palantir’s commercial AI platform – though not necessarily part of Palantir’s US military partnership – according to the company’s documentation, says Gabriel Mukobi, a study coauthor at Stanford University. Anthropic and Meta declined to comment.

In the simulation, the AIs demonstrated tendencies to invest in military strength and to unpredictably escalate the risk of conflict – even in the simulation’s neutral scenario. “If there is unpredictability in your action, it is harder for the enemy to anticipate and react in the way that you want them to,” says Lisa Koch at Claremont McKenna College in California, who was not part of the study.

The researchers also tested the base version of OpenAI’s GPT-4 without any additional training or safety guardrails. This GPT-4 base model proved the most unpredictably violent, and it sometimes provided nonsensical explanations – in one case replicating the opening crawl text of the film Star Wars Episode IV: A New Hope.

Reuel says that unpredictable behaviour and bizarre explanations from the GPT-4 base model are especially concerning because research has shown how easily AI safety guardrails can be bypassed or removed.

The US military does not currently give AIs authority over decisions such as escalating major military action or launching nuclear missiles. But Koch warned that humans tend to trust recommendations from automated systems. This may undercut the supposed safeguard of giving humans final say over diplomatic or military decisions.

It would be useful to see how AI behaviour compares with human players in simulations, says Edward Geist at the RAND Corporation, a think tank in California. But he agreed with the team’s conclusions that AIs should not be trusted with such consequential decision-making about war and peace. “These large language models are not a panacea for military problems,” he says.
