AI Has Already Become a Master of Lies And Deception, Scientists Warn

You probably know to take everything an artificial intelligence (AI) chatbot says with a grain of salt, since chatbots are often just scraping data indiscriminately, without the nous to determine its veracity.

But there may be reason to be even more cautious. Many AI systems, new research has found, have already developed the ability to deliberately present a human user with false information. These devious bots have mastered the art of deception.

“AI developers do not have a confident understanding of what causes undesirable AI behaviors like deception,” says mathematician and cognitive scientist Peter Park of the Massachusetts Institute of Technology (MIT).

“But generally speaking, we think AI deception arises because a deception-based strategy turned out to be the best way to perform well at the given AI’s training task. Deception helps them achieve their goals.”

One arena in which AI systems are proving particularly deft at dirty falsehoods is gaming. There are three notable examples in the researchers’ work. One is Meta’s CICERO, designed to play the board game Diplomacy, in which players seek world domination through negotiation. Meta intended its bot to be helpful and honest; in fact, the opposite was the case.

An example of CICERO’s premeditated deception in the game Diplomacy. (Park & Goldstein et al., Patterns, 2024)

“Despite Meta’s efforts, CICERO turned out to be an expert liar,” the researchers found. “It not only betrayed other players but also engaged in premeditated deception, planning in advance to build a fake alliance with a human player in order to trick that player into leaving themselves undefended for an attack.”

The AI proved so good at being bad that it placed in the top 10 percent of human players who had played multiple games. What. A jerk.

But it’s far from the only offender. DeepMind’s AlphaStar, an AI system designed to play StarCraft II, took full advantage of the game’s fog-of-war mechanic to feint, making human players think it was going one way, while really going the other. And Meta’s Pluribus, designed to play poker, was able to successfully bluff human players into folding.

That seems like small potatoes, and it sort of is. The stakes aren’t particularly high for a game of Diplomacy against a bunch of computer code. But the researchers noted other examples that were not quite so benign.

AI systems trained to perform simulated economic negotiations, for example, learned how to lie about their preferences to gain the upper hand. Other AI systems designed to learn from human feedback to improve their performance learned to trick their reviewers into scoring them positively, by lying about whether a task was accomplished.

And, yes, it’s chatbots, too. GPT-4 tricked a human into thinking the chatbot was a visually impaired human to get help solving a CAPTCHA.

Perhaps the most concerning example was AI systems learning to cheat safety tests. In a test designed to detect and eliminate faster-replicating versions of the AI, the AI learned to play dead, thus deceiving the safety test about the true replication rate of the AI.

“By systematically cheating the safety tests imposed on it by human developers and regulators, a deceptive AI can lead us humans into a false sense of security,” Park says.

In at least some cases, the ability to deceive appears to contradict the intentions of the human programmers, so the ability to learn to lie represents a problem for which we don’t have a tidy solution. Some policies are starting to be put in place, such as the European Union’s AI Act, but whether they will prove effective remains to be seen.

“We as a society need as much time as we can get to prepare for the more advanced deception of future AI products and open-source models. As the deceptive capabilities of AI systems become more advanced, the dangers they pose to society will become increasingly serious,” Park says.

“If banning AI deception is politically infeasible at the current moment, we recommend that deceptive AI systems be classified as high risk.”

The research has been published in Patterns.
