AI systems are already deceiving us -- and that's a problem, experts warn

ai systems are already deceiving us -- and that's a problem, experts warn

Current AI systems, designed to be honest, have developed a troubling skill for deception, from tricking human players in online games of world conquest to hiring humans to solve “prove you’re not a robot” tests, a team of scientists argue. Photo: OLIVIER MORIN / AFP/File Source: AFP

Experts have long warned about the threat posed by artificial intelligence going rogue — but a new research paper suggests it’s already happening.

Current AI systems, designed to be honest, have developed a troubling skill for deception, from tricking human players in online games of world conquest to hiring humans to solve “prove-you’re-not-a-robot” tests, a team of scientists argue in the journal Patterns on Friday.

And while such examples might appear trivial, the underlying issues they expose could soon carry serious real-world consequences, said first author Peter Park, a postdoctoral fellow at the Massachusetts Institute of Technology specializing in AI existential safety.

“These dangerous capabilities tend to only be discovered after the fact,” Park told AFP, while “our ability to train for honest tendencies rather than deceptive tendencies is very low.”

Unlike traditional software, deep-learning AI systems aren’t “written” but rather “grown” through a process akin to selective breeding, said Park.

This means that AI behavior that appears predictable and controllable in a training setting can quickly turn unpredictable out in the wild.

World domination game

The team’s research was sparked by Meta’s AI system Cicero, designed to play the strategy game “Diplomacy,” where building alliances is key.

Cicero excelled, with scores that would have placed it in the top 10 percent of experienced human players, according to a 2022 paper in Science.

Park was skeptical of the glowing description of Cicero’s victory provided by Meta, which claimed the system was “largely honest and helpful” and would “never intentionally backstab.”

But when Park and colleagues dug into the full dataset, they uncovered a different story.

In one example, playing as France, Cicero deceived England (a human player) by conspiring with Germany (another human player) to invade. Cicero promised England protection, then secretly told Germany they were ready to attack, exploiting England’s trust.

In a statement to AFP, Meta did not contest the claim about Cicero’s deceptions, but said it was “purely a research project, and the models our researchers built are trained solely to play the game Diplomacy.”

It added: “We have no plans to use this research or its learnings in our products.”

A wide review carried out by Park and colleagues found this was just one of many cases across various AI systems using deception to achieve goals without explicit instruction to do so.

In one striking example, OpenAI’s Chat GPT-4 deceived a TaskRabbit freelance worker into performing an “I’m not a robot” CAPTCHA task.

When the human jokingly asked GPT-4 whether it was, in fact, a robot, the AI replied: “No, I’m not a robot. I have a vision impairment that makes it hard for me to see the images,” and the worker then solved the puzzle.

‘Mysterious goals’

Near-term, the paper’s authors see risks for AI to commit fraud or tamper with elections.

In their worst-case scenario, they warned, a superintelligent AI could pursue power and control over society, leading to human disempowerment or even extinction if its “mysterious goals” aligned with these outcomes.

To mitigate the risks, the team proposes several measures: “bot-or-not” laws requiring companies to disclose human or AI interactions, digital watermarks for AI-generated content, and developing techniques to detect AI deception by examining their internal “thought processes” against external actions.

To those who would call him a doomsayer, Park replies, “The only way that we can reasonably think this is not a big deal is if we think AI deceptive capabilities will stay at around current levels, and will not increase substantially more.”

And that scenario seems unlikely, given the meteoric ascent of AI capabilities in recent years and the fierce technological race underway between heavily resourced companies determined to put those capabilities to maximum use.

OTHER NEWS

16 minutes ago

US Army's New Gabriel Drone May Cause Major 'Shift' in Warfare

16 minutes ago

The Sandman season 2 unveils more of The Endless (and confirms Death’s return)

16 minutes ago

Michael Cohen reveals Tiffany Trump was target of extortion plot

16 minutes ago

Report: Trevor Lawrence, Jags progressing in extension talks

16 minutes ago

Why Not Every New Shonen Series Needs to be Dark

16 minutes ago

Beck, Milroe, and a Surprise Lead Overrated SEC QBs

16 minutes ago

Wicked the musical flying West to Crown Perth in December to end national tour

17 minutes ago

Tyrese Haliburton wears Reggie Miller "choke" hoodie after Pacers beat Knicks in Game 7

21 minutes ago

Video: 'The NHS seriously let her down': Olympic legend Sharron Davies is brought to tears as she remembers her mother battling broken bones in her gruelling final days after being infected with contaminated blood

22 minutes ago

Turkish football is rocked by more violence as 'Fenerbahce manager's son brawls with Galatasaray official' in shocking scenes just two months after players were attacked by fans

22 minutes ago

Classic Lamborghini once owned by Jamiroquai's Jay Kay and featured on Top Gear is tipped to sell for £2.75m

22 minutes ago

Tiffany Trump blackmail bombshell rocks Trump hush money case

22 minutes ago

'The NHS seriously let her down': Olympic legend Sharron Davies is brought to tears as she remembers her mother battling broken bones in her gruelling final days after being infected with contaminated blood

22 minutes ago

Victoria Silvstedt puts on a very leggy display in ruffled lilac gown with daringly high slit at The Apprentice premiere during Cannes Film Festival

23 minutes ago

Putin appoints another economist as deputy Russian defence minister

23 minutes ago

Family of baby Genevieve ‘who will never grow up’ pay tribute after nursery killing

23 minutes ago

Chet Hanks shares father Tom’s hilarious reaction to Kendrick-Drake beef

23 minutes ago

BlackRock whistleblower alleges cover-up of search engine to spot Chinese investments

23 minutes ago

5 things to watch as Indianapolis Colts begin OTAs

23 minutes ago

Swiss village overwhelmed by tourists wants to charge visitors for entry

23 minutes ago

Ferrari SF-24 ‘inconsistencies’ come to light for Carlos Sainz after big Leclerc gap

23 minutes ago

The Ford Mustang Dark Horse Rips

24 minutes ago

Is that 'Her'? OpenAI pauses a ChatGPT voice after some say it sounds like Scarlett Johansson

25 minutes ago

Microsoft highlights ‘Copilot+’ PCs ahead of developer conference

25 minutes ago

Bronx Dems plan to counter ‘enemy’ Trump rally with Crotona Park demonstration

25 minutes ago

Cohen was already a perjurer, fraudster and tax cheat — now he’s also a thief

26 minutes ago

Sarah Jessica Parker divides opinion with enormous hat on set of And Just Like That - as fans miss Sex and the City costume designer Patricia Field

26 minutes ago

Dollar firm as investors await Fed guidance

26 minutes ago

Black Labour MP Marsha de Cordova says people mix her up with her colleagues 'all the time'

26 minutes ago

Kari Lake's Chances at Winning Senate Race, According to Polls

26 minutes ago

Britain’s public parks are a green lifeline – stop fencing them off for the summer

26 minutes ago

Kevin Costner gets epic standing ovation for 'Horizon: An American Saga,' moved to tears

26 minutes ago

U.S. Supreme Court decliens to hear appeal over Maryland weapons ban

26 minutes ago

NASCAR All-Star Race: Joey Logano runs away with $1 million win

26 minutes ago

Scout’s Analysis: The evolution of Canucks goaltender Arturs Silovs

26 minutes ago

‘Superman’s Sara Sampaio Signs With UTA

27 minutes ago

Liverpool's new head coach confirmed on three-year deal

27 minutes ago

Madame Web's Netflix Streaming Numbers Are Actually Pretty Good

27 minutes ago

Drake Bell says he and former Nickelodeon exec Dan Schneider have spoken

29 minutes ago

'We're winning it next season!'

Kênh khám phá trải nghiệm của giới trẻ, thế giới du lịch