Oxford researchers seemingly found a 'semantic entropy cure' for AI hallucination episodes: "Getting answers from LLMs is cheap, but reliability is the biggest bottleneck."

What you need to know

  • Aside from privacy and security, hallucination and the spread of misinformation are among the biggest deterrents preventing AI from advancing.
  • A new study leverages semantic entropy to assess the quality and different meanings of generated outputs to determine the quality of responses and spot traces of hallucination.
  • However, semantic entropy demands more computing power and resources, including time.

AI is revolutionizing how people interact with the internet, which doesn't sit well with publishers, websites, and writers. This is because AI chatbots steal lift information from thoroughly researched articles and generate curated and precise responses to queries. The issue has landed top players in the AI landscape, including OpenAI and Microsoft, in the corridors of justice over copyright infringement issues.

As you may know, AI chatbots like ChatGPT and Microsoft Copilot heavily rely on copyrighted content for their responses. Interestingly, OpenAI CEO Sam Altman admitted it's impossible to develop ChatGPT-like tools without copyrighted content. The ChatGPT maker argued that copyright law doesn't forbid training AI models using copyrighted material.

Perhaps more interestingly, while tools like Copilot and ChatGPT still fend for data from online sources, there have still been reports of hallucinations, the spread of misinformation, or the outright presentation of wrong information. When you launch Copilot in Windows 11, you'll find a disclaimer indicating "Copilot uses AI. Check for Mistakes."

According to a new study, a group of Oxford researchers have seemingly found a way around this critical issue. Copilot has been spotted spreading misinformation about the forthcoming US Presidential elections, with researchers indicating that the problem is systemic after establishing a pattern. With the prevalence of such critical issues and deep fakes, more users are having reservations about the technology and taking everything they see with a pinch of salt.

Prof. Yarin Gal says:

“Getting answers from LLMs is cheap, but reliability is the biggest bottleneck. In situations where reliability matters, computing semantic uncertainty is a small price to pay.”

Misinformation continues to prevail with the rapid adoption of AI

microsoft, windows, microsoft, oxford researchers seemingly found a 'semantic entropy cure' for ai hallucination episodes:

Semantic entropy helps identify AI hallucinations, but requires more computing power. (Image credit: Bing Image Creator)

According to Former Twitter CEO Jack Dorsey:

"Don't trust; verify. You have to experience it yourself. And you have to learn yourself. This is going to be so critical as we enter this time in the next five years or 10 years because of the way that images are created, deep fakes, and videos; you will not, you will literally not know what is real and what is fake."

Dorsey adds that everything will soon feel like a simulation as AI models and chatbots become more sophisticated. However, the Oxford researchers at the very least have found a way around the issue, as highlighted in their report:

"With previous approaches, it wasn’t possible to tell the difference between a model being uncertain about what to say versus being uncertain about how to say it. But our new method overcomes this."

AI chatbot hallucination is a broad topic, but the researchers break it down into two parts — "We want to focus on cases where the LLM is wrong for no reason (as opposed to being wrong because, for example, it was trained with bad data),” indicated Dr. Sebastian Farquhar, from the University of Oxford’s Department of Computer Science while speaking to Euronews Next.

The study entailed scrutinizing the varied meanings of the responses generated via semantic entropy, which goes beyond the sequence of the words. Semantic entropy can determine the difference in the meanings of the outputs generated. If the analysis detects a high level of semantic entropy, it essentially means there is a huge difference in the meaning of the generated outputs.

According to Dr. Sebastian Farquhar:

“When an LLM generates an answer to a question you get it to answer several times. Then you compare the different answers with each other. In the past, people had not corrected for the fact that in natural language there are many different ways to say the same thing. This is different from many other machine learning situations where the model outputs are unambiguous."

The research was conducted on six models, including OpenAI's GPT-4. The researchers' findings indicate semantic entropy is more efficient and effective at spotting questions picked from Google searches, technical biomedical questions, and more compared to other methods prone to wrong responses.

The only downside of semantic entropy is that it requires more computing power and resources.

OTHER NEWS

18 minutes ago

Mayor Adams just got taken for a ride by the spendthrift City Council

19 minutes ago

Pastor dies in house fire that killed six family members after running back inside to save grandchildren

19 minutes ago

Jewish teachers file antisemitism complaint against B.C. Teachers' Federation: lawyer

19 minutes ago

I am concerned about the climate crisis - who should I vote for?

19 minutes ago

Cheney fires back after Trump boosts post calling for her televised military tribunal

19 minutes ago

Enjoy your retirement, fans tell Andy Murray amid Wimbledon disappointment

19 minutes ago

Bin strikes planned at more than half of Scotland’s councils

19 minutes ago

Cristiano Ronaldo addresses emotional reaction as Portugal star outlines new retirement plan

19 minutes ago

Supreme Court sidesteps several new gun cases, including challenge to state assault weapons ban

19 minutes ago

White House unveils excessive-heat protection proposal for U.S. workers

19 minutes ago

Voters Kick All The Republican Women Out Of South Carolina Senate

19 minutes ago

Xbox Live is down, so you can’t sign into your Xbox account

19 minutes ago

Newfoundland and Labrador fishers say commercial cod fishery should not reopen

19 minutes ago

Shell heist: Two ex-employees made profit of at least $1 million each from stolen fuel

19 minutes ago

LARRY KUDLOW: Trump is putting together a massive working-class coalition that can bring him to victory

20 minutes ago

How To Watch 1996's Twister At Home Ahead Of Twisters

20 minutes ago

NCIS: Who Does Sean Murray's Daughter Cay Ryan Play In The Brat Pack?

20 minutes ago

Photos: Rolling Stones at Soldier Field in Chicago

21 minutes ago

Saskatchewan travellers in limbo after July long weekend WestJet strike

21 minutes ago

NBA free agency: Experts grades for biggest deals and best remaining players

21 minutes ago

Ottawa Senators Forward Mathieu Joseph Traded to St. Louis Blues

21 minutes ago

All About Saweetie's Parents, Trinidad Valentin and Johnny Harper

21 minutes ago

Earth's inner core reversed direction and is slowing down, and scientists don't know why

21 minutes ago

Chelsea sign Leicester midfielder Kiernan Dewsbury-Hall for £30m

21 minutes ago

Micah Parsons hits back at Dallas Cowboys teammate over his podcasting hobby

21 minutes ago

Which flavor won Blue Bell's discontinued flavor tournament? Here's the scoop on the winner

21 minutes ago

Gareth Southgate clears up Cole Palmer's ideal England role after starring off the bench

21 minutes ago

US Soccer issues statement on the future of Gregg Berhalter

21 minutes ago

Family with £62k income forced to use Klarna to do weekly food shop

21 minutes ago

Crystal Palace Keen on Shock Wilfried Zaha Return

21 minutes ago

ElfBar supplier Supreme ‘not concerned’ about potential future vaping ban

21 minutes ago

Easy energy-saving hacks to save money on bills after Ofgem price cap update

21 minutes ago

Popular Irish skincare brand wins one of the biggest beauty awards in the industry

21 minutes ago

Millions of households struggle to pay energy bills despite price drop – charity

21 minutes ago

Wimbledon Day Three order of play: Men's no.1 Jannik Sinner features on Centre Court, Emma Raducanu faces Elise Mertens on Court No.1 which also has last year's champion Carlos Alcaraz in action

21 minutes ago

Dan Evans hits out at Wimbledon officials and accuses opponent's team of making comments towards him during suspended first-round clash with Alejandro Tabilo

21 minutes ago

Video: Love Island fans are left fuming after claiming Casa Amor new boy Blade is using 'dirty tactics' to win over Uma

21 minutes ago

Video: Teresa Giudice now believes ex-husband Joe Giudice DID cheat on her: 'He still f***ing won't admit it!'

21 minutes ago

Video: Tones and I flaunts her slimmed-down legs in tiny shorts as she watches basketball game between Australia Boomers and China in Melbourne

21 minutes ago

Robert Towne Dies: Oscar-Winning ‘Chinatown’ Screenwriter Was 89