Meta says Llama 3 beats most other models, including Gemini

19/04/2024

The next generation of Meta’s large language model Llama, which is being released today to cloud providers like AWS and will soon reach model libraries like Hugging Face, performs better than most current AI models, the company said in a blog post.

Llama 3 currently comes in two model sizes, with 8B and 70B parameters. (The B stands for billions; the parameter count is a rough measure of a model’s size and its capacity to learn patterns from training data.) It only offers text-based responses so far, but Meta says these are “a major leap” over the previous version. Llama 3 showed more diversity in answering prompts, had fewer false refusals where it declined to respond to questions, and could reason better. Meta also says Llama 3 understands more instructions and writes better code than before.

In the post, Meta claims both sizes of Llama 3 beat similarly sized models like Google’s Gemma and Gemini, Mistral 7B, and Anthropic’s Claude 3 in certain benchmarking tests. In the MMLU benchmark, which typically measures general knowledge, Llama 3 8B performed significantly better than both Gemma 7B and Mistral 7B, while Llama 3 70B slightly edged Gemini Pro 1.5.

(It is perhaps notable that Meta’s 2,700-word post does not mention GPT-4, OpenAI’s flagship model.)

It should also be noted that benchmark testing of AI models, though helpful for understanding how capable they are, is imperfect. The datasets used to benchmark models have sometimes been found in a model’s training data, meaning the model may already know the answers to the questions evaluators will ask it.
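To make the contamination concern concrete, here is a minimal sketch of one common way to look for it: measuring how many word sequences from a benchmark question also appear verbatim in a piece of training text. This is an illustration only, not the method Meta or any benchmark maintainer uses; the function names, the sample strings, and the 4-word window are all assumptions chosen for the example.

```python
def ngrams(text, n):
    """Return the set of n-word sequences in text (lowercased, split on whitespace)."""
    tokens = text.lower().split()
    return {tuple(tokens[i:i + n]) for i in range(len(tokens) - n + 1)}


def overlap_fraction(benchmark_item, training_text, n=4):
    """Fraction of the benchmark item's n-grams that also occur in training_text.

    A value near 1.0 suggests the item may have leaked into the training data.
    Returns 0.0 when the item is shorter than n words.
    """
    item_grams = ngrams(benchmark_item, n)
    if not item_grams:
        return 0.0
    return len(item_grams & ngrams(training_text, n)) / len(item_grams)


# Hypothetical data: one benchmark question and two training snippets.
question = "what is the capital of france and which river runs through it"
leaked = "forum post: what is the capital of france and which river runs through it answer paris"
clean = "the quick brown fox jumps over the lazy dog"

leaked_score = overlap_fraction(question, leaked)
clean_score = overlap_fraction(question, clean)
```

A high score on the first snippet and a zero score on the second is the pattern an evaluator would look for. Real checks run over far larger corpora and usually normalize punctuation and tokenization first.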

Benchmark testing shows both sizes of Llama 3 outperforming similarly sized language models.

Meta says human evaluators also marked Llama 3 higher than other models, including OpenAI’s GPT-3.5. Meta says it created a new dataset for human evaluators to emulate real-world scenarios where Llama 3 might be used. This dataset included use cases like asking for advice, summarization, and creative writing. The company says the team that worked on the model did not have access to this new evaluation data, and it did not influence the model’s performance.

“This evaluation set contains 1,800 prompts that cover 12 key use cases: asking for advice, brainstorming, classification, closed question answering, coding, creative writing, extraction, inhabiting a character/persona, open question answering, reasoning, rewriting, and summarization,” Meta says in its blog post.

Llama 3 performed better than most models in human evaluations, says Meta.

Llama 3 is expected to get larger model sizes (which can understand longer strings of instructions and data) and to be capable of multimodal responses to prompts like “Generate an image” or “Transcribe an audio file.” Meta says these larger versions, which have over 400B parameters and can ideally learn more complex patterns than the smaller versions of the model, are still training, but initial performance testing shows they can answer many of the questions posed by benchmarks.

Meta did not release a preview of these larger models, though, and did not compare them to other big models like GPT-4.
