Meta exec denies the company artificially boosted Llama 4’s benchmark scores

Image: a distorted Meta logo alongside the logos of Facebook, Instagram, WhatsApp, Oculus, and Messenger.

On Monday, a Meta executive denied rumors that the company had trained its new AI models to excel on specific benchmarks while hiding their weaknesses.

Ahmad Al-Dahle, Meta's VP of generative AI, said claims that the company trained its Llama 4 Maverick and Llama 4 Scout models on test sets are false. A test set is the data used to evaluate a model's performance after training; training on it would artificially inflate benchmark scores and give a false impression of the model's capabilities.

An unverified rumor surfaced over the weekend suggesting that Meta had manipulated the benchmark results of its new models. This rumor seemed to stem from a post on a Chinese social media platform by a former Meta employee who objected to the company’s benchmarking methods.

Reports that Maverick and Scout perform poorly on certain tasks fueled the rumor, as did Meta's use of an unreleased version of Maverick to achieve higher scores on the LM Arena benchmark. Researchers noticed significant differences in behavior between the publicly available Maverick model and the version used on LM Arena.

Al-Dahle acknowledged that users have reported uneven quality from Maverick and Scout across the different cloud platforms hosting the models.

He explained, “As we released the models as soon as they were ready, it may take some time for all public implementations to be optimized. We are actively addressing any issues and collaborating with our partners.”
