Meta’s vanilla Maverick AI model ranks below rivals on a popular chat benchmark

By MNK News | April 12, 2025


Earlier this week, Meta landed in hot water for using an experimental, unreleased version of its Llama 4 Maverick model to achieve a high score on a crowdsourced benchmark, LM Arena. The incident prompted the maintainers of LM Arena to apologize, change their policies, and score the unmodified, vanilla Maverick.

Turns out, it’s not very competitive.

The unmodified Maverick, “Llama-4-Maverick-17B-128E-Instruct,” was ranked below models including OpenAI’s GPT-4o, Anthropic’s Claude 3.5 Sonnet, and Google’s Gemini 1.5 Pro as of Friday. Many of these models are months old.

Why the poor performance? Meta’s experimental Maverick, Llama-4-Maverick-03-26-Experimental, was “optimized for conversationality,” the company explained in a chart published last Saturday. Those optimizations evidently played well to LM Arena, which has human raters compare the outputs of models and choose which they prefer.
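To make the mechanism concrete, here is a minimal sketch of how pairwise human preferences can be turned into a ranking. It is an illustration only, not LM Arena's actual scoring method, which the article does not describe. The function names, the vote format, and the model names (`model-x`, `model-y`, `model-z`) are all hypothetical.

```python
from collections import defaultdict


def win_rates(votes):
    """Compute each model's share of head-to-head comparisons won.

    `votes` is an iterable of (model_a, model_b, winner) tuples, where
    `winner` is whichever of the two models the human rater preferred.
    This tuple format is an assumption made for illustration.
    """
    wins = defaultdict(int)
    games = defaultdict(int)
    for model_a, model_b, winner in votes:
        if winner not in (model_a, model_b):
            raise ValueError(f"winner {winner!r} did not take part in this comparison")
        games[model_a] += 1
        games[model_b] += 1
        wins[winner] += 1
    return {model: wins[model] / games[model] for model in games}


def rank(votes):
    """Return model names ordered by win rate, best first (ties broken by name)."""
    rates = win_rates(votes)
    return sorted(rates, key=lambda model: (-rates[model], model))


# Hypothetical votes: each tuple is one rater's choice between two outputs.
example_votes = [
    ("model-x", "model-y", "model-x"),
    ("model-x", "model-z", "model-x"),
    ("model-y", "model-z", "model-z"),
]
print(rank(example_votes))
```

Under a scheme like this, a model tuned to produce answers that raters find more pleasant to read would climb the ranking whether or not it is more capable on other tasks, which is consistent with the gap between the experimental and unmodified Maverick.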

As we’ve written about before, for various reasons, LM Arena has never been the most reliable measure of an AI model’s performance. Still, tailoring a model to a benchmark is misleading, and it also makes it hard for developers to predict how well the model will perform in different contexts.

In a statement, a Meta spokesperson told TechCrunch that Meta experiments with “all types of custom variants.”

“‘Llama-4-Maverick-03-26-Experimental’ is a chat optimized version we experimented with that also performs well on LM Arena,” the spokesperson said. “We have now released our open source version and will see how developers customize Llama 4 for their own use cases. We’re excited to see what they will build and look forward to their ongoing feedback.”

This article originally appeared on TechCrunch at https://techcrunch.com/2025/04/11/metas-vanilla-maverick-ai-model-ranks-below-rivals-on-a-popular-chat-benchmark/


