Top StoriesTechFinanceHealthEnergySportsCulture
Tech & Science

AI benchmarks are broken. Here’s what we need instead.

By MIT Technology Review · 2026-03-31
AI benchmarks are broken. Here’s what we need instead.
Why it matters: Flawed AI benchmarks risk misdirecting development and misrepresenting the technology's actual societal value.
The long-standing paradigm of benchmarking AI against human performance in tasks like chess or essay writing is fundamentally flawed, failing to accurately assess AI's true capabilities and societal impact. A new approach is needed to move beyond simple human-outperformance metrics.

Share this story

More tech & science → Read original →

Get tech & science in your inbox

The best stories, summarized daily. Free.

No spam. Unsubscribe anytime.