💻 Technology Mar 31, 2026 · Angela Aristidou

AI benchmarks are broken. Here’s what we need instead.

MIT Technology Review
Authoritative reporting on emerging technologies
View Channel →
Source ↗ 👁 8 💬 0
For decades, artificial intelligence has been evaluated through the question of whether machines outperform humans. From chess to advanced math, from coding to essay writing, the performance of AI models and applications is tested against that of individual humans completing tasks. 



This framing is seductive: An AI vs. human comparison on isolated problems with clear right or wrong answers is easy to standardize, compare, and optimize. It generates rankings and headlines. 



But th

Comments (0)

Sign in to join the discussion

More Like This

📰
Grafana says stolen GitHub token let hackers steal codebase
BleepingComputer · 6d ago
Microsoft remembers that taskbars used to move
www.theregister.com - Articles · 6d ago
📰
Open source tool maker Grafana Labs says hackers stole its code, refuses to pay ransom
TechCrunch · 6d ago
The Catastrophic Swatch x Audemars Piguet Launch Was Entirely Predictable and Utterly Avoidable
WIRED · 6d ago
Google has sold so much TPU capacity that its own researchers are queueing for the rest
The Next Web · 6d ago
‘The Boys’ Finale Promises ‘Superheroes Are Done’
Gizmodo · 6d ago