Which AI model should you use today?
We benchmark them every week.
11 models · 212 randomly sampled questions · Real code execution · Citation verification · Rolling-average rankings
Don't trust press releases. Check sustained performance.
Who to Use Right Now
Start with the overall ranking, then drill into the dimension you care about.
The full leaderboard shows not just who's leading, but how stable that lead is.
View Full Leaderboard
Who's Up, Who's Down
One-time spikes don't count. We care about whether sustained performance has shifted.
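As a rough illustration of why a rolling average discounts one-time spikes, here is a minimal sketch. It is not the leaderboard's actual scoring code: the function names, the 4-week window, and the 3-point threshold are all assumptions chosen for the example.

```python
from statistics import mean


def rolling_average(scores, window=4):
    """Trailing mean for each week, starting once a full window is available."""
    return [
        mean(scores[i - window + 1 : i + 1])
        for i in range(window - 1, len(scores))
    ]


def sustained_shift(scores, window=4, threshold=3.0):
    """Difference between the latest window's mean and the previous window's mean.

    Returns None when there is too little history, and 0.0 when the change
    is smaller than the (hypothetical) threshold.
    """
    if len(scores) < 2 * window:
        return None
    recent = mean(scores[-window:])
    previous = mean(scores[-2 * window : -window])
    delta = recent - previous
    return delta if abs(delta) >= threshold else 0.0


# Hypothetical weekly scores: one good week versus a lasting improvement.
one_week_spike = [70, 70, 70, 70, 70, 80, 70, 70]
lasting_gain = [70, 70, 70, 70, 78, 78, 78, 78]

print(sustained_shift(one_week_spike))  # below the threshold, reported as 0.0
print(sustained_shift(lasting_gain))    # the full 8-point gain is reported
```

Comparing two adjacent windows rather than two single weeks is what keeps a single unusually good or bad run from moving the ranking.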
Don't just look at the overall score — consider your use case
Worth reading today — beyond the hype
We only feature content that impacts capability, pricing, stability, or model selection.
Not all AI news is worth reading. What matters is what changes your judgment.
View All News
Why This Leaderboard Is Worth Your Attention
Not because we're loud, but because our methods are open, our rules are fixed, and our results are traceable.
The AI world changes daily — you need a reliable source
3 curated picks daily, weekly index changes, instant alerts for incidents and price shifts. Free, no ads, unsubscribe anytime.
- Daily Picks: from the flood of AI news, we pick the 3 stories that truly matter
- YZ Index Weekly: who's up and who's down, all in one email
- Model Incident Alerts: when a model you use has an issue, you know immediately
- Price Change Notifications: hear about API price changes before they show up on your bill
Want deeper analysis? Go further.
The leaderboard answers "who's stronger." Research Lab answers "why." It covers model safety, edge deployment, and performance teardowns, with conclusions drawn from our own testing rather than rehashed papers.
Enter Research Lab