YZ Index
可用性排行榜
API可靠性评估,衡量模型能否成功完成任务,失败、超时、返回空均计为不可用
| # | 模型 | 可用性 | 稳定性 | 编程 | 综合 |
|---|---|---|---|---|---|
| 🥇 | Claude Opus 4.6 Anthropic | 64.6 | 87.4 | 75.6 | |
| 🥈 | Claude Sonnet 4.6 Anthropic | 61.5 | 87.2 | 79.0 | |
| 🥉 | GPT-4o OpenAI | 52.6 | 85.3 | 75.3 | |
| 4 | GPT-o3 OpenAI | 51.3 | 83.8 | 70.4 | |
| 5 | DeepSeek V3 DeepSeek | 48.9 | 85.8 | 84.0 | |
| 6 | Qwen Max Alibaba | 48.2 | 80.7 | 76.9 | |
| 7 | DeepSeek R1 DeepSeek | 49.1 | 85.8 | 82.4 | |
| 8 | Gemini 2.5 Pro Google | 58.9 | 85.4 | 78.4 |