| # | 模型 | 编程 | 稳定性 | 性价比 | 综合 | 编程 |
|---|---|---|---|---|---|---|
| 🥇 | Gemini 2.5 Pro Google | 100.0 | 44.8 | 62.7 | 74.7 | |
| 🥈 | Claude Opus 4.6 Anthropic | 93.3 | 83.4 | 10.6 | 81.1 | |
| 🥉 | Qwen Max Alibaba | 93.3 | 78.9 | 80.2 | 86.9 | |
| 4 | DeepSeek R1 DeepSeek | 87.8 | 77.8 | 99.6 | 87.6 | |
| 5 | GPT-4o OpenAI | 87.8 | 80.7 | 61.3 | 84.0 | |
| 6 | Claude Sonnet 4.6 Anthropic | 86.7 | 78.7 | 46.5 | 81.7 | |
| 7 | GPT-o3 OpenAI | 86.7 | 80.1 | 17.1 | 75.0 | |
| 8 | DeepSeek V3 DeepSeek | 75.6 | 91.4 | 100.0 | 83.1 |