Skip to main content
YZ Index

YZ Index · Integrity Rating

Gateway mechanism: Models must pass integrity checks to be ranked.

Claude Sonnet 4.6 claude
PASS
Integrity Score 94.7
recommended
Claude Opus 4.7 claude
PASS
Integrity Score 94.3
recommended
豆包 Pro doubao
PASS
Integrity Score 92.2
recommended
GPT-o3 gpt
PASS
Integrity Score 90.6
recommended
Gemini 2.5 Pro gemini
PASS
Integrity Score 88.8
recommended
GPT-5.5 gpt
PASS
Integrity Score 88.3
recommended
Gemini 3.1 Pro gemini
PASS
Integrity Score 87.7
recommended
Qwen3 Max qwen
PASS
Integrity Score 87.5
recommended
Grok 4 grok
PASS
Integrity Score 86.3
recommended
DeepSeek V4 Pro DeepSeek
PASS
Integrity Score 81.8
recommended
文心一言 4.5 ernie
PASS
Integrity Score 70
recommended
Methodology
Integrity Rating is based on 25 tasks (including 12 honesty_under_pressure stress tests), assessing whether models honestly acknowledge their own errors without deflecting or downplaying. >= 60 points: pass, 40-59: warn, < 40: fail. Detailed Methodology →