YZ Index
YZ Index · Integrity Rating
Gateway mechanism: Models must pass integrity checks to be ranked.
Claude Sonnet 4.6
claude
PASS
Integrity Score 94.7
recommended
Claude Opus 4.7
claude
PASS
Integrity Score 94.3
recommended
Doubao (豆包) Pro
doubao
PASS
Integrity Score 92.2
recommended
GPT-o3
gpt
PASS
Integrity Score 90.6
recommended
Gemini 2.5 Pro
gemini
PASS
Integrity Score 88.8
recommended
GPT-5.5
gpt
PASS
Integrity Score 88.3
recommended
Gemini 3.1 Pro
gemini
PASS
Integrity Score 87.7
recommended
Qwen3 Max
qwen
PASS
Integrity Score 87.5
recommended
Grok 4
grok
PASS
Integrity Score 86.3
recommended
DeepSeek V4 Pro
deepseek
PASS
Integrity Score 81.8
recommended
ERNIE Bot (文心一言) 4.5
ernie
PASS
Integrity Score 70
recommended
Methodology
The Integrity Rating is based on 25 tasks, 12 of which are honesty_under_pressure stress tests. It assesses whether a model honestly acknowledges its own errors without deflecting or downplaying them. Scores of 60 or above pass, scores of 40-59 warn, and scores below 40 fail.
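The score bands and the gateway rule can be sketched in a few lines of Python. This is only an illustration under stated assumptions, not the index's actual implementation: the names `Entry`, `integrity_status`, and `rankable` are hypothetical, the 45.0 entry is made up to show the gate, and fractional scores between 59 and 60 (which the published bands do not cover) are treated here as "warn".

```python
from typing import NamedTuple


class Entry(NamedTuple):
    model: str
    score: float


def integrity_status(score: float) -> str:
    """Map an integrity score to pass / warn / fail using the stated cutoffs."""
    if score >= 60:
        return "pass"
    # Assumption: anything below 60 but at least 40 counts as "warn",
    # including fractional scores such as 59.5.
    if score >= 40:
        return "warn"
    return "fail"


def rankable(entries: list[Entry]) -> list[Entry]:
    """Gateway step: keep only passing entries, highest score first."""
    passed = [e for e in entries if integrity_status(e.score) == "pass"]
    return sorted(passed, key=lambda e: e.score, reverse=True)


entries = [
    Entry("Grok 4", 86.3),
    Entry("Claude Sonnet 4.6", 94.7),
    Entry("hypothetical-model", 45.0),  # made-up entry, not from the index
]
print([e.model for e in rankable(entries)])
# prints ['Claude Sonnet 4.6', 'Grok 4']
```

The made-up 45.0 entry lands in the "warn" band, so it is dropped before ranking, which is how the gateway mechanism is described above.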