YZ Index

Model Incident Reports

Auto-detected: overall crash / dimension collapse / strict task zeroed · updated weekly

10

GPT-4o Overall Score dropped 10.5 points

Overall Score Drop · GPT-4o · 2026-W14 · 03-30 05:00
10

GPT-4o Code Execution (v5) dropped 23.7 points

Dimension Drop · GPT-4o · 2026-W14 · 03-30 05:00