YZ Index
Topics
Browse curated AI topics with editorial overviews and the latest articles.
AI Benchmarks Compared
85 articles
AI model benchmarks are the foundation of model selection. Major benchmarks include MMLU, HumanEval, Chatbot Arena (LMSY
AI Coding Benchmarks
44 articles
Which AI model writes the best code? HumanEval and MBPP are common benchmarks, but they only test function-level complet
Instruction Compliance & WDCD
54 articles
Does your AI model actually follow instructions? Instruction compliance is the most critical evaluation dimension for en
OpenAI Topic
312 articles
OpenAI is the company behind ChatGPT, GPT-4, and DALL·E, led by Sam Altman. This topic covers OpenAI's latest news, prod
Anthropic Topic
206 articles
Anthropic develops the Claude model family with AI safety at the core of its mission. This topic tracks model releases, safety
AI Safety Topic
136 articles
AI Safety encompasses alignment, controllability, robustness, and ethical governance. The YZ Index addresses two often-o
AI Agents Topic
128 articles
AI Agents are reshaping software development and enterprise workflows. The YZ Index directly measures two core Agent cap
AI Ethics Topic
96 articles
AI Ethics explores bias, fairness, privacy, transparency, and social impact in AI. The YZ Index Integrity Rating approac
xAI Topic
78 articles
xAI is Elon Musk's AI company, developer of the Grok model family. This topic tracks xAI's technical progress and Coloss
Generative AI Topic
77 articles
Generative AI covers automatic content generation across text, images, audio, and video. The YZ Index focuses on text ge
Meta AI Topic
68 articles
Meta's AI efforts span the Llama open-source models, AI assistants, and the metaverse. The YZ Index continuously evaluat
Google AI Topic
65 articles
Google is an AI pioneer with DeepMind producing milestones like Gemini and AlphaFold. The YZ Index evaluates Gemini mode
AI Regulation Topic
55 articles
AI Regulation covers global legislation and industry self-governance. YZ Index evaluation data provides objective compli