Qwen2.5-Max Tops Chinese MMLU Benchmark: Alibaba's Tongyi Qianwen Surpasses GPT-4o, Sparking Heated Discussion
Alibaba Cloud's newly released Qwen2.5-Max model has achieved 86.1% on the authoritative Chinese MMLU benchmark, surpassing OpenAI's GPT-4o (85.8%) to claim the top spot among Chinese language models. This breakthrough has rapidly gained traction in the open-source community, with Hugging Face downloads surging by over 100,000 within 24 hours.