Qwen2.5 - AI News | 赢政天下

Qwen2.5-Max Tops Chinese MMLU Benchmark: Alibaba's Tongyi Qianwen Surpasses GPT-4o, Sparking Heated Discussion

Alibaba Cloud's newly released Qwen2.5-Max model has achieved 86.1% on the authoritative Chinese MMLU benchmark, surpassing OpenAI's GPT-4o (85.8%) to claim the top spot among Chinese language models. This breakthrough has rapidly gained traction in the open-source community, with Hugging Face downloads surging by over 100,000 within 24 hours.

Alibaba Cloud Releases Qwen2.5-Max: Math and Coding Benchmarks Surpass Gemini 1.5 Pro, Open Source Strategy Ignites Domestic AI Discussion

Alibaba Cloud has officially launched Qwen2.5-Max, a Tongyi Qianwen large language model that outperforms Google's Gemini 1.5 Pro on mathematics and coding benchmarks. Its free, open-source strategy quickly went viral in the Chinese AI community, drawing over 30,000 reposts.

Alibaba's Qwen2.5-Max Makes Strong Debut: Surpasses GPT-4o on Multiple Benchmarks, Setting New Heights for China's Closed-Source AI Models

Alibaba Cloud's Tongyi Qianwen team has launched Qwen2.5-Max, which outperforms OpenAI's GPT-4o on multiple authoritative benchmarks. The release has ignited enthusiasm in the Chinese AI community, with related discussions on X quickly exceeding 80,000 posts.

Alibaba's Qwen2.5-Max Tops Arena-Hard Leaderboard, Surpassing GPT-4o and Sparking New AI Debate

Alibaba Cloud's Qwen2.5-Max model has taken the top position on the authoritative Arena-Hard leaderboard, surpassing GPT-4o and supporting a 128K context window. The news has generated over 200,000 interactions across social platforms, marking a significant milestone for Chinese AI development.
