DeepSeek-V2 Released: Chinese Reasoning Capabilities Lead the Way, 236B Open-Source Model Challenges Global AI Landscape
Chinese AI startup DeepSeek has released its latest large language model DeepSeek-V2, featuring 236B parameters with MoE architecture, surpassing Claude 3.5 Sonnet in Chinese math reasoning and code generation, marking a significant breakthrough for open-source models.