No Free Lunch: MiniMax M2 Deconstructs Efficient Attention Mechanisms
SGLang announces first-day support for MiniMax M2, a flagship MoE model that returns to full attention after empirical findings show efficient attention methods face significant production deployment challenges.