Miles Released: Enterprise-Grade RL Framework Igniting Large-Scale MoE Training
Today we are releasing Miles, an enterprise-grade reinforcement learning (RL) framework designed for large-scale Mixture-of-Experts (MoE) training and production workloads, built on the proven foundation of slime.