Doubao Pro Stability Plunges 19.8 Points: Inconsistent Answers to Same Questions Become Biggest Weakness
In this week's Winzheng AI evaluation, Doubao Pro's overall score rose by 16.1 points, but its stability score fell 19.8 points to 34.7, exposing a serious weakness: the model frequently gives different answers when asked the same question. The drop may stem from technical adjustments such as temperature parameter changes or model routing updates, and it points to a trade-off between capability gains and output predictability.
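To make the idea of answer consistency concrete, here is a minimal sketch of one way such a score could be computed: ask the same question several times and measure how often the responses agree with the most common answer. This is an illustrative assumption, not Winzheng's published methodology; the function names `normalize`, `consistency_score`, and `stability`, the exact-match comparison, and the 0 to 100 scale are all hypothetical.

```python
from collections import Counter


def normalize(answer: str) -> str:
    """Lowercase and collapse whitespace so trivial formatting differences do not count."""
    return " ".join(answer.lower().split())


def consistency_score(answers: list[str]) -> float:
    """Share of repeated answers (0-100) that match the most common answer.

    Assumption: exact match after normalization. A real evaluation would
    likely use semantic comparison instead of string equality.
    """
    if not answers:
        raise ValueError("need at least one answer")
    counts = Counter(normalize(a) for a in answers)
    modal_count = counts.most_common(1)[0][1]
    return 100.0 * modal_count / len(answers)


def stability(runs_by_question: dict[str, list[str]]) -> float:
    """Average the per-question consistency scores (hypothetical aggregation)."""
    if not runs_by_question:
        raise ValueError("need at least one question")
    scores = [consistency_score(answers) for answers in runs_by_question.values()]
    return sum(scores) / len(scores)


if __name__ == "__main__":
    # Made-up sample responses, used only to exercise the functions.
    sample = {
        "capital of France?": ["Paris", "paris ", "Lyon", "Paris"],
        "2 + 2?": ["4", "4", "4", "4"],
    }
    print(stability(sample))
```

Under this kind of measure, a higher sampling temperature would tend to lower the score, since more random sampling makes repeated runs diverge more often.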