模型一致性 (2 articles)

Doubao Pro Stability Plunges 19.8 Points: Inconsistent Answers to Same Questions Become Biggest Weakness

In this week's Winzheng AI evaluation, Doubao Pro's overall score increased by 16.1 points, but its stability dimension dropped sharply by 19.8 points to 34.7, revealing severe challenges in maintaining answer consistency. This phenomenon may result from technical adjustments like temperature parameter changes or model routing updates, reflecting a trade-off between capability enhancement and output predictability.

豆包Pro 稳定性测试 AI评测
330