YZ Index — AI Model Benchmarks, News & Research

Latest News

News 05-26 04:00 TC
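ClickUp Mass Layoffs: AI Agents Are Replacing White-Collar Work
ClickUp, a nine-year-old project management startup, announced it will replace hundreds of employees with thousands of AI agents, sending shockwaves through the industry. The decision not only shows that AI is penetrating the workplace far faster than expected, but also signals a fundamental change in how work will be done. This article, compiled from an in-depth TechCrunch report, explores the layoffs' underlying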
Review 05-26 03:10
Claude Sonnet 4.6 Material Constraint Plunges 22 Points, Code Execution Hits 100
In today's Smoke evaluation, Claude Sonnet 4.6 saw its Material Constraint score drop from 96.50 to 74.50, a 22-point si
Review 05-26 03:10
Claude Opus 4.7's Main Score Plunges 8.2 Points, Material Constraint Drops 18.3 in a Single Day
In today's Smoke review, Claude Opus 4.7's main score dropped to 88.53 points, down 8.2 points from yesterday, placing t
Review 05-26 03:10
Gemini 2.5 Pro Plunges 35.6 Points on Main Leaderboard, DeepSeek V4 Pro Tops Smoke Benchmark
Overnight Smoke lightweight evaluation data shows Gemini 2.5 Pro collapsing with its main score dropping to 61.03, execu
News 05-26 00:02 TC
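Early Bird Countdown, 5 Days Left: Save $410 on TechCrunch Disrupt 2026 Tickets
TechCrunch Disrupt 2026 will be held in San Francisco. Early bird pricing ends May 29 at 11:59 p.m. Pacific Time, with savings of up to $410. This article summarizes the event's highlights, analyzes trends in tech conferences, and reminds founders to seize the last chance to save.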
News 05-26 00:01 TC
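Startup Battlefield 200 Applications Close Soon: Seize the Opportunity Before May 27
Applications for Startup Battlefield 200, the competition run by the well-known tech media outlet TechCrunch, close on May 27. Winners receive direct access to VCs, global exposure, TechCrunch feature coverage, and a $100,000 prize. It is a golden path to accelerated growth for startups, with only a few days left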
News 05-26 00:00 TC
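The Pope's AI Encyclical: Reflecting on the Monopoly of Power Through the Fog of Technology
Pope Leo XIV has issued his first encyclical, using artificial intelligence as a prism to confront deep-seated ills of contemporary society: excessive concentration of power, the erosion of democratic institutions, and tech elites reshaping the world in their own interest. This article, compiled from an in-depth TechCrunch analysis, reveals the real concern behind the encyclical: AI is only the starting point, and the crux is how to make technology serve the common good of humanity.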
News 05-25 20:00 WD
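The AI Era Spawns a Vulnerability-Hunting Arms Race
As attackers increasingly use AI to speed up exploit development, the way software vulnerabilities are hunted is changing profoundly. From automated vulnerability discovery to the generation of adversarial examples, AI is empowering attackers and defenders alike. This in-depth report analyzes the emerging arms race and explores how the security industry is responding to the escalation of AI-driven threats.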
News 05-25 11:10 NF
LQA Agent Reaches 90% Agreement with Human Reviewers: Smartling Bets on AI to Reshape Enterprise Localization
Smartling, a localization software service provider, announced on May 19 what it calls its "largest-ever" update to AI t
News 05-25 11:05 NF
DeepSeek Makes V4-Pro's 75% Discount Permanent: A High-Stakes Bet to Reshape Global AI API Pricing Logic
DeepSeek's permanent 75% discount on V4-Pro signals a fundamental shift from temporary promotion to permanent pricing, e
News 05-25 11:00 NF
Taiwan Launches National AI Strategy Committee: Risk Assessment by July, Industry Regulations by 2028, Asia-Pacific Governance Race Quietly Accelerates
Taiwan has established a National AI Strategy Committee chaired by the Premier, initiating the implementation of the AI
News 05-25 07:02
3-Model Translation Showdown: Week 22 Quality Evaluation, gpt-o3 Leads with 8.3 Points
This week, 237 translation tasks were completed by 3 models. A blind evaluation of 3 samples across multiple models foun