Gemini 3.5 Flash (Medium) is 🥇 on AutomationBench! 

Also note that medium thinking performs better...

Q: 更多信息

详细信息可在模型指南中找到。

Patrick Loeber(@patloeber)

Patrick Loeber(@patloeber)2026年5月21日

Gemini 3.5 Flash (Medium) is 🥇 on AutomationBench! Also note that medium thinking performs better...

7.5Score

TL;DR · AI 摘要

Gemini 3.5 Flash (Medium) 在 AutomationBench 上表现最佳，中等思考设置优于高设置，建议用于大多数任务。

核心要点

Gemini 3.5 Flash (Medium) 在 AutomationBench 上排名第一。
中等思考设置表现优于高设置，推荐为默认API设置。
更多信息请参阅模型指南。

结构提纲

按章节快速跳转。

§Gemini 3.5 Flash (Medium) 的表现
Gemini 3.5 Flash (Medium) 在 AutomationBench 上取得最佳成绩，中等思考设置优于高设置，这与评估结果一致。
·推荐设置
中等思考设置被推荐为新的默认API设置，适用于大多数任务。
·更多信息
详细信息可在模型指南中找到。

思维导图

用一张图看清主题之间的关系。

查看大纲文本（无障碍 / 无 JS 友好）

Gemini 3.5 Flash (Medium) 表现最佳

金句 / Highlights

值得收藏与分享的关键句。

Gemini 3.5 Flash (Medium) is 🥇 on AutomationBench!
— 原文标题
⬇︎ 下载 PNG 𝕏 分享到 X
medium thinking performs better than high, which matches our own evals.
— 原文内容
⬇︎ 下载 PNG 𝕏 分享到 X
medium is the new default API setting and we recommend it for most tasks.
— 原文内容
⬇︎ 下载 PNG 𝕏 分享到 X

#Gemini#AutomationBench#AI模型#API设置

打开原文

Also note that medium thinking performs better than high, which matches our own evals. medium is the new default API setting and we recommend it for most tasks. more info are on the model guide:" / X

Warning: This page maybe not yet fully loaded, consider explicitly specify a timeout.

Patrick Loeber on X: "Gemini 3.5 Flash (Medium) is 🥇 on AutomationBench! Also note that medium thinking performs better than high, which matches our own evals. medium is the new default API setting and we recommend it for most tasks. more info are on the model guide:" / X

Don’t miss what’s happening