简报 | Alfred's Site

5个源，过滤9篇2026-05-17

RLVR might be disproportionately bad at science

Dwarkesh Patel

Dwarkesh 用科学史（哥白尼、牛顿、Prout 假说）论证：科学的 verification loop 动辄以十年百年计，RLVR 擅长的"紧密反馈循环"在真实科研中几乎不存在。更危险的是，判断哪条研究路线是"进步的"还是"退步的"，事前根本无法自动化——这正是 AI 做科研的核心卡点。可迁移到产品：凡是正确答案不能被快速 verify 的领域，reward hacking 都是系统性风险。

The mistake of conflating intelligence and power

Dwarkesh Patel

Dwarkesh Patel 拆解了"智能 = 权力"这个 AI 安全讨论中的隐含假设——当前训练方式（让 AI 成为更好的程序员和思考伙伴）与权力获取之间的相关性其实很弱。真实世界的权力来自信任与协调，而非"超级大脑的单点算计"，这对我们怎么评估 AI 风险有根本性影响。

DeepSeek-V4-Flash means LLM steering is interesting again

Brajeshwar

DeepSeek-V4-Flash 的开源让 LLM steering（激活层操控）首次有了足够强的免费实验平台，作者系统梳理了 steering 的原理、历史障碍和真正潜力所在。核心论点：steering 对"无法用 prompt 表达的概念"（如原始推理能力）可能是唯一路径，但目前大多数场景仍被普通 prompting 完全覆盖。对关注 AI/LLM 工具生态的开发者，这是理解"激活空间操控 vs prompt 工程"边界在哪里的好入口。

Moving away from Tailwind, and learning to structure my CSS

mpweiher

Julia Evans 记录了从 Tailwind 迁移到原生 CSS 的过程，核心洞察是：框架真正的价值不在于语法，而在于它背后隐含的组织系统——color palette、font scale、reset stylesheet。把这些系统内化后，框架本身变成可选项。对"如何从工具依赖中毕业"这个问题有一定的方法论价值。

supertone-inc/supertonic

Supertonic v3 是一个 99M 参数的端侧多语言 TTS，4 月底刚发布，支持 31 种语言，覆盖 Python/Swift/Rust/Browser 等 12 个平台。核心设计决策是"主动选小模型"——用 99M 换来零网络依赖、无 GPU 需求、可跑在树莓派上，以此对抗 0.7B–2B 级的云端方案。独立产品开发中遇到"需要语音功能但不想绑 API"的场景可直接落地。

Pluralistic: Making sense of Trump's unscheduled sudden midair disassembly of the American empire (16 May 2026)

Cory Doctorow

Doctorow 用"权力 ≠ 耐久"这一框架解读美帝国的结构性崩溃：美国科技公司的全球统治地位从来不是靠产品质量，而是靠地缘政治执行力——Trump 的关税和去美元化正在瓦解这个基础。文章顺带抛出了一个反直觉的 LLM 评价：像含铅汽油，个体专家用有益处，但所有人都用时结果是灾难性的。

Frontier AI has broken the open CTF format

frays

CTF（安全夺旗赛）的开放赛制被 frontier AI 彻底破坏——GPT-4 one-shot 中等难度题，Claude/GPT-5 级别的 agent 能横扫整场比赛，记分板不再衡量人类技能。作者的核心洞察可迁移：任何以"成绩/排名"为核心激励的竞争体系，一旦外部工具可以廉价替代核心操作，整个生态（出题人动机、学习路径、社区归属感）会一并崩溃——这个模式和"AI 写 leetcode + 刷题面试失效"、"AI 生成 test coverage 导致覆盖率指标失真"属于同一类结构性问题。

5个源，过滤9篇

RLVR might be disproportionately bad at science

Dwarkesh Patel

每日简报

RLVR might be disproportionately bad at science

The mistake of conflating intelligence and power

DeepSeek-V4-Flash means LLM steering is interesting again

Moving away from Tailwind, and learning to structure my CSS

supertone-inc/supertonic

Pluralistic: Making sense of Trump's unscheduled sudden midair disassembly of the American empire (16 May 2026)

Frontier AI has broken the open CTF format

RLVR might be disproportionately bad at science

The mistake of conflating intelligence and power

DeepSeek-V4-Flash means LLM steering is interesting again

Moving away from Tailwind, and learning to structure my CSS

supertone-inc/supertonic

Pluralistic: Making sense of Trump's unscheduled sudden midair disassembly of the American empire (16 May 2026)

Frontier AI has broken the open CTF format

Notes on pretraining parallelisms and failed training runs.

Quoting Julia Evans