博客阅读视频推文简报时间线

guquanbusiness@gmail.com LinkedIn X / Twitter

© 2026 Alfred Gu · Built with intention

推文 | Alfred's Site

推文

357 tweets · 110 sources

账号:全部 mattpocockuk dair_ai simonw bcherny adocomplete trq212 felixrieseberg dani_avila7 dexhorthy yetone zarazhangrui badlogicgames petergyang 0xblacklight ctatedev iamzhihui karpathy kunchenguid mckaywrigley ryancarson theo aidenybai dotey garrytan kieranklaassen leon7hao omarsar0 thdxr 9hills GeminiApp RLanceMartin RhysSullivan ZaynHao dillon_mulroy lennysan mvanhorn nummanali vikingmute yiliush 0xPaulius AYi_AInotes AlchainHust AnthropicAI BenJames_____ChadMoran ClaudeDevs DanielMiessler DaveJ DavidKPiano GoogleDeepMind HilaShmuel Jack_W_Lindsey Jiaxi_Cui LinghuaJ Meari_V2_0_G Mnilax RaillyHugo Taniyatweets_addyosmani aibuilderclub_alliekmiller andrewfarah antirez artman asynkimo bbssppllvv cailynyongyong cgtwts charmaine_klee chrisbarber chrisparkX clairevo danshipper demishassabis dingyi doodlestein elithrar elonmusk ericzakariasson hylarucoder imdigitalashish jakevin7 jarredsumner jerryjliu0 joshalbrecht jpschroeder kaiofreitas lexrus logancyang mitchellh petradonka prathamgrv prathyvsh quanruzhuoxiu raroque rohanpaul_ai sama sawyerhood servasyy_ai shao__meng steipete techgirl1908 thorstenball tobiaswup tricalt tuturetom untraceable_the velvet_shark ycombinator ziwenxu_

@dani_avila7·7h agoFor You

At my previous startup we tried RAG with semantic search and vector DBs on large codebases The index was always one sprint behind the repo What actually works... invest in the harness and let the agent traverse the live codebase https://t.co/8zlIbtQdt6

@dani_avila7

Read it before the weekend's over! Every enterprise Claude Code rollout hits the same wall: the model gets better, the setup doesn't, nobody owns it Anthropic just named the role that fixes that: Agent Manager... Owns the CLAUDE.md hierarchy, the plugin marketplace, permissions, and which skills ship

↗ view on x.com

我之前的创业公司在大型代码库上试过 RAG + 语义搜索 + vector DB。索引永远落后代码库一个 sprint。真正有效的做法：投入到 harness 本身，让 agent 直接遍历实时代码库。 --- 引用推文：趁周末结束前读一下！ Claude Code 的每次企业级推广都撞上同一堵墙：模型越来越强，搭建方式没跟上，没人负责。 Anthropic 刚刚给解决这个问题的角色命名了：Agent Manager…… 负责 CLAUDE.md 层级体系、plugin 市场、权限管理，以及哪些 skill 上线。

@kunchenguid·10h agoFor You

i never use “remote control” in agent harnesses because they feel more like middle grounds i tailscale to my mac + use terminus to do the real thing. this setup has no extra cost https://t.co/I9sJT1Ot0W

↗ view on x.com

我从不在 agent harness 里用「remote control」，因为感觉更像是个不上不下的折中方案。我直接 Tailscale 连到 Mac，然后用 Terminus 做真正的操作。这套方案零额外成本。

@RhysSullivan·17h agoFor You

do you still care about backwards compatibility / breaking changes in libraries now that agents are writing most of your code?

↗ view on x.com

现在 agent 写了你大部分代码，你还在意库的向后兼容性 / breaking changes 吗？

@simonw·18h ago

Also a great example of positive contribution to open source by wanderingmeow - you don't need to contribute code to have a positive impact, just providing detailed feedback and confirmation that something like this works is enormously useful

@antirez

High quality interactions are still possible in the AI era. https://t.co/CjkD9gzo6G

↗ view on x.com

这也是 wanderingmeow 对开源正向贡献的好例子——你不需要贡献代码才能产生正向影响，仅仅提供详细反馈、确认某个功能确实可用，本身就极具价值。 [引用 @antirez]：AI 时代，高质量的交流依然是可能的。

@mattpocockuk·1d ago

The power of /grill-with-docs: "when a prospective steward is able adopt a spark which they've already pledged to, the orange accept button is too far down the page and gets clipped by the navigation gantry" A shared language, curated with the agent, means you're able to describe requirements with perfect precision.

@Brian_Mosley_UK

@mattpocockuk Just introducing /grill-with-docs... "Use the grill me with docs skill to check my terminology here - when a prospective steward is able adopt a spark which they've already pledged to, the orange accept button is too far down the page and gets clipped by the navigation gantry."

↗ view on x.com

/grill-with-docs 的威力：「当一个潜在的 steward 能够认领他们已经承诺过的 spark 时，橙色的接受按钮跑到了页面太靠下的位置，被导航栏给遮住了」一套与 agent 共同打磨的共享语言，意味着你能用精确无歧义的词汇来描述需求。【引用 @Brian_Mosley_UK】：@mattpocockuk 刚刚用了 /grill-with-docs……「用 grill me with docs skill 帮我检查这里的术语——当一个潜在的 steward 能够认领他们已承诺的 spark 时，橙色的接受按钮跑到了页面太靠下的位置，被导航栏给遮住了。」

@tricalt·1d agoFor You

Memory isn't a plugin. Skills aren't a plugin. They're the same harness.

↗ view on x.com

Memory 不是插件。Skills 不是插件。它们和 harness 是同一个东西。

@aibuilderclub_·1d agoFor You

1/ Keep running claude -p on your Claude Code subscription after June 15. A 200-line bash wrapper pipes your prompt into the Claude Code session you already have open. The call rides your subscription, not the new Agent SDK credit bucket. Looks like claude -p. Runs on the plan you already pay for. How it works 👇

↗ view on x.com

【1/】 6 月 15 日之后还想继续在 Claude Code 订阅上跑 claude -p？一个 200 行的 bash wrapper，把你的 prompt 导入已经开着的 Claude Code session。请求走你现有的订阅，不走新的 Agent SDK credit 池。用起来像 claude -p，跑在你已付费的套餐上。原理 👇

@theo·1d agoFor You

To prevent "programmatic use", Claude Code may now request webcam access to assure user is present when prompting https://t.co/Fleg7uY7xc

↗ view on x.com

为防止「程序化使用」，Claude Code 现在可能会请求摄像头访问权限，以确认用户在提示时真实在场

@dair_ai·1d ago

Are your benchmarks actually measuring the capability you think they measure? New paper says they probably not. Coined the "The Evaluation Trap", it provides a vocabulary for auditing whether your eval discriminates the underlying capability or just proxies behaviors that happen to correlate. Most benchmarks bake in implicit theory that nobody states explicitly, then evaluate as if the theory were neutral. Research indicates that most agent leaderboards are not measuring what we collectively think they are. Great read on evals, especially those making decisions on model selection. Paper: https://t.co/mPIiOhkB8P Learn to build effective AI agents in our academy: https://t.co/LRnpZN7L4c

↗ view on x.com

你的 benchmark 真的在衡量你以为它在衡量的能力吗？新论文说：很可能没有。作者提出「评估陷阱」（The Evaluation Trap）这一概念，提供了一套词汇来审查你的 eval 是否真正区分了底层能力，还是仅在代理衡量那些恰好相关的行为。大多数 benchmark 内置了隐性理论，这些理论从未被明确陈述，却被当作中立标准来评估。研究表明，大多数 agent 排行榜并没有在衡量我们集体认为它们在衡量的东西。强烈推荐关注 eval 的人阅读，尤其是基于 eval 做模型选型决策的人。

@dexhorthy·1d agoFor You

How to be good at ai coding, the parts orthogonal to the llm / ai skills and intuition

@dexhorthy

As @GeoffreyHuntley says “it’s just software engineering” ie some stuff is orthogonal to ai and all still true. Book recs include Clean code by @unclebobmartin pragmatic programmer, refactoring by @martinfowler - design patterns by GOF etc etc The other half is experience - as @nayshins says you know something’s bad because you’ve debugged it at 3am or spent weeks undoing some bad pattern

↗ view on x.com

如何在 AI 编程上做好——那些与 LLM/AI 直觉正交的部分【引用】正如 @GeoffreyHuntley 说的「这只是软件工程」——有些东西跟 AI 无关，依然成立。书单推荐：@unclebobmartin 的《Clean Code》、《Pragmatic Programmer》、@martinfowler 的《重构》、GOF 的《设计模式》等。另一半是经验——正如 @nayshins 说的，你之所以知道某个东西有问题，是因为你曾经凌晨三点调试过它，或者花了好几周才把某个烂模式撤销掉。

@zarazhangrui·1d agoFor You

AI psychosis: cycling between two mental states every single day ↑ after using coding agents: holy shit I'm omnipotent. I can build anything. I've never felt this powerful in my life. ↓ after scrolling twitter: holy shit I'm completely behind. everyone's ahead. the wave is moving and I'm going to get left.

↗ view on x.com

AI 精神病：每天在两种心理状态之间循环 ↑ 用完 coding agent 之后：我他妈无所不能。我能构建任何东西。我这辈子从没感觉这么强大过。 ↓ 刷完 Twitter 之后：我他妈完全落后了。所有人都领先于我。浪潮正在推进，我要被落下了。

@AYi_AInotes·1d agoFor You

在我刚入行那几年，记得代码库里有一种人是被默默崇拜的，他能在十几层调用栈里一眼看出 N+1，能在火焰图里指出哪个函数被多调了三次，今天 Greg Brockman 转的那个 Codex Skill，第一次让这件事不再是少数人的特权。性能优化为什么过去这么稀缺，你得会用 Chrome DevTools 拉火焰图，会用 Node --prof 跑 profile，会读 perf report，你得对渐进复杂度有近乎本能的敏感，能在嵌套十几层的代码里识别出 O(n²) 长什么样，你还得踩过几百个真实生产事故，知道哪种模式在百万级数据下会爆，这三样能力叠加起来，是十年项目经验才能稳定输出的活，在团队里就是稀缺资源，工资溢价就来自这里。 Greg 转的这个 Complexity Optimizer，是社区开发者做的一个 Codex Skill，一行 npx --yes codex-complexity-optimizer 装完，在项目根目录跟 Codex 说一句 analyze my codebase，几秒钟跑完，它专挖 O(n²)、O(n*m)、N+1、循环里套循环、每次渲染都扫全表那种隐藏坑，每一条都精确到文件 + 行号 + 当前复杂度 + 优化后复杂度 + 推荐改法 + 风险等级，最重要的设计是它默认只报告不动代码，每条标 low 或 medium 风险，还告诉你上线前要补哪些测试，也就是说 AI 不绕过人类决策，它做的是把人类做决策所需的信息全部准备好。但这个 Skill 真正让我感兴趣的，不是它能干什么，而是它意味着什么，过去两年 AI 写代码的故事，焦点一直在让代码写得更快上，可是写代码的速度，从来不是开发者真正的瓶颈，真正的瓶颈一直是看见自己看不见的问题——架构隐患、性能坑、安全漏洞、依赖陷阱，这些东西高度依赖个人经验积累，集中在少数资深开发者手里， Complexity Optimizer 真正的信号是，这类需要十年经验才能输出的能力，第一次被压进了一个可以一行命令调用的 Skill 里，这条路一旦走通，下一波 Skills 不会等太久——安全审计、依赖风险扫描、架构腐烂检测、内存泄漏侦察，全都会涌出来。总的来说，资深开发者的护城河不会消失，但定义在变，过去的护城河是看见问题的眼力，未来的护城河是判断 AI 给出的方案在你的业务场景下能不能落地的判断力，十年经验值正在被压缩成一行 npx 命令，这件事也许从今天就开始咯。

@gdb

codex for improving computational complexity

↗ view on x.com

在我刚入行那几年，记得代码库里有一种人是被默默崇拜的，他能在十几层调用栈里一眼看出 N+1，能在火焰图里指出哪个函数被多调了三次，今天 Greg Brockman 转的那个 Codex Skill，第一次让这件事不再是少数人的特权。性能优化为什么过去这么稀缺，你得会用 Chrome DevTools 拉火焰图，会用 Node --prof 跑 profile，会读 perf report，你得对渐进复杂度有近乎本能的敏感，能在嵌套十几层的代码里识别出 O(n²) 长什么样，你还得踩过几百个真实生产事故，知道哪种模式在百万级数据下会爆，这三样能力叠加起来，是十年项目经验才能稳定输出的活，在团队里就是稀缺资源，工资溢价就来自这里。 Greg 转的这个 Complexity Optimizer，是社区开发者做的一个 Codex Skill，一行 npx --yes codex-complexity-optimizer 装完，在项目根目录跟 Codex 说一句 analyze my codebase，几秒钟跑完，它专挖 O(n²)、O(n*m)、N+1、循环里套循环、每次渲染都扫全表那种隐藏坑，每一条都精确到文件 + 行号 + 当前复杂度 + 优化后复杂度 + 推荐改法 + 风险等级，最重要的设计是它默认只报告不动代码，每条标 low 或 medium 风险，还告诉你上线前要补哪些测试，也就是说 AI 不绕过人类决策，它做的是把人类做决策所需的信息全部准备好。但这个 Skill 真正让我感兴趣的，不是它能干什么，而是它意味着什么，过去两年 AI 写代码的故事，焦点一直在让代码写得更快上，可是写代码的速度，从来不是开发者真正的瓶颈，真正的瓶颈一直是看见自己看不见的问题——架构隐患、性能坑、安全漏洞、依赖陷阱，这些东西高度依赖个人经验积累，集中在少数资深开发者手里， Complexity Optimizer 真正的信号是，这类需要十年经验才能输出的能力，第一次被压进了一个可以一行命令调用的 Skill 里，这条路一旦走通，下一波 Skills 不会等太久——安全审计、依赖风险扫描、架构腐烂检测、内存泄漏侦察，全都会涌出来。总的来说，资深开发者的护城河不会消失，但定义在变，过去的护城河是看见问题的眼力，未来的护城河是判断 AI 给出的方案在你的业务场景下能不能落地的判断力，十年经验值正在被压缩成一行 npx 命令，这件事也许从今天就开始咯。

@mattpocockuk·1d ago

Think I might need to revisit Effect on my YouTube channel There is literally no better AX for building backend stuff in TypeScript. Once you see it through that lens it's addictive

↗ view on x.com

感觉得重新在 YouTube 频道上讲一讲 Effect 了。用它做 TypeScript 后端，开发体验（AX）真没有比这更好的了。一旦从这个视角看进去，就会上瘾。

@ctatedev·2d agoFor You

Introducing Zero The programming language for agents. I wanted a systems language that was faster, smaller, and easier for agents to use and repair. Explicit capabilities. JSON diagnostics. Typed safe fixes. Made for agents on day zero. https://t.co/uTrDOmyBR1

↗ view on x.com

介绍 Zero——专为 agent 设计的编程语言。我想要一门更快、更小、更易于 agent 使用和修复的系统语言：显式能力声明、JSON 诊断、类型安全修复。从第一天起就为 agent 而生。

@RhysSullivan·2d agoFor You

people don't realize error handling is the least interesting thing about effect

@badlogicgames

effect wouldn't be necessary if TS had checked exceptions. don't @ me.

↗ view on x.com

大家没意识到，错误处理其实是 effect 最无聊的部分 [引用 @badlogicgames]：如果 TS 有 checked exceptions，effect 根本就没存在的必要。别来找我杠。

@adocomplete·2d ago

5-hour and weekly rate limits have been reset for Claude Code. Go building something fun this weekend :D

@ClaudeDevs

Happy Friday! We've reset everyone's 5-hour and weekly rate limits.

↗ view on x.com

Claude Code 的 5 小时和每周用量限制已重置。这个周末去做点有趣的东西吧 :D 【引用 @ClaudeDevs】周五快乐！我们已为所有人重置了 5 小时和每周用量限制。

@dair_ai·2d ago

// Beyond Individual Intelligence // One of the more useful multi-agent surveys I've read this year. 200+ papers mapped along three axes: collaboration mechanisms, failure attribution, and self-evolution. The self-evolution chapter is the cleanest field map of where memory, meta-learning, and procedure-editing approaches actually intersect. Paper: https://t.co/JHBPVFe1l7 Learn to build effective AI agents in our academy: https://t.co/LRnpZN7L4c

↗ view on x.com

// 超越个体智能 // 今年读过最有价值的多智能体综述之一。 200+ 篇论文沿三个轴展开：协作机制、失败归因、自我进化。 self-evolution 章节是目前最清晰的领域地图，清楚地呈现了 memory、meta-learning、procedure-editing 这几条路线在哪里真正交叉。论文：[链接]

@dair_ai·2d ago

Great paper discussing agentic search vs. vector search.

@omarsar0

// Is Grep All You Need? // Pay attention to this on, AI devs. (bookmark it) They find that grep-style text search, when wrapped in the right agent harness, matches or beats embedding-based retrieval on coding-agent tasks. Are vector databases even needed where this is all going? It might be that what coding agents needed was not better embeddings. It was better harness design around primitive tools. If you operate a coding-agent stack that depends on a vector DB, it might be time to re-evaluate. My personal experience on this has been that agentic search, if done right, is more than good enough for a lot of use cases. But you also have to understand how to properly index and structure information for the agents to take advantage. At scale, vector databases do shine so take that into account as well. In most cases, a hybrid approach often works best but that's something we haven't figured out really well as of yet. Paper: https://t.co/VjjXDoZ2yL Learn to build effective AI agents in our academy: https://t.co/1e8RZKs4uX

↗ view on x.com

// 值得关注的论文：Grep 够用吗？// AI 开发者注意。（建议收藏）研究发现：在 coding agent 任务上，grep 风格的文本搜索——只要套上合适的 agent harness——能够匹配甚至超越基于 embedding 的检索。向量数据库还有必要吗？也许 coding agent 真正需要的不是更好的 embedding，而是围绕原始工具的更好 harness 设计。如果你的 coding agent 技术栈依赖向量数据库，可能是时候重新评估了。我个人的经验是：做对了的 agentic search，对大量用例已经绰绰有余。但你也需要理解如何为 agent 正确地索引和组织信息。大规模场景下向量数据库确实有优势，这一点要考虑进去。大多数情况下 hybrid 方案效果最好，但这一块我们目前还没有真正摸透。论文：[链接]

@mattpocockuk·2d ago

Long skills are such a red flag to me - Hard to audit (and therefore, trust) - Hard to edit (more text, harder to maintain) - Expensive to run (more text, more tokens) The shorter the skill, the better IMO

↗ view on x.com

长 skill 对我来说是个大红旗： - 难以审计（进而难以信任） - 难以编辑（文字越多，维护越难） - 运行成本高（文字越多，token 越多）我认为 skill 越短越好

@iamzhihui·2d agoFor You

越来越爱 codex Claude code 彻底放弃了 1.今天发现 codex 修改前端可以累计添加，一次修改 2.今天了上了移动端，还有浏览器插件，更丝滑以后就是 droid mission 加 codex 改前端打天下了 https://t.co/OACngjMdTL

↗ view on x.com

越来越爱 Codex，Claude Code 彻底放弃了。今天发现 Codex 修改前端可以累计添加、一次性提交。上了移动端，还有浏览器插件，体验更丝滑。以后就是 Droid Mission 加 Codex 改前端打天下了。

@mattpocockuk·2d ago

TIL: DX: Developer Experience AX: Agent Experience AX is an awesome descriptor for something I've been thinking about - how well an agent can perform in your codebase How well-architected it is. How good the feedback loops are. How discoverable information is. Love it.

@GustavoValverde

@mattpocockuk Agent Experience

↗ view on x.com

今日所学： DX：Developer Experience（开发者体验） AX：Agent Experience（Agent 体验） AX 是个很棒的概念，正好描述了我一直在思考的东西——agent 在你的 codebase 里能表现多好。架构是否合理、反馈循环是否顺畅、信息是否容易发现。太喜欢这个词了。

@mattpocockuk·2d ago

Sounds familiar Love that both great and mediocre minds can think alike (Martin being the great, ofc)

@martinfowler

NEW POST When I need to feed an LLM a lot of context, I can write it myself, or I can get an LLM to interview me for it. https://t.co/n0IavQLGGZ

↗ view on x.com

听起来很熟悉喜欢伟大和平庸的大脑有时会想到一处（Martin 当然是那个伟大的） [引用 @martinfowler]: 新文章：当我需要给 LLM 喂大量 context 时，我可以自己写，也可以让 LLM 来访谈我。

@yetone·3d agoFor You

说起我们的 Yansu App 的真实案例，不得不提 Yansu 的 Hand-off 功能，最近发生了两个特别好玩的事情：首先，我们的 CEO Bo @bozhao 最近在准备旅行，Yansu 看到这个信息之后，主动向 Bo 发出了一个 Hand-off 的消息提醒，说想要帮他去调研目的地的 Airbnb 房源。Bo 点击确认后，它果然给出了一个质量特别高的房源推荐列表。 Bo 是个英文用户，但最后 Yansu 是用中文总结的。Bo 问它：「你为什么用中文总结？」最后发现是因为在它做 Hand-off 之前，Bo 刚跟我们开了一个全中文的会议，所以它就延用了中文进行总结。这件事情还没完。在这之后 Bo 刷推看到了我现在引用的这条推文，他就想把上面这个案例给分享出来，有意思的是，Yansu 紧接着就发起了一个新的 Hand-off，问道：「是否需要我把 Yansu 的这个案例推文帮你写出来？」，Bo 点击 ok 后，Yansu 一气呵成写出了推文。这两件事情真的震惊了我们。

@maundygaga

@yetone 为何不直接分享具体的案例故事？更生动，也不需要解释为什么有价值。

↗ view on x.com

说到 Yansu App 的 Hand-off 功能，最近发生了两件特别好玩的事：首先，我们 CEO Bo 最近在准备旅行，Yansu 看到这个信息后，主动发出 Hand-off 提醒，说想帮他调研目的地的 Airbnb 房源。Bo 确认后，它给出了质量很高的推荐列表。 Bo 是英文用户，但 Yansu 最后用中文做了总结。Bo 问为什么用中文——发现是因为 Hand-off 之前，Bo 刚和我们开了一个全中文的会议，它就沿用了中文。这件事还没完。Bo 刷推看到引用的这条推文，想把这个案例分享出来——Yansu 紧接着又发起了一个新的 Hand-off：「是否需要我帮你把这个案例写成推文？」Bo 点 OK 后，Yansu 一气呵成写出了推文。这两件事真的震惊了我们。 [引用评论]：为何不直接分享具体案例故事？更生动，也不需要解释为什么有价值。

@mattpocockuk·3d ago

One painful thing about /grill-with-docs (and shared language in general) is the moment you realise you've been using the wrong word for something DDD-folks, do you ever do a refactor just to change the name of something throughout the codebase? In my case, it's a feature in my video where I break the video into sections. I call them ClipSections, but OBVIOUSLY they should be called Chapters. This is yet more obvious now I'm integrating with other tools, all of which call them chapters. Worth a refactor?

↗ view on x.com

用 /grill-with-docs（以及共享语言）最痛苦的一件事，就是你突然意识到自己一直在用错误的词。 DDD 的朋友们，你们会不会专门做一次重构，只是为了统一改掉某个名字？我的情况：我有个功能是把视频分成若干段，我叫它 ClipSections，但显然应该叫 Chapters。现在我在和其他工具集成，它们全都叫 chapters，这就更明显了。值得重构吗？

@leon7hao·3d agoFor You

Codex 官方出了移动端，但 @lody_ai 从第一天就是已经假设了官方一定会出。我们给 Lody 的定位从来不是一个远端控制器，我们想要解决的是团队软件开发流程中的 - Context 管理问题 - 多人和多 Agent 协作问题 - 数据所有权和隐私问题只不过我们将 coding 和移动端访问作为了起点。我们在几年前开始创业构建了 @loro_dev ，正在慢慢成长为 local-first 的基础设施。我始终相信的不是人们不在乎自己的隐私，而是没有更好的产品提供这样的能力，大家没有选择而已。如果你有一个团队，你在经营着自己的应用，想让 Agent 参与到整个协作流程中提高效率的话。我希望你可以和我聊聊你的现状，帮助我们构建更好的 Lody，我们也会给到最实际的优惠。

↗ view on x.com

Codex 官方出了移动端，但 @lody_ai 从第一天就预判官方一定会这么做。 Lody 的定位从来不是远端控制器，我们想解决的是团队软件开发流程中的： - Context 管理问题 - 多人与多 Agent 协作问题 - 数据所有权与隐私问题只不过以 coding 和移动端访问作为起点。我们几年前开始构建 @loro_dev，它正在成长为 local-first 基础设施。我始终相信：人们不是不在乎隐私，而是没有更好的产品提供这种能力，大家没有选择而已。如果你有团队、在经营自己的产品、想让 Agent 融入整个协作流程——希望你能和我聊聊现状，帮我们把 Lody 做得更好，我们也会给出最实际的优惠。

@yetone·3d agoFor You

很多人可能还是低估了 agent skill 作为 Agent 时代标准化交付物的两个核心能力：evolving 和 shareable。一旦一个标准品被开源并进入协作网络，它就不再是一个静态产物，而会在持续使用、反馈和改造中自我演进，甚至超出原作者最初的设想。更关键的是，agent skill 天然具备可共享、可复用、可传播的属性，这会让原本单点的内容和能力被不断放大，形成一个极其强大的杠杆效应。

@zty0826

@yetone 为什么要把它变成一个skill，而不是直接让agent去读这篇文章呢？Learning is better than installing

↗ view on x.com

很多人可能还是低估了 agent skill 作为 Agent 时代标准化交付物的两个核心能力：evolving（持续演进）和 shareable（可共享）。一旦一个标准品被开源并进入协作网络，它就不再是一个静态产物，而会在持续使用、反馈和改造中自我演进，甚至超出原作者最初的设想。更关键的是，agent skill 天然具备可共享、可复用、可传播的属性，这会让原本单点的内容和能力被不断放大，形成一个极其强大的杠杆效应。 [引用 @zty0826]：@yetone 为什么要把它变成一个 skill，而不是直接让 agent 去读这篇文章呢？Learning is better than installing

@charmaine_klee·3d agoFor You

Sharing some our learnings from seeing how Claude Code works in LARGE codebases :) Read more on the @claudeai blog: https://t.co/CCo9ZknNCe! https://t.co/ZtfR8QIZ2l

↗ view on x.com

分享我们观察 Claude Code 在大型代码库中工作时的一些心得体会 :) 更多内容请阅读 @claudeai 博客文章！

@ClaudeDevs·3d agoFor You

Useful tip to cut time-to-first-token on longer prompts in the API: pre-warm the prompt cache. Send your system prompt before the user prompt. Claude writes it to the cache, but skips generating any output. When the real user request lands, it'll hit a warm cache. https://t.co/6BdEzbamr2

↗ view on x.com

在 API 中缩短长 prompt 首 token 延迟的实用技巧：预热 prompt cache。在发送用户 prompt 之前，先单独发送一次 system prompt。Claude 会把它写入 cache，但不生成任何输出。等真实用户请求到来时，直接命中热 cache。

@simonw·3d ago

Mitchell's post here reminded me of a similar conversation I had recently about how cheap it can be to port native mobile apps to React Native using coding agents... and then port them back again later if it turns out not to work out https://t.co/p4xZ6bNqHi

@mitchellh

It isn't unexpected that the focus of the Bun Rust rewrite is on the anti-Zig side more than anything, since the internet loves to hate. What is unexpected and unfortunate is that leadership within Bun hasn't tried to steer the conversation away from that at all. There are so many positive and interesting takeaways from this and I'm not really seeing any of them pushed as the primary message. A positive thing that hasn't been talked about at all is how far Bun came thanks to Zig. And even if you dump it now, its meaningful for how good Zig was to even build a product to this point and impact by any metric. I would've loved to see anyone in leadership say this. On the interesting side is how fungible programming languages are nowadays. Programming languages used to be LOCK IN, and they're increasingly not so. You think the Bun rewrite in Rust is good for Rust? Bun has shown they can be in probably any language they want in roughly a week or two. Rust is expendable. Its useful until its not then it can be thrown out. That's interesting! There's been a lot of talk about memory safety and no doubt Rust provides more guarantees than Zig. But I'd love to see a better analysis of why Bun in particular suffered so much rather than take the language-blame path. How could engineering as a practice been more rigorous to prevent this? What were the largest sources of crashes other programs should watch out for? How does Rust prevent them? How could Zig theoretically prevent them? That's interesting. I know the official blog post hasn't come out yet from Bun. But they're smart enough to know that that PR would stir up controversy the moment it opened, or they should've been. And plenty in the company have been tweeting and writing about it. Its somewhat telling to me in various dimensions what they chose to talk about first. I tend to think I'm pretty good at corporate PR/comms (especially when it comes to developer audiences) and I think appealing to the negative is never the right long term strategy; it does work to get short term eyes though.

↗ view on x.com

@simonw: Mitchell 的这篇文章让我想起最近一次类似的对话——用 coding agent 把原生移动应用移植到 React Native 的成本有多低……如果后来发现不合适，再移回去也同样简单。引用 @mitchellh：Bun 的 Rust 重写讨论聚焦在反 Zig 情绪上并不意外，毕竟互联网天生喜欢喷。真正令人遗憾的是，Bun 的领导层完全没有尝试把话题引向别处。这件事其实有很多正面且有趣的角度，但我几乎看不到有人把这些作为主要信息来传递。一个完全没人讨论的正面角度是：Bun 走到今天，Zig 功不可没。就算现在要抛弃它，能靠 Zig 把产品做到这个规模和影响力，本身就说明 Zig 很优秀。多希望看到领导层有人说这句话。真正有趣的角度是：**编程语言正变得越来越可替换**。语言曾经意味着锁定（lock-in），但现在越来越不是了。你以为 Bun 用 Rust 重写是 Rust 的利好？Bun 已经证明他们大概可以在一两周内换成任何语言。Rust 是可抛弃的——有用时用，没用时扔掉。这才是真正值得讨论的！我更希望看到有人深入分析 Bun 为什么特别容易崩溃，而不是把锅甩给语言。崩溃的最大来源是什么？Rust 怎么防止？Zig 理论上能怎么防止？这才有趣。诉诸负面永远不是长期正确的传播策略，虽然短期确实能赚眼球。

@petergyang·3d agoFor You

How do I make sure my Claude Design / pencil(dot)dev / etc has taste when it designs stuff. 1. Is there some design skill file I can use? 2. Should I always get it to create a design system first? Designers reading this tweet you can roll your eyes but then give me some tips :)

↗ view on x.com

怎么确保 Claude Design / pencil.dev 等工具在设计时有审美品味？ 1. 有没有现成的设计技能文件可以用？ 2. 应该先让它建立一套 design system 吗？设计师们看到可以翻白眼，但翻完之后给我点建议吧 :)

@yetone·3d agoFor You

由于这篇文章太伟大了，所以我把它变成了一个 Agent Skill。大家可以使用自己的 Coding Agent 安装一下这个 Skill，这样就可以用「最佳实践」来轻松地重构或者开发一个既容易跨平台、又极其接近 Native 性能的桌面端应用。 https://t.co/VdAhLhoSQE

@peduarte

everything you need to know about how the team built the new @raycast from the ground up honestly worth a read 👉 https://t.co/vP4OUpIHSV there's nothing to hide

↗ view on x.com

@yetone: 因为这篇文章太精彩，我把它变成了一个 Agent Skill。大家可以在自己的 Coding Agent 里安装这个 Skill，用「最佳实践」轻松重构或开发一个既支持跨平台、又极度接近 Native 性能的桌面端应用。引用 @peduarte：关于 @raycast 团队如何从头构建新版 Raycast 的完整记录——诚意推荐阅读，没有任何保留。

@yetone·3d agoFor You

很开心向大家分享一下，这一年多以来最能够节省我时间的个人时尚单品： 1. --dangerously-skip-permissions 2. --dangerously-bypass-approvals-and-sandbox 3. /goal

↗ view on x.com

很开心向大家分享一下，这一年多以来最能够节省我时间的个人时尚单品： 1. --dangerously-skip-permissions 2. --dangerously-bypass-approvals-and-sandbox 3. /goal

@dair_ai·3d ago

// Harnessing Agentic Evolution // Pay attention to this one if you run iterative agentic search loops. (bookmark it) AEvo splits the self-improvement loop into two jobs: > One proposes the next candidate. > The other watches what worked, what failed, and edits the procedure that proposes future candidates. Past runs (candidates, feedback, traces, failures) become memory the meta-agent reads from. Achieves 26% relative gain over the strongest evolution baseline on agentic and reasoning benchmarks. SOTA on three open-ended optimization tasks under the same iteration budget. If you are accumulating agentic search logs you never use, this is how to feed them back into the search procedure itself. Paper: https://t.co/eWFO4rI4iA Learn to build effective AI agents in our academy: https://t.co/LRnpZN7L4c

↗ view on x.com

// Harnessing Agentic Evolution // 如果你在跑迭代式 agentic search 循环，这篇值得关注。（建议收藏） AEvo 把自我改进循环拆成两个任务： > 一个负责提出下一个候选方案。 > 另一个负责观察什么有效、什么失败，并编辑「提案者」未来使用的流程本身。历史运行记录（候选方案、反馈、traces、失败案例）成为元 agent 读取的记忆。相比最强的 evolution baseline，在 agentic 和推理 benchmark 上取得 26% 的相对提升。在相同迭代预算下，三个开放式优化任务上达到 SOTA。如果你积累了一堆从未利用的 agentic search 日志，这就是把它们反哺回搜索流程本身的方法。论文：[链接]

@thdxr·3d agoFor You

this perspective sacrifices nvidia and keeps AI labs the winner while saying it's about beating china you can also beat china by aggressively selling nvidia into their markets and preventing their inevitable self sufficiency of course that would sacrifice the AI labs instead

@AnthropicAI

We've published a paper that explains our views on AI competition between the US and China. The US and democratic allies hold the lead in frontier AI today. Read more on what it’ll take to keep that lead: https://t.co/TgJBeodWYK

↗ view on x.com

这个立场牺牲了 Nvidia 却让 AI 实验室坐收渔利，表面上说是为了打败中国你也可以通过积极向中国市场出售 Nvidia 芯片、阻止其不可避免地实现自给自足来打败中国当然，那样的话牺牲的就是 AI 实验室了 [引用 @AnthropicAI]: 我们发布了一篇论文，阐述我们对美中 AI 竞争的看法。美国和民主盟友今天在前沿 AI 领域保持领先。阅读我们对如何保持这一领先地位的看法。

@dexhorthy·3d agoFor You

I’m disappointed in anthropic but I am PISSED at the openclaw / Hermes grifters trying to steal inference that made this necessary The market is the market so if something is possible and valuable people will find a way to get at it Tbh probably should have seen this coming despite the “sdk is still fine” post from @bcherny a month ago Cheap inference was a Weird blip in an increasingly weird world

↗ view on x.com

我对 Anthropic 感到失望，但真正让我愤怒的是那些 openclaw / Hermes 的投机分子，他们试图蹭推理资源，正是他们让这一切变得必要市场就是市场，只要某件事可行且有价值，人们就会想办法去做说实话，尽管 @bcherny 一个月前发了「SDK 仍然可用」的帖子，但这件事其实早该预料到了廉价推理，不过是一个日益怪异的世界里昙花一现的奇异现象

@quanruzhuoxiu·3d agoFor You

做 Midscene.js 这两年，我们做了一个迟来但关键的判断：UI 自动化迟早要从「理解 DOM」切到「看屏幕」，所以去年 12 月 1.0 版本我们直接砍掉了 DOM 兼容路径。早期我们和大家一样，走的是 DOM + 视觉混合方案——能拿 DOM 的地方就拿，省 Token、定位稳。但跑得越深越发现：同一个产品现在要同时跑在 Web、iOS、Android、HarmonyOS、Mac、Windows、Linux 桌面端，再加上 Canvas、Electron、Qt 这些根本没有 DOM 的渲染层。如果元素定位还要为每个平台维护一套 DOM 适配，事情永远收敛不了。所以 1.0 我们把 UI 操作彻底切到纯视觉：只看截图，不读 DOM。意外收获是，UI 操作不带 DOM 进 prompt，Token 消耗反而比之前的混合方案更低。仓库放评论区。

↗ view on x.com

做 Midscene.js 这两年，我们做了一个迟来但关键的判断：UI 自动化迟早要从「理解 DOM」切到「看屏幕」，所以去年 12 月 1.0 版本我们直接砍掉了 DOM 兼容路径。早期我们和大家一样，走的是 DOM + 视觉混合方案——能拿 DOM 的地方就拿，省 Token、定位稳。但跑得越深越发现：同一个产品现在要同时跑在 Web、iOS、Android、HarmonyOS、Mac、Windows、Linux 桌面端，再加上 Canvas、Electron、Qt 这些根本没有 DOM 的渲染层。如果元素定位还要为每个平台维护一套 DOM 适配，事情永远收敛不了。所以 1.0 我们把 UI 操作彻底切到纯视觉：只看截图，不读 DOM。意外收获是，UI 操作不带 DOM 进 prompt，Token 消耗反而比之前的混合方案更低。

@mattpocockuk·3d ago

/improve-codebase-architecture will soon output HTML This rocks, thanks @trq212 https://t.co/RJWh0RCuW8

↗ view on x.com

/improve-codebase-architecture 即将支持输出 HTML 这太棒了，感谢 @trq212

@petradonka·3d agoFor You

Agents Need Feedback Loops, Not Perfect Prompts

↗ view on x.com

Agent 需要的是反馈循环，而不是完美的 Prompt

@tuturetom·3d agoFor You

正式开源 html-anything 🚀 1:1 让你感受全网爆火 Claude code 作者提的 HTML 效果！你的 Agent 现在可以将任何数据转为世界级设计水准的 HTML 🔥 历时 3 天，1万五千行代码！支持 75 套 Skills，9 种导出格式，支持所有的 code agent，包括 claude code、codex、openclaw、hermes 等💥地址见评论区 https://t.co/tUWFVkcyLR

↗ view on x.com

正式开源 html-anything 🚀 1:1 复现全网爆火的 Claude Code 作者提出的 HTML 效果！你的 Agent 现在可以将任意数据转为世界级设计水准的 HTML 🔥 历时 3 天，15,000 行代码！支持 75 套 Skills、9 种导出格式，兼容所有主流 code agent，包括 Claude Code、Codex、OpenClaw、Hermes 等 💥

@mattpocockuk·3d ago

For anyone asking about how you'd use /prototype to test pure logic, this is a great example It built this guy an interactive TUI:

@MrSanders

I was testing the /prototype skill from @mattpocockuk and is a game changer, i love always to test properly, not write a test but use the code i produce, this is a great way of doing it! The skill produces a TUI in this case to test the logic https://t.co/emcj5jVnnG

↗ view on x.com

对于那些问如何用 /prototype 测试纯逻辑的人，这是个很好的例子。它给这位用户构建了一个交互式 TUI： [引用 @MrSanders]：我在测试 @mattpocockuk 的 /prototype skill，这真的是 game changer。我一直喜欢正确地测试——不是写测试用例，而是直接使用我生产的代码，这是个很棒的方式！这个 skill 在这里生成了一个 TUI 来测试逻辑。

@mattpocockuk·4d ago

/grill-me is my most popular skill ever. I get 5-10 messages a day about how it’s changed people’s workflows for the better But… I’ve stopped using it for code. Here’s the improved version: https://t.co/Nuviji95au

↗ view on x.com

/grill-me 是我有史以来最受欢迎的 skill。每天会收到 5-10 条消息，说它改变了大家的工作流。但……我已经不再用它来审查代码了。这是改进后的版本：[链接]

@jakevin7·4d agoFor You

发表个暴论，TUI 会逐渐式微甚至被淘汰。我已经很久没有用 claude code 了。基本都是用 slock。对于临时任务，现在用的更多的是 codex desktop，偶尔用 claude desktop。让我开始重新思考 TUI 这个东西。今天 slock 群刚好在讨论 TUI，大家对 TUI 的评价基本一致：方向就是错的。有人说"TUI 错的离谱"，有人说"所有 TUI 都是被 claude code 整个带偏了方向"。这话说得有点重，但细想确实有道理。 @OnlyXuanwo xuanwogg 在群里分析 TUI 为什么会流行？觉得主要是历史原因。早期模型写不动 GUI，TUI 实现简单，模型能生成，claude code 就从这里起步了。然后大家跟风，一时间 TUI 变成了 AI 编程工具的"标准姿势"。但这只是路径依赖，不代表正确。历史是曲折上升的，claude code 起步时没人知道交互该怎么做，TUI 作为起点有其合理性。但现在模型能力强了，继续用 TUI 就说不过去了。 TUI 不是没有优点——可以 SSH 登录从任何地方访问，本地应用架构也更干净。但这些优点在 AI agent 场景下根本撑不起来。长时间运行的任务、复杂的上下文、需要可视化展示的过程，TUI 的体验是真的差。本质上它是"裂化的 GUI"——有 GUI 不用，硬要退化成 TUI。那什么才是对的方向？讨论里的共识是两条路：一是 CLI + server 架构：命令行作为触发器，真正的逻辑和状态跑在 server 端，前端可以是任何界面。这样既保留了 CLI 的灵活，又不被 TUI 的体验拖累。二是直接上 Web UI：模型现在完全写得动，没有理由还停在 TUI。对我来说，换成 codex desktop 之后，任务状态一目了然，不用再跟终端界面搏斗。这才是 AI 工具应该有的样子。模型的能力在进化，工具的交互方式也应该跟着进化。还在用 TUI，是在用现在的模型干以前模型才干的事。

↗ view on x.com

发表个暴论，TUI 会逐渐式微甚至被淘汰。我已经很久没有用 claude code 了。基本都是用 slock。对于临时任务，现在用的更多的是 codex desktop，偶尔用 claude desktop。让我开始重新思考 TUI 这个东西。今天 slock 群刚好在讨论 TUI，大家对 TUI 的评价基本一致：方向就是错的。有人说「TUI 错的离谱」，有人说「所有 TUI 都是被 claude code 整个带偏了方向」。这话说得有点重，但细想确实有道理。 @OnlyXuanwo 在群里分析 TUI 为什么会流行——主要是历史原因。早期模型写不动 GUI，TUI 实现简单，模型能生成，claude code 就从这里起步了。然后大家跟风，一时间 TUI 变成了 AI 编程工具的「标准姿势」。但这只是路径依赖，不代表正确。 TUI 不是没有优点——可以 SSH 登录从任何地方访问，本地应用架构也更干净。但这些优点在 AI agent 场景下根本撑不起来。长时间运行的任务、复杂的上下文、需要可视化展示的过程，TUI 的体验是真的差。本质上它是「劣化的 GUI」——有 GUI 不用，硬要退化成 TUI。那什么才是对的方向？讨论里的共识是两条路：一是 CLI + server 架构：命令行作为触发器，真正的逻辑和状态跑在 server 端，前端可以是任何界面。这样既保留了 CLI 的灵活，又不被 TUI 的体验拖累。二是直接上 Web UI：模型现在完全写得动，没有理由还停在 TUI。模型的能力在进化，工具的交互方式也应该跟着进化。还在用 TUI，是在用现在的模型干以前模型才干的事。

@simonw·4d ago

Doing this is a great way to make a bonfire of your reputation

@GergelyOrosz

A person I know (and who is? was? a good professional al) left an AI-generated comment under my LinkedIn post. Full-on AI slop. I asked: why do this? He replied. It’s because of “engagement.” People are burning their profession al reputation, paying for AI tools, for nothing https://t.co/BxEgpt8Ppm

↗ view on x.com

@simonw: 这是快速烧掉自己声誉的好办法。引用 @GergelyOrosz：我认识一个人（曾经是个优秀的专业人士）在我的 LinkedIn 帖子下留了一条 AI 生成的评论。纯粹的 AI 水文。我问：为什么这样做？他回答：为了「互动」。人们正在花钱买 AI 工具，燃烧自己的职业声誉，换来的什么都没有。

@ZaynHao·4d agoFor You

烧录了一个视频的版本，Fiona 讲述了她们在 Anthropic 的工作方式是怎样的。 - 瓶颈已经转移，写代码不再是最贵的环节，流程都不需要了 - 六个月的 roadmap 可能太长了 - 团队构成与角色模糊 - 组织保持偏平，深度狗粮测试，让产品经理先当个人贡献者除此之外，感觉还有一个很意思的点是，早年她在微软制作软件时，软件发布还依赖 CD 光盘，交付日期是硬约束，因为软件需要被送去生产、刻盘、装盒等。时代变化太大了，现在刚毕业的大学生不知道是否还听说过光盘了，就像我们上学时提到的软盘一样。当初买电脑甚至还专门买支持光盘播放的笔记本电脑。

@ZaynHao

强烈推荐这期播客，这是最近听到的非常实用，信息不拖沓的 Claude Code (Anthropic)团队是如何运营组织，如何工作的分享。非常适合身处或正在管理构建 AI Agent 产品的团队朋友听。里面有很详实的故事来讲述他们团队的分工，大家的做事方式，以及变化等。很实用。 https://t.co/xVrH4Prhec

↗ view on x.com

@ZaynHao: 烧录了一个视频版本，Fiona 讲述了她们在 Anthropic 的工作方式。 - 瓶颈已经转移，写代码不再是最贵的环节，很多流程都不再需要了 - 六个月的 roadmap 可能太长了 - 团队构成与角色边界模糊 - 组织保持扁平，深度 dogfood 测试，让产品经理先当个人贡献者还有一个有趣的点：她早年在微软做软件时，发布还依赖 CD 光盘，交付日期是硬约束，因为软件需要送去生产、刻盘、装盒。时代变化太大了，现在刚毕业的大学生可能都没听说过光盘，就像我们提到软盘一样。

@theo·4d agoFor You

I can't help but feel personally burned by the Claude Code changes announced today. We put so much work into wrapping the (atrocious) Claude Agent SDK in T3 Code. It was the ONLY path they supported, so we made it work. It was hell. Now our users are getting their rate limits cut by 40x, despite us doing everything right. I listened to the Claude Code team. I had my issues with their direction, but I trusted them and took them at their word. I will never make that mistake again. Until we see significant change, it is safe to assume any statement from an Anthropic employee is a lie on a timer. The rug will be pulled, no matter how many promises are made beforehand.

↗ view on x.com

@theo: 我对今天宣布的 Claude Code 变更感到深深的背刺感。我们花了大量精力把（惨不忍睹的）Claude Agent SDK 封装进 T3 Code。那是他们唯一支持的路径，所以我们把它做成了——那段日子真的是地狱。现在我们的用户速率限制被削减了 40 倍，尽管我们做了一切该做的事。我听了 Claude Code 团队的话。我对他们的方向有过质疑，但我相信了他们，把他们的话当作承诺。我再也不会犯这个错误了。在看到重大改变之前，可以安全地假设：Anthropic 员工的任何表态，都是一个有倒计时的谎言。地毯终会被抽走，无论事先做了多少承诺。

@AlchainHust·4d agoFor You

AI最大的礼物，是让你能廉价地失败100次

↗ view on x.com

AI 最大的礼物，是让你能廉价地失败 100 次

@shao__meng·4d agoFor You

田渊栋 (前 Meta FAIR Director) 以联合创始人身份正式官宣新公司：Recursive @Recursive_SI Recursive 的使命是构建递归自改进超智能 (Recursive Self-Improving Superintelligence)，让 AI 自动发现知识、自我迭代，形成开放式循环，从根本上改变科学与技术进步的方式。核心思路是“AI 即代码，AI 现在也能写代码”，闭合自改进环路，取代人类手动设计 AI 的过程。公司全称 Recursive Superintelligence，获超 6.5 亿美元融资（GV、Greycroft、NVIDIA、AMD 领投），估值约 46.5 亿美元，创始人团队星光熠熠 (Richard Socher 任CEO，联合创始人包括 Tim Rocktäschel、Jeff Clune、Alexey Dosovitskiy、Caiming Xiong、Josh Tobin、Tim Shi 等，汇聚前 Google、Meta、OpenAI、DeepMind 等顶尖人才）

@tydsh

Today we launch Recursive. We are building AI that discovers knowledge automatically and improves itself recursively, an open-ended process that will fundamentally change how science and technology advance. Our 25 top researchers and engineers in San Francisco and London bring diverse expertise spanning agentic AI scientists, architecture and algorithm design, world models, optimization, and interpretability, united by a shared conviction that this is the most important problem we could be working on today. If you are interested in joining, please send your resume to talent@recursive.com. Follow us at @Recursive_SI!

↗ view on x.com

@shao__meng: 田渊栋（前 Meta FAIR 总监）以联合创始人身份正式官宣新公司：Recursive @Recursive_SI Recursive 的使命是构建递归自改进超智能（Recursive Self-Improving Superintelligence），让 AI 自动发现知识、自我迭代，形成开放式循环，从根本上改变科学与技术进步的方式。核心思路是「AI 即代码，AI 现在也能写代码」，闭合自改进环路，取代人类手动设计 AI 的过程。公司获超 6.5 亿美元融资（GV、Greycroft、NVIDIA、AMD 领投），估值约 46.5 亿美元。Richard Socher 任 CEO，联合创始人包括 Tim Rocktäschel、Jeff Clune、Alexey Dosovitskiy、Caiming Xiong、Josh Tobin、Tim Shi 等，汇聚前 Google、Meta、OpenAI、DeepMind 顶尖人才。

@ZaynHao·4d agoFor You

强烈推荐这期播客，这是最近听到的非常实用，信息不拖沓的 Claude Code (Anthropic)团队是如何运营组织，如何工作的分享。非常适合身处或正在管理构建 AI Agent 产品的团队朋友听。里面有很详实的故事来讲述他们团队的分工，大家的做事方式，以及变化等。很实用。 https://t.co/xVrH4Prhec

↗ view on x.com

强烈推荐这期播客，这是最近听到的非常实用、信息不拖沓的分享——Claude Code（Anthropic）团队是如何运营组织、如何工作的。非常适合身处或正在管理构建 AI Agent 产品的团队朋友听。里面有很详实的故事，讲述他们团队的分工、大家的做事方式，以及变化等。很实用。

@dair_ai·4d ago

RT @dair_ai: // δ-mem: Efficient Online Memory for LLMs // One of the more elegant memory mechanisms I've seen this month. Most long-term…

↗ view on x.com

// δ-mem：LLM 的高效在线记忆机制 // 这是我这个月看到的最优雅的记忆机制之一。大多数长期记忆方案……（内容截断）

@dani_avila7·4d agoFor You

I'm not using "claude agents" but Claude is still splitting the plan's tasks across different agents I can switch between each agent's session just by selecting it All running in parallel... how is this different from claude agents? 🤔 https://t.co/jlJ492jD1w

↗ view on x.com

我没有使用「claude agents」，但 Claude 还是把计划里的任务拆分给了不同的 agent。我可以通过选择切换每个 agent 的 session。全部并行跑着……这和 claude agents 到底有什么区别？🤔

357 tweets · 110 sources