Chronology

Timeline

2026

May

May 12, 2026 · Read

After weeks of severe insomnia, I used AI to build an iOS app that exported HealthKit data and ran multivariate regression to find the root cause—late-night...

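May 12, 2026 · Read

How AI Caused and Fixed My Insomnia

As a heavy AI user, after a long bout of severe insomnia I skipped the usual "eliminate one variable at a time" route. Instead I used AI to write an iOS app that exports HealthKit data, then ran a multivariate regression to find the real cause: intense thinking with AI late at night. This post describes how AI supplied execution support across the whole chain, and reflects on how the cost structure of human judgment and cognition reshapes our decision paths in the AI era.

May 12, 2026 · Original

/goal Means Asking a Blind Referee to Judge: One Metaphor That Unifies Three Practices

The most common failure mode of /goal is not that the agent isn't trying hard enough. It is that the condition is written so that a referee who cannot see the scene at all has no way to judge it. Once that metaphor is in place, the three practices of PLANS.md, stateful conditions, and the screen-output principle all reduce to one.

April

April 29, 2026 · Read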

Ghostty Is Leaving GitHub

Writing this makes me irrationally sad, but Ghostty will be leaving GitHub.

April 25, 2026 · Read

An update on recent Claude Code quality reports

Over the past month, we’ve been looking into reports that Claude’s responses have worsened for some users. We’ve traced these reports to three separate changes that affected Claude Code, the Claude Ag

April 25, 2026 · Read

Multi-Agents: What's Actually Working

http://x.com/i/article/2046690715657478145

April 16, 2026 · Read

Simdutf Can Now Be Used Without libc++ or libc++abi

As of this PR, simdutf can be used without libc++ or libc++abi.

April 14, 2026 · Read

GitHub - microsoft/markitdown: Python tool for converting files and office documents to Markdown.

Python tool for converting files and office documents to Markdown. - microsoft/markitdown

April 14, 2026 · Read

The Center Has a Bias

Why a measured position on AI tends to lean towards actually trying it.

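April 13, 2026 · Read

Prompt Caching as a First-Class Constraint in Harness Engineering

A counterintuitive PR. In early 2026, Anthropic removed Pro subscribers' login support for third-party harnesses, so every third-party tool had to go through paid API access. Against this backdrop, one of Claude Code's core authors submitted a seemingly counterintuitive PR to OpenClaw (OpenClaw #58036): when the conversation history needs to be compressed (comp

April 13, 2026 · Read

Do Models Have Emotions Behind Them, and Do Those Emotions Affect Behavior?

Picture this scenario. You are using AI to write code and ask it to implement a function, but the tests just won't pass. The AI tries three times, five times, seven times, and fails each time. Then, on the eighth attempt, it suddenly takes a shortcut: it bypasses the test logic and hard-codes its way to a passing test. You might say this is just a bug, the model acting up. But Anthropic researchers found something subtler. In the reasoning steps just before the model took the shortcut, internally it had

April 10, 2026 · Original

Some Friction Is Necessary, and Some Habits You Have to Build Yourself

I recently finished setting up some infrastructure, and overall execution keeps getting faster, but I've noticed my attention is getting more scattered too. I'm increasingly impatient for results, and when I hit a blocker I don't slow down to examine the problem itself.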

April 9, 2026 · Read

The Git Commands I Run Before Reading Any Code

Five git commands that tell you where a codebase hurts before you open a single file. Churn hotspots, bus factor, bug clusters, and crisis patterns.

April 9, 2026 · Read

Scaling Managed Agents: Decoupling the brain from the hands

Anthropic is an AI safety and research company that's working to build reliable, interpretable, and steerable AI systems.

April 9, 2026 · Read

Press Release April 8th | EARENDIL

Earendil is a public benefit corporation crafting software and open protocols to strengthen human agency, bridge division, and cultivate lasting joy.

April 9, 2026 · Read

A Reflection on our Announcement Today | EARENDIL

Earendil is a public benefit corporation crafting software and open protocols to strengthen human agency, bridge division, and cultivate lasting joy.

April 8, 2026 · Read

Sparse Rewards: Enlightenment and Reinforcement Learning

A small primer on Reinforcement Learning. In AI, there is a phase of training models called Reinforcement Learning. By this point, the model has already learned about the world: it knows what a book

April 8, 2026 · Read

The Building Block Economy

http://x.com/i/article/2041548775328829440

April 8, 2026 · Read

The Great Convergence

http://x.com/i/article/2039731611814764545

April 8, 2026 · Read

I've sold out

2026-04-08 What a nice WebGL shader. Look at draining your battery. Why would you do that? "It's like poetry, it rhymes" - the great George Lucas "I tell you what I want, what I really, reall

April 8, 2026 · Read

GitHub - KeygraphHQ/shannon: Shannon Lite is an autonomous, white-box AI pentester for web applications and APIs. It analyzes your source code, identifies attack vectors, and executes real exploits to prove vulnerabilities before they reach production.

Shannon Lite is an autonomous, white-box AI pentester for web applications and APIs. It analyzes your source code, identifies attack vectors, and executes real exploits to prove vulnerabilities bef...

April 8, 2026 · Read

Michael Nielsen – How science actually progresses

The true story of Einstein, Newton, and Darwin

April 8, 2026 · Read

The Building Block Economy

The most effective way to build software and get massive adoption is no longer high quality mainline apps but via building blocks that enable and encourage others to build quantity over quality.

April 7, 2026 · Read

Emotion concepts and their function in a large language model

All modern language models sometimes act like they have emotions. They may say they’re happy to help you, or sorry when they make a mistake. Sometimes they even appear to become frustrated or anxious

April 7, 2026 · Read

Prototyping with LLMs

Writing about the big beautiful mess that is making things for the world wide web.

April 7, 2026 · Read

[MODEL] Claude Code is unusable for complex engineering tasks with the Feb updates

Preflight Checklist I have searched existing issues for similar behavior reports This report does NOT contain sensitive information (API keys, passwords, etc.) Type of Behavior Issue Other unexpect...

April 5, 2026 · Read

GitHub - dmtrKovalenko/fff.nvim: The fastest and the most accurate file search toolkit for AI agents, Neovim, Rust, C, and NodeJS

The fastest and the most accurate file search toolkit for AI agents, Neovim, Rust, C, and NodeJS - dmtrKovalenko/fff.nvim

April 5, 2026 · Read

Absurd In Production

Five months of durable execution with just Postgres.

April 5, 2026 · Read

Information and Technological Evolution

I spend a lot of time reading about the nature of technological progress, and I’ve found that the literature on technology is somewhat uneven.

April 3, 2026 · Read

Harnessing Claude’s intelligence

One of Anthropic’s co-founders, Chris Olah, says that generative AI systems like Claude are grown more than they are built. Researchers set the conditions to direct growth, but the exact structure or

April 3, 2026 · Read

RAG is (Not) Dead: How to Think about Building RAG Systems

RAG is Dead. "RAG is dead!", the internet says. "Long live <a roundabout description of RAG>!" The problem? Everything that's being framed as a "RAG Killer" is just another form of RAG. What do I mean? It

April 2, 2026 · Read

Compound Engineering: 3/31/2026

http://x.com/i/article/2038887861387444224

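March

March 31, 2026 · Read

It's a One-Liner, So Why Has the Web Not Managed It in Thirty Years?

On iOS, querying a layout result takes one line of code; on the Web, it requires triggering a relayout of the whole page. That is not because browser engineers are stupid, but because CSS made a declarative architectural choice in 1994. That choice has a higher ceiling, but the cost is that intermediate state cannot be queried. In 2012 Facebook paid hundreds of millions of dollars for not understanding this trade-off. SwiftUI and Jetpack...

March 30, 2026 · Read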

ChatGPT Won't Let You Type Until Cloudflare Reads Your React State. I Decrypted the Program That Does It.

Every ChatGPT message triggers a Cloudflare Turnstile program that runs silently in your browser. I decrypted 377 of these programs from network traffic and found something that goes beyond standard browser fingerprinting. The program checks 55 properties spanning three layers: your browser (GPU, screen, fonts), the Cloudflare network (your city, your IP, your region from edge headers), and the ChatGPT React application itself (__reactRouterContext, loaderData, clientBootstrap). Turnstile doesn

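March 30, 2026 · Original

Most Ideas Don't Matter

Most ideas don't matter. With AI there are too many signal sources, and things that trigger ideas are everywhere. More importantly, AI has flattened the friction between idea and action. Doing too much gets exhausting, not physically, but as a kind of unfocused mental fatigue.

March 29, 2026 · Original

Offer Fewer Solutions, Get Better Ones

I was writing a skill for Claude Code that needed to reference principles distilled from an online document. But the document gets updated and the principles may go stale, so keeping them in sync is a problem.

March 28, 2026 · Read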

Anatomy of the .claude/ Folder

A complete guide to CLAUDE.md, custom commands, skills, agents, and permissions, and how to set them up properly.

March 28, 2026 · Read

The Age of the Amplifier

As we’ve noted more than a few times before, for most of the 20th century AT&T’s Bell Labs was the premier industrial research lab in the US.

March 28, 2026 · Read

GitHub - Yeachan-Heo/oh-my-claudecode: Teams-first Multi-agent orchestration for Claude Code

Teams-first Multi-agent orchestration for Claude Code - Yeachan-Heo/oh-my-claudecode

March 28, 2026 · Read

Run Claude Code programmatically - Claude Code Docs

The Agent SDK gives you the same tools, agent loop, and context management that power Claude Code. It’s available as a CLI for scripts and CI/CD, or as Python and TypeScript packages for full programm

March 28, 2026 · Read

Run prompts on a schedule - Claude Code Docs

Scheduled tasks let Claude re-run a prompt automatically on an interval. Use them to poll a deployment, babysit a PR, check back on a long-running build, or remind yourself to do something later in t

March 28, 2026 · Read

Push events into a running session with channels - Claude Code Docs

A channel is an MCP server that pushes events into your running Claude Code session, so Claude can react to things that happen while you’re not at the terminal. Channels can be two-way: Claude reads

March 28, 2026 · Read

Create custom subagents - Claude Code Docs

Subagents are specialized AI assistants that handle specific types of tasks. Each subagent runs in its own context window with a custom system prompt, specific tool access, and independent permissions

March 28, 2026 · Read

How Claude remembers your project - Claude Code Docs

Each Claude Code session begins with a fresh context window. Two mechanisms carry knowledge across sessions: CLAUDE.md files: instructions you write to give Claude persistent context Auto memory: note

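March 28, 2026 · Original

Guiding the Model Step by Step to Build Better Context

After collaborating with Claude for a while, I noticed something interesting: a more complete first-round prompt does not necessarily produce better results.

March 26, 2026 · Read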

GitHub - letta-ai/claude-subconscious: Give Claude Code a subconscious

Give Claude Code a subconscious. Contribute to letta-ai/claude-subconscious development by creating an account on GitHub.

March 26, 2026 · Read

Thoughts on slowing the fuck down

March 26, 2026 · Read

Claude Code auto mode: a safer way to skip permissions

Anthropic is an AI safety and research company that's working to build reliable, interpretable, and steerable AI systems.

March 26, 2026 · Read

What You (Want to)* Want

Since I was about 9 I've been puzzled by the apparent contradiction between being made of matter that behaves in a predictable way, and the feeling that I could choose to do whatever I wanted. At the time I had a self-interested motive for exploring the question. At that age (like most succeeding ages) I was always in trouble with the authorities, and it seemed to me that there might possibly be some way to get out of trouble by arguing that I wasn't responsible for my actions. I gradually lost hope of that, but the puzzle remained: How do you reconcile being a machine made of matter with the feeling that you're free to choose what you do?

March 26, 2026 · Read

The Need to Read

In the science fiction books I read as a kid, reading had often been replaced by some more efficient way of acquiring knowledge. Mysterious "tapes" would load it into one's brain like a program being loaded into a computer.

March 25, 2026 · Read

lots of folks running expensive sandboxes but really all you need is a filesystem but really you don't even need a fi...

lots of folks running expensive sandboxes but really all you need is a filesystem but really you don't even need a filesystem, you just need a filesystem API that frontends something like a database

March 25, 2026 · Read

Auto mode for Claude Code

Really interesting new development in Claude Code today as an alternative to --dangerously-skip-permissions: Today, we're introducing auto mode, a new permissions mode in Claude Code where Claude makes permission decisions …

March 25, 2026 · Read

Harness design for long-running application development

Anthropic is an AI safety and research company that's working to build reliable, interpretable, and steerable AI systems.

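March 24, 2026 · Original

How to Allocate Time Spent with AI

Even if you work or explore with AI all day, you should still assign different kinds of tasks to different parts of the day. Evenings suit reviewing, cleaning up, and planning; compared with shipping, that sense of cleanup feels more comfortable to me.

March 22, 2026 · Read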

Effective harnesses for long-running agents

As AI agents become more capable, developers are increasingly asking them to take on complex tasks requiring work that spans hours, or even days. However, getting agents to make consistent progress ac

March 22, 2026 · Read

Building a C compiler with a team of parallel Claudes

Written by Nicholas Carlini, a researcher on our Safeguards team. I've been experimenting with a new approach to supervising language models that we’re calling "agent teams." With agent teams, multipl

March 22, 2026 · Read

Demystifying evals for AI agents

Introduction. Good evaluations help teams ship AI agents more confidently. Without them, it’s easy to get stuck in reactive loops, catching issues only in production, where fixing one failure creates oth

March 22, 2026 · Read

Designing AI resistant technical evaluations

Written by Tristan Hume, a lead on Anthropic's performance optimization team. Tristan designed, and redesigned, the take-home test that's helped Anthropic hire dozens of performance engineers. Evaluating

March 22, 2026 · Read

Profiling Hacker News users based on their comments

Here’s a mildly dystopian prompt I’ve been experimenting with recently: “Profile this user”, accompanied by a copy of their last 1,000 comments on Hacker News. Obtaining those comments is easy. …

March 22, 2026 · Read

Using Git with coding agents - Agentic Engineering Patterns

March 22, 2026 · Read

Some Things Just Take Time

On friction, patience, and planting trees.

March 22, 2026 · Read

How to Get New Ideas

(Someone fed my essays into GPT to make something that could answer questions based on them, then asked it where good ideas come from. The answer was ok, but not what I would have said. This is what I would have said.)

March 21, 2026 · Read

On becoming a day person

My biggest game-changer

March 21, 2026 · Read

Turbo Pascal 3.02A, deconstructed

In Things That Turbo Pascal is Smaller Than James Hague lists things (from 2011) that are larger in size than Borland's 1985 Turbo Pascal 3.02 executable - a 39,731 byte …

March 20, 2026 · Read

Open SWE: An Open-Source Framework for Internal Coding Agents

Over the past year, we've observed several engineering organizations building internal coding agents that operate alongside their development teams. Stripe developed Minions, Ramp built I

March 20, 2026 · Read

GitHub - langchain-ai/open-swe: An Open-Source Asynchronous Coding Agent

An Open-Source Asynchronous Coding Agent. Contribute to langchain-ai/open-swe development by creating an account on GitHub.

March 20, 2026 · Read

Thoughts on OpenAI acquiring Astral and uv/ruff/ty

The big news this morning: Astral to join OpenAI (on the Astral blog) and OpenAI to acquire Astral (the OpenAI announcement). Astral are the company behind uv, ruff, and ty—three …

March 19, 2026 · Read

Kagi Small Web

Discover the small web - personal blogs, independent YouTube channels, and webcomics from genuine humans on the internet.

March 19, 2026 · Read

Rob Pike's 5 Rules of Programming

Pike's rules 1 and 2 restate Tony Hoare's famous maxim "Premature optimization is the root of all evil."

March 19, 2026 · Read

Autoresearching Apple’s “LLM in a Flash” to run Qwen 397B locally

Here's a fascinating piece of research by Dan Woods, who managed to get a custom version of Qwen3.5-397B-A17B running at 5.5+ tokens/second on a 48GB MacBook Pro M3 Max despite …

March 19, 2026 · Read

Snowflake Cortex AI Escapes Sandbox and Executes Malware

PromptArmor report on a prompt injection attack chain in Snowflake's Cortex Agent, now fixed. The attack started when a Cortex user asked the agent to review a GitHub repository that …

March 18, 2026 · Read

File over app File over app is a philosophy: if you want to create digital artifacts that last, they must be files yo...

File over app File over app is a philosophy: if you want to create digital artifacts that last, they must be files you can control, in formats that are easy to retrieve and read. Use tools that give

March 18, 2026 · Read

Getting Claude to Actually Read Your CLAUDE.md

Dex · March 17, 2026 · < 2 min read. Claude Code wraps your CLAUDE.md in a <system_reminder> that explicitly tells the model the contents "may or may not be relevant." The longer your file gets, the mor

March 18, 2026 · Read

Thariq on X: "Lessons from Building Claude Code: How We Use Skills " / X

Skills have become one of the most used extension points in Claude Code. They’re flexible, easy to make, and simple to distribute. But this flexibility also makes it hard to know what works best. What

March 17, 2026 · Read

How to Do Great Work

If you collected lists of techniques for doing great work in a lot of different fields, what would the intersection look like? I decided to find out by making it.

March 17, 2026 · Read

Superlinear Returns

One of the most important things I didn't understand about the world when I was a child is the degree to which the returns for performance are superlinear.

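March 17, 2026 · Read

Why AI Only Says Correct but Useless Things, and How to Force It Out of Its Comfort Zone

An LLM's default output is consensus: correct but mediocre. Deep Research is really Wide Research. We found a systematic method that uses personal cognitive context to forcibly pull an LLM out of consensus. One year of experiments, with controlled-variable evidence.

March 17, 2026 · Read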

What is agentic engineering? - Agentic Engineering Patterns

March 17, 2026 · Read

My fireside chat about agentic engineering at the Pragmatic Summit

I was a speaker last month at the Pragmatic Summit in San Francisco, where I participated in a fireside chat session about Agentic Engineering hosted by Eric Lui from Statsig. …

March 17, 2026 · Read

AI as teleportation

Here’s a thought experiment for pondering the effects AI might have on society: What if we invented teleportation?

March 17, 2026 · Read

Implementing a clear room Z80 / ZX Spectrum emulator with Claude Code

antirez, 20 days ago. 54087 views. Anthropic recently released a blog post with the description of an experiment in which the last version of Opus, the 4.6, was instructed to write a

March 17, 2026 · Read

The Final Bottleneck

AI speeds up writing code, but accountability and review capacity still impose hard limits.

March 17, 2026 · Read

Enough AI copilots! We need AI HUDs

In my opinion, one of the best critiques of modern AI design comes from a 1992 talk by the researcher Mark Weiser where he ranted against “copilot” as a metaphor for AI.

March 17, 2026 · Read

A Language For Agents

Written on February 09, 2026. Last year I first started thinking about what the future of programming languages might look like now that agentic engineering is a growing thing.

March 17, 2026 · Read

Why Leonardo was a saboteur, Gutenberg went broke, and Florence was weird – Ada Palmer

Ambassador visiting Renaissance Florence: “Where am I? None of this has existed for a thousand years."

March 16, 2026 · Read

Finding and Fixing Ghostty's Largest Memory Leak

A few months ago, users started reporting that Ghostty was consuming absurd amounts of memory, with one user reporting 37 GB after 10 days of uptime. Today, I'm happy to say the fix has been found and merged. This post is an overview of what caused the leak, a look at some of Ghostty's internals, and some brief descriptions of how we tracked it down.

March 16, 2026 · Read

Don't Trip[wire] Yourself: Testing Error Recovery in Zig

I've written a library called Tripwire for injecting failures into Zig programs for the express purpose of testing error handling paths. Outside of unit tests, it is completely optimized away and has zero runtime cost (space or time).

March 15, 2026 · Read

Harness engineering: leveraging Codex in an agent-first world

Over the past five months, our team has been running an experiment: building and shipping an internal beta of a software product with 0 lines of manually-written code. The product has internal daily us

March 15, 2026 · Read

My AI Adoption Journey

My experience adopting any meaningful tool is that I've necessarily gone through three phases: (1) a period of inefficiency (2) a period of adequacy, then finally (3) a period of workflow and life-altering discovery.

March 14, 2026 · Read

Skill Issue: Harness Engineering for Coding Agents

We've spent the past year watching coding agents fail in every conceivable way: ignoring instructions, executing dangerous commands un-prompted, and going in circles on the simplest of tasks. We've se