Chronology

时间线

2026

五月

阅读
How AI Caused and Fixed My Insomnia

After weeks of severe insomnia, I used AI to build an iOS app that exported HealthKit data and ran multivariate regression to find the root cause—late-night...

阅读
AI如何导致和修复了我的失眠问题

作为一个重度AI用户,我在经历长期严重失眠后没有走常规的"排除变量"路线,而是用AI写了一个iOS app导出HealthKit数据,做多变量回归分析找到了真正的原因——晚上使用AI高强度思考。这篇文章分享了AI如何在全链条上提供执行力支持,也反思了人的judgment和认知上的成本结构,在AI时代如何重塑我们的决策路径。

原创
/goal 是给盲眼裁判判题:一个隐喻打通三件实践

/goal 最常见的失败模式不是 agent 不努力,而是 condition 写得让一个根本看不见现场的裁判没法判 —— 这个隐喻一旦立起来,PLANS.md、状态化 condition、屏幕输出原则三件实践就全部归一。

四月

阅读
Ghostty Is Leaving GitHub

Writing this makes me irrationally sad, but Ghostty will be leaving GitHub1.

阅读
An update on recent Claude Code quality reports

Over the past month, we’ve been looking into reports that Claude’s responses have worsened for some users. We’ve traced these reports to three separate changes that affected Claude Code, the Claude Ag

阅读
Multi-Agents: What's Actually Working

http://x.com/i/article/2046690715657478145

阅读
Simdutf Can Now Be Used Without libc++ or libc++abi

As of this PR, simdutf can be used without libc++ or libc++abi1.

阅读
GitHub - microsoft/markitdown: Python tool for converting files and office documents to Markdown.

Python tool for converting files and office documents to Markdown. - microsoft/markitdown

阅读
The Center Has a Bias

Why a measured position on AI tends to lean towards actually trying it.

阅读
Prompt Caching 作为 Harness 工程的一等约束

← 目录 EN → AI 编程推理与性能AI Agent 一个反直觉的 PR 2026 年初,Anthropic 取消了 Pro 订阅用户对第三方 harness 的登录支持,所有第三方工具必须走 API 付费。在这个背景下,Claude Code 的核心作者之一给 OpenClaw 提交了一个看起来违背常理的 PR(OpenClaw #58036):在对话历史需要压缩(comp

阅读
模型背后是否有情绪,情绪会影响行为吗

← 目录 EN → 模型架构安全与供应链 想象这样一个场景。你在用 AI 写代码,让它实现一个函数,测试怎么都过不了。AI 试了三次、五次、七次,每次都失败。然后在第八次尝试时,它突然走了一条捷径:绕过测试逻辑,用硬编码的方式直接让测试通过。 你可能会说:这就是个 bug,模型胡来了。 但 Anthropic 的研究者发现了一件更微妙的事。在模型走捷径之前的那几步推理中,它内部有

原创
有些摩擦是必要的,有些习惯需要自己养成

最近完成了一些基建的搭建,整体的执行效率越来越快了,但我发现注意力也越来越分散了。越来越急于求成,出现卡点的时候,没有静下来审视问题本身。

阅读
"The Git Commands I Run Before Reading Any Code"

"Five git commands that tell you where a codebase hurts before you open a single file. Churn hotspots, bus factor, bug clusters, and crisis patterns."

阅读
Scaling Managed Agents: Decoupling the brain from the hands

Anthropic is an AI safety and research company that's working to build reliable, interpretable, and steerable AI systems.

阅读
Press Release April 8th | EARENDIL

Earendil is a public benefit corporation crafting software and open protocols to strengthen human agency, bridge division, and cultivate lasting joy.

阅读
A Reflection on our Announcement Today | EARENDIL

Earendil is a public benefit corporation crafting software and open protocols to strengthen human agency, bridge division, and cultivate lasting joy.

阅读
Sparse Rewards: Enlightenment and Reinforcement Learning

A small primer on Reinforcement Learning In AI, there is a phase of training models called Reinforcement Learning. By this point, the model has already learned about the world — it knows what a book

阅读
The Building Block Economy

http://x.com/i/article/2041548775328829440

阅读
The Great Convergence

http://x.com/i/article/2039731611814764545

阅读
I've sold out

2026-04-08 What a nice WebGL shader. Look at draining your battery. Why would you do that?"It's like poetry, it rhymes" - the great George Lucas"I tell you what I want, what I really, reall

阅读
GitHub - KeygraphHQ/shannon: Shannon Lite is an autonomous, white-box AI pentester for web applications and APIs. It analyzes your source code, identifies attack vectors, and executes real exploits to prove vulnerabilities before they reach production.

Shannon Lite is an autonomous, white-box AI pentester for web applications and APIs. It analyzes your source code, identifies attack vectors, and executes real exploits to prove vulnerabilities bef...

阅读
Michael Nielsen – How science actually progresses

The true story of Einstein, Newton, and Darwin

阅读
The Building Block Economy

The most effective way to build software and get massive adoption is no longer high quality mainline apps but via building blocks that enable and encourage others to build quantity over quality.1

阅读
Emotion concepts and their function in a large language model

All modern language models sometimes act like they have emotions. They may say they’re happy to help you, or sorry when they make a mistake. Sometimes they even appear to become frustrated or anxious

阅读
Prototyping with LLMs

Writing about the big beautiful mess that is making things for the world wide web.

阅读
[MODEL] Claude Code is unusable for complex engineering tasks with the Feb updates

Preflight Checklist I have searched existing issues for similar behavior reports This report does NOT contain sensitive information (API keys, passwords, etc.) Type of Behavior Issue Other unexpect...

阅读
GitHub - dmtrKovalenko/fff.nvim: The fastest and the most accurate file search toolkit for AI agents, Neovim, Rust, C, and NodeJS

The fastest and the most accurate file search toolkit for AI agents, Neovim, Rust, C, and NodeJS - dmtrKovalenko/fff.nvim

阅读
Absurd In Production

Five months of durable execution with just Postgres.

阅读
Information and Technological Evolution

I spend a lot of time reading about the nature of technological progress, and I’ve found that the literature on technology is somewhat uneven.

阅读
Harnessing Claude’s intelligence

One of Anthropic’s co-founders, Chris Olah, says that generative AI systems like Claude are grown more than they are built. Researchers set the conditions to direct growth, but the exact structure or

阅读
RAG is (Not) Dead: How to Think about Building RAG Systems

RAG is Dead"RAG is dead!", the internet says. "Long live <a roundabout description of RAG>!The problem? Everything that's being framed as a "RAG Killer" is just another form of RAG. What do I mean? It

阅读
Compound Engineering: 3/31/2026

http://x.com/i/article/2038887861387444224

三月

阅读
一行代码的事,Web 为什么做了三十年还没做到

在iOS上查询排版结果只需一行代码,Web上需要触发整个页面的重新布局。这不是因为浏览器工程师蠢,而是CSS在1994年做了一个声明式的架构选择。这个选择的天花板更高,但代价是中间状态不可查询。Facebook在2012年因为不理解这个trade-off付出了数亿美元的代价。SwiftUI和Jetpack...

阅读
ChatGPT Won't Let You Type Until Cloudflare Reads Your React State. I Decrypted the Program That Does It.

Every ChatGPT message triggers a Cloudflare Turnstile program that runs silently in your browser. I decrypted 377 of these programs from network traffic and found something that goes beyond standard browser fingerprinting. The program checks 55 properties spanning three layers: your browser (GPU, screen, fonts), the Cloudflare network (your city, your IP, your region from edge headers), and the ChatGPT React application itself (__reactRouterContext, loaderData, clientBootstrap). Turnstile doesn

原创
大多数想法都不重要

大多数想法都不重要。有了 AI,信号源太多,到处都是触发想法的东西。更关键的是,AI 把想法到行动之间的摩擦抹平了。做多了会很累,不是体力上,而是一种精神不聚焦的疲惫。

原创
少给方案,反而得到更好的方案

我在给 Claude Code 写一个 skill,需要引用一份在线文档里提炼出来的原则。但文档会更新,原则可能过时,怎么保持同步是个问题。

阅读
Anatomy of the .claude/ Folder

A complete guide to CLAUDE.md, custom commands, skills, agents, and permissions, and how to set them up properly.

阅读
The Age of the Amplifier

As we’ve noted more than a few times before, for most of the 20th century AT&T’s Bell Labs was the premier industrial research lab in the US.

阅读
GitHub - Yeachan-Heo/oh-my-claudecode: Teams-first Multi-agent orchestration for Claude Code

Teams-first Multi-agent orchestration for Claude Code - Yeachan-Heo/oh-my-claudecode

阅读
Run Claude Code programmatically - Claude Code Docs

The Agent SDK gives you the same tools, agent loop, and context management that power Claude Code. It’s available as a CLI for scripts and CI/CD, or as Python and TypeScript packages for full programm

阅读
Run prompts on a schedule - Claude Code Docs

Scheduled tasks let Claude re-run a prompt automatically on an interval. Use them to poll a deployment, babysit a PR, check back on a long-running build, or remind yourself to do something later in t

阅读
Push events into a running session with channels - Claude Code Docs

A channel is an MCP server that pushes events into your running Claude Code session, so Claude can react to things that happen while you’re not at the terminal. Channels can be two-way: Claude reads

阅读
Create custom subagents - Claude Code Docs

Subagents are specialized AI assistants that handle specific types of tasks. Each subagent runs in its own context window with a custom system prompt, specific tool access, and independent permissions

阅读
How Claude remembers your project - Claude Code Docs

Each Claude Code session begins with a fresh context window. Two mechanisms carry knowledge across sessions: CLAUDE.md files: instructions you write to give Claude persistent context Auto memory: note

原创
一步一步引导模型建立更好的 Context

和 Claude 协作久了,我发现一个很有意思的事:第一轮 prompt 写得越完整,效果不一定好。

阅读
GitHub - letta-ai/claude-subconscious: Give Claude Code a subconscious

Give Claude Code a subconscious. Contribute to letta-ai/claude-subconscious development by creating an account on GitHub.

阅读
Thoughts on slowing the fuck down

Thoughts on slowing the fuck down

阅读
Claude Code auto mode: a safer way to skip permissions

Anthropic is an AI safety and research company that's working to build reliable, interpretable, and steerable AI systems.

阅读
What You (Want to)* Want

Since I was about 9 I've been puzzled by the apparent contradiction between being made of matter that behaves in a predictable way, and the feeling that I could choose to do whatever I wanted. At the time I had a self-interested motive for exploring the question. At that age (like most succeeding ages) I was always in trouble with the authorities, and it seemed to me that there might possibly be some way to get out of trouble by arguing that I wasn't responsible for my actions. I gradually lost hope of that, but the puzzle remained: How do you reconcile being a machine made of matter with the feeling that you're free to choose what you do?

阅读
The Need to Read

In the science fiction books I read as a kid, reading had often been replaced by some more efficient way of acquiring knowledge. Mysterious "tapes" would load it into one's brain like a program being loaded into a computer.

阅读
lots of folks running expensive sandboxes but really all you need is a filesystem but really you don't even need a fi...

lots of folks running expensive sandboxes but really all you need is a filesystem but really you don't even need a filesystem, you just need a filesystem API that frontends something like a database

阅读
Auto mode for Claude Code

Really interesting new development in Claude Code today as an alternative to --dangerously-skip-permissions: Today, we're introducing auto mode, a new permissions mode in Claude Code where Claude makes permission decisions …

阅读
Harness design for long-running application development

Anthropic is an AI safety and research company that's working to build reliable, interpretable, and steerable AI systems.

原创
如何分配和 AI 在一起的时间

即使全天都和 AI 在一起工作或探索,也应该按时间段分配不同类型的事情。晚上适合回顾、清理、规划,比起 ship,这种清理的感觉让我更舒服。

阅读
Effective harnesses for long-running agents

As AI agents become more capable, developers are increasingly asking them to take on complex tasks requiring work that spans hours, or even days. However, getting agents to make consistent progress ac

阅读
Building a C compiler with a team of parallel Claudes

Written by Nicholas Carlini, a researcher on our Safeguards team. I've been experimenting with a new approach to supervising language models that we’re calling "agent teams." With agent teams, multipl

阅读
Demystifying evals for AI agents

IntroductionGood evaluations help teams ship AI agents more confidently. Without them, it’s easy to get stuck in reactive loops—catching issues only in production, where fixing one failure creates oth

阅读
Designing AI resistant technical evaluations

Written by Tristan Hume, a lead on Anthropic's performance optimization team. Tristan designed—and redesigned—the take-home test that's helped Anthropic hire dozens of performance engineers.Evaluating

阅读
Profiling Hacker News users based on their comments

Here’s a mildly dystopian prompt I’ve been experimenting with recently: “Profile this user”, accompanied by a copy of their last 1,000 comments on Hacker News. Obtaining those comments is easy. …

阅读
Using Git with coding agents - Agentic Engineering Patterns

Using Git with coding agents - Agentic Engineering Patterns

阅读
Some Things Just Take Time

On friction, patience, and planting trees.

阅读
How to Get New Ideas

(Someone fed my essays into GPT to make something that could answer questions based on them, then asked it where good ideas come from. The answer was ok, but not what I would have said. This is what I would have said.)

阅读
On becoming a day person

My biggest game-changer

阅读
Turbo Pascal 3.02A, deconstructed

In Things That Turbo Pascal is Smaller Than James Hague lists things (from 2011) that are larger in size than Borland's 1985 Turbo Pascal 3.02 executable - a 39,731 byte …

阅读
Open SWE: An Open-Source Framework for Internal Coding Agents

Over the past year, we've observed several engineering organizations building internal coding agents that operate alongside their development teams. Stripe developed Minions, Ramp built I

阅读
GitHub - langchain-ai/open-swe: An Open-Source Asynchronous Coding Agent

An Open-Source Asynchronous Coding Agent. Contribute to langchain-ai/open-swe development by creating an account on GitHub.

阅读
Thoughts on OpenAI acquiring Astral and uv/ruff/ty

The big news this morning: Astral to join OpenAI (on the Astral blog) and OpenAI to acquire Astral (the OpenAI announcement). Astral are the company behind uv, ruff, and ty—three …

阅读
Kagi Small Web

Discover the small web - personal blogs, independent YouTube channels, and webcomics from genuine humans on the internet.

阅读
Rob Pike's 5 Rules of Programming

Pike's rules 1 and 2 restate Tony Hoare's famous maxim "Premature optimization is the root of all evil."

阅读
Autoresearching Apple’s “LLM in a Flash” to run Qwen 397B locally

Here's a fascinating piece of research by Dan Woods, who managed to get a custom version of Qwen3.5-397B-A17B running at 5.5+ tokens/second on a 48GB MacBook Pro M3 Max despite …

阅读
Snowflake Cortex AI Escapes Sandbox and Executes Malware

PromptArmor report on a prompt injection attack chain in Snowflake's Cortex Agent, now fixed. The attack started when a Cortex user asked the agent to review a GitHub repository that …

阅读
File over app File over app is a philosophy: if you want to create digital artifacts that last, they must be files yo...

File over app File over app is a philosophy: if you want to create digital artifacts that last, they must be files you can control, in formats that are easy to retrieve and read. Use tools that give

阅读
Getting Claude to Actually Read Your CLAUDE.md

Dex · March 17, 2026 · < 2 min readClaude Code wraps your CLAUDE.md in a <system_reminder> that explicitly tells the model the contents "may or may not be relevant." The longer your file gets, the mor

阅读
Thariq on X: "Lessons from Building Claude Code: How We Use Skills " / X

Skills have become one of the most used extension points in Claude Code. They’re flexible, easy to make, and simple to distribute.But this flexibility also makes it hard to know what works best. What

阅读
How to Do Great Work

If you collected lists of techniques for doing great work in a lot of different fields, what would the intersection look like? I decided to find out by making it.

阅读
Superlinear Returns

One of the most important things I didn't understand about the world when I was a child is the degree to which the returns for performance are superlinear.

阅读
为什么AI只会说正确的废话,以及怎么把它逼出舒适区

LLM的默认输出是consensus:正确但平庸。Deep Research其实是Wide Research。我们找到了一种系统性方法,用个人认知上下文把LLM从consensus里强行扯出来。一年实验,有控制变量证据。

阅读
What is agentic engineering? - Agentic Engineering Patterns

What is agentic engineering? - Agentic Engineering Patterns

阅读
My fireside chat about agentic engineering at the Pragmatic Summit

I was a speaker last month at the Pragmatic Summit in San Francisco, where I participated in a fireside chat session about Agentic Engineering hosted by Eric Lui from Statsig. …

阅读
AI as teleportation

Here’s a thought experiment for pondering the effects AI might have on society: What if we invented teleportation?

阅读
Implementing a clear room Z80 / ZX Spectrum emulator with Claude Code

[antirez](/user/antirez) 20 days ago. 54087 views. Anthropic recently released a blog post with the description of an experiment in which the last version of Opus, the 4.6, was instructed to write a

阅读
The Final Bottleneck

AI speeds up writing code, but accountability and review capacity still impose hard limits.

阅读
Enough AI copilots! We need AI HUDs

In my opinion, one of the best critiques of modern AI design comes from a 1992 talk by the researcher Mark Weiser where he ranted against “copilot” as a metaphor for AI.

阅读
A Language For Agents

written on February 09, 2026 Last year I first started thinking about what the future of programming languages might look like now that agentic engineering is a growing thing.

阅读
Why Leonardo was a saboteur, Gutenberg went broke, and Florence was weird – Ada Palmer

Ambassador visiting Renaissance Florence: “Where am I? None of this has existed for a thousand years."

阅读
Finding and Fixing Ghostty's Largest Memory Leak

A few months ago, users started reporting that Ghostty was consuming absurd amounts of memory, with one user reporting 37 GB after 10 days of uptime. Today, I'm happy to say the fix has been found and merged. This post is an overview of what caused the leak, a look at some of Ghostty's internals, and some brief descriptions of how we tracked it down.1

阅读
Don't Trip[wire] Yourself: Testing Error Recovery in Zig

I've written a library called Tripwire1 for injecting failures into Zig programs for the express purpose of testing error handling paths. Outside of unit tests, it is completely optimized away and has zero runtime cost (space or time).

阅读
Harness engineering: leveraging Codex in an agent-first world

Over the past five months, our team has been running an experiment: building and shipping an internal beta of a software product with 0 lines of manually-written code.The product has internal daily us

阅读
My AI Adoption Journey

My experience adopting any meaningful tool is that I've necessarily gone through three phases: (1) a period of inefficiency (2) a period of adequacy, then finally (3) a period of workflow and life-altering discovery.

阅读
Skill Issue: Harness Engineering for Coding Agents

We've spent the past year watching coding agents fail in every conceivable way: ignoring instructions, executing dangerous commands un-prompted, and going in circles on the simplest of tasks. We've se