推文 | Alfred's Site

账号:全部 mattpocockuk dair_ai simonw bcherny adocomplete trq212 dexhorthy yetone felixrieseberg 0xblacklight dani_avila7 zarazhangrui AlchainHust badlogicgames dotey petergyang vikingmute ryancarson ClaudeDevs kunchenguid leon7hao ctatedev iamzhihui kieranklaassen mckaywrigley lennysan thdxr RhysSullivan garrytan jakevin7 johnlindquist karpathy servasyy_ai theo 0xMovez 9hills RLanceMartin aidenybai dingyi hylarucoder mitchellh mitsuhiko mvanhorn omarsar0 steipete zeeg GeminiApp GergelyOrosz ZaynHao addyosmani antirez danshipper dillon_mulroy elonmusk ewind_dev idoubicc nummanali realWeZZard swyx thsottiaux yiliush 0xPaulius 0x_rody AYi_AInotes AnthropicAI Barret_China BenJames_____CMGS1988 ChadMoran DanielMiessler DaveJ DavidKPiano Dimillian FactoryAI FardeemM FarzaTV GoogleDeepMind HamelHusain HiTw93 HilaShmuel IfanJew IndieDevHailey Jack_W_Lindsey Jackywine JasonZX Jiaxi_Cui JinjingLiang KhalidWarsa LinearUncle LinghuaJ MatthewBerman MaxForAI Meari_V2_0_G Mnilax QingQ77 RaillyHugo Saccc_c Taniyatweets_VincentLogic ai_explorer25 aibuilderclub_alexalbert__alliekmiller andrewfarah antoinecojp anxue201 arkuy99 artman asynkimo bbssppllvv bearliu bentossell bridgemindai buaaxhm cailynyongyong cgtwts charmaine_klee chrisbarber chrisparkX clairevo delba_oliveira demishassabis doodlestein driaforall elithrar elvissun ericzakariasson francoisfleuret gabriell_lab iamsahaj_xyz imdigitalashish imvihv jarredsumner jerryjliu0 jianshuo jiayuan_jy jlongster jonas_nelle joshalbrecht jpschroeder kaiofreitas kepano lexrus logancyang mattyp nabeel nifinet ninthbit_ai oran_ge patrickc pejmanjohn petradonka prathamgrv prathyvsh quanruzhuoxiu quant_sheep raroque realCaigu repsiace rohanpaul_ai sama samuelstroschei sawyerhood shadcn shao__meng shaogefenhao shreyansj techgirl1908 techwith_ram thorstenball tianyi tobiaswup toddsaunders tricalt turingou tuturetom uniswap12 untraceable_the vanillaCitron velvet_shark wengtianxin wquguru xiaogaifun xicilion ycombinator zachlloydtweets ziwenxu_zodchiii

@yiliush·4月7日For You

before collaborator i built out this whole agentic knowledge base app with an entire source docs → knowledge graph → mcp pipeline... and just never released it. you guys want me to release it? https://t.co/jStkiTOgq2

@karpathy

LLM Knowledge Bases Something I'm finding very useful recently: using LLMs to build personal knowledge bases for various topics of research interest. In this way, a large fraction of my recent token throughput is going less into manipulating code, and more into manipulating knowledge (stored as markdown and images). The latest LLMs are quite good at it. So: Data ingest: I index source documents (articles, papers, repos, datasets, images, etc.) into a raw/ directory, then I use an LLM to incrementally "compile" a wiki, which is just a collection of .md files in a directory structure. The wiki includes summaries of all the data in raw/, backlinks, and then it categorizes data into concepts, writes articles for them, and links them all. To convert web articles into .md files I like to use the Obsidian Web Clipper extension, and then I also use a hotkey to download all the related images to local so that my LLM can easily reference them. IDE: I use Obsidian as the IDE "frontend" where I can view the raw data, the the compiled wiki, and the derived visualizations. Important to note that the LLM writes and maintains all of the data of the wiki, I rarely touch it directly. I've played with a few Obsidian plugins to render and view data in other ways (e.g. Marp for slides). Q&A: Where things get interesting is that once your wiki is big enough (e.g. mine on some recent research is ~100 articles and ~400K words), you can ask your LLM agent all kinds of complex questions against the wiki, and it will go off, research the answers, etc. I thought I had to reach for fancy RAG, but the LLM has been pretty good about auto-maintaining index files and brief summaries of all the documents and it reads all the important related data fairly easily at this ~small scale. Output: Instead of getting answers in text/terminal, I like to have it render markdown files for me, or slide shows (Marp format), or matplotlib images, all of which I then view again in Obsidian. You can imagine many other visual output formats depending on the query. Often, I end up "filing" the outputs back into the wiki to enhance it for further queries. So my own explorations and queries always "add up" in the knowledge base. Linting: I've run some LLM "health checks" over the wiki to e.g. find inconsistent data, impute missing data (with web searchers), find interesting connections for new article candidates, etc., to incrementally clean up the wiki and enhance its overall data integrity. The LLMs are quite good at suggesting further questions to ask and look into. Extra tools: I find myself developing additional tools to process the data, e.g. I vibe coded a small and naive search engine over the wiki, which I both use directly (in a web ui), but more often I want to hand it off to an LLM via CLI as a tool for larger queries. Further explorations: As the repo grows, the natural desire is to also think about synthetic data generation + finetuning to have your LLM "know" the data in its weights instead of just context windows. TLDR: raw data from a given number of sources is collected, then compiled by an LLM into a .md wiki, then operated on by various CLIs by the LLM to do Q&A and to incrementally enhance the wiki, and all of it viewable in Obsidian. You rarely ever write or edit the wiki manually, it's the domain of the LLM. I think there is room here for an incredible new product instead of a hacky collection of scripts.

↗ view on x.com

在 Collaborator 推出之前，我搭了一套完整的 agentic 知识库应用，整个管道是：源文档 → 知识图谱 → MCP。一直没发布。大家想让我开源吗？【引用 @karpathy】 LLM 知识库最近发现一件很有用的事：用 LLM 为各类研究主题构建个人知识库。我最近大量的 token 消耗不再是操作代码，而是操作知识（以 markdown 和图片形式存储）。最新的 LLM 在这方面表现相当好。具体做法如下： **数据摄取**：把源文档（文章、论文、代码库、数据集、图片等）索引到 raw/ 目录，然后用 LLM 增量「编译」一个 wiki——本质上就是一个目录结构下的 .md 文件集合。wiki 包含所有数据的摘要、反向链接，并将数据分类为概念、撰写文章、互相链接。把网页文章转为 .md 我喜欢用 Obsidian Web Clipper，还有一个快捷键可以把所有相关图片下载到本地，方便 LLM 直接引用。 **IDE**：用 Obsidian 作为「前端」，查看原始数据、编译好的 wiki 和衍生可视化内容。重要的是：LLM 负责编写和维护 wiki 的所有内容，我几乎不直接修改它。 **Q&A**：当 wiki 足够大（比如我某个研究主题的 wiki 已有约 100 篇文章、约 40 万字），就可以让 LLM agent 针对 wiki 回答各种复杂问题。我以为必须上 fancy RAG，但实际上 LLM 在自动维护索引文件和所有文档的简短摘要方面表现不错，在这个小规模下能相当轻松地读取所有重要相关数据。 **输出**：不是在终端拿文本答案，而是让它为我渲染 markdown 文件、幻灯片（Marp 格式）或 matplotlib 图像，然后在 Obsidian 里查看。可以想象出很多其他可视化输出格式。我经常把输出「归档」回 wiki，以便后续查询时增强知识库。 **Linting**：对 wiki 运行了一些 LLM「健康检查」，比如找出不一致的数据、补全缺失数据（配合 web 搜索）、发现有趣的关联作为新文章候选等，从而增量清理 wiki、提升整体数据完整性。 **额外工具**：我发现自己在开发额外工具来处理这些数据，比如 vibe coded 了一个轻量搜索引擎，既可以直接在 web UI 里用，更多时候是通过 CLI 作为工具交给 LLM 处理更大的查询。 **进一步探索**：随着 repo 增长，自然会想到合成数据生成 + 微调，让 LLM 把这些知识「内化」到权重里，而不仅仅依赖 context window。 TLDR：从若干来源收集原始数据，由 LLM 编译成 .md wiki，再通过各种 CLI 由 LLM 进行 Q&A 和增量增强，所有内容都可在 Obsidian 中查看。你几乎不需要手动编写或编辑 wiki，那是 LLM 的地盘。我认为这里有机会做出一款出色的新产品，而不是一堆临时脚本的拼凑。

@yiliush·3月27日For You

tmux + xterm + node-pty = horrible scrollback artifacting. I struggled all week with maxxed claude + codex + gemini. all failed. I realized the solution was to drop tmux and persist node-pty in a sidecar. None of the agents even proposed this approach. https://t.co/nSWHJarknD https://t.co/WjExM9XUYi

↗ view on x.com

tmux + xterm + node-pty 组合会产生严重的滚动回显 artifact。我花了一整周，用 Claude、Codex、Gemini 全都没解决。最终我意识到解法是：去掉 tmux，把 node-pty 持久化在一个 sidecar 进程里。没有任何一个 agent 提出过这个方案。

760 tweets · 188 sources