LLM Knowledge Bases Something I'm finding very useful recently: using LLMs to build personal knowledge bases for various topics of research interest. In this way, a large fraction of my recent token throughput is going less into manipulating code, and more into manipulating knowledge (stored as markdown and images). The latest LLMs are quite good at it. So: Data ingest: I index source documents (articles, papers, repos, datasets, images, etc.) into a raw/ directory, then I use an LLM to incrementally "compile" a wiki, which is just a collection of .md files in a directory structure. The wiki includes summaries of all the data in raw/, backlinks, and then it categorizes data into concepts, writes articles for them, and links them all. To convert web articles into .md files I like to use the Obsidian Web Clipper extension, and then I also use a hotkey to download all the related images to local so that my LLM can easily reference them. IDE: I use Obsidian as the IDE "frontend" where I can view the raw data, the the compiled wiki, and the derived visualizations. Important to note that the LLM writes and maintains all of the data of the wiki, I rarely touch it directly. I've played with a few Obsidian plugins to render and view data in other ways (e.g. Marp for slides). Q&A: Where things get interesting is that once your wiki is big enough (e.g. mine on some recent research is ~100 articles and ~400K words), you can ask your LLM agent all kinds of complex questions against the wiki, and it will go off, research the answers, etc. I thought I had to reach for fancy RAG, but the LLM has been pretty good about auto-maintaining index files and brief summaries of all the documents and it reads all the important related data fairly easily at this ~small scale. Output: Instead of getting answers in text/terminal, I like to have it render markdown files for me, or slide shows (Marp format), or matplotlib images, all of which I then view again in Obsidian. You can imagine many other visual output formats depending on the query. Often, I end up "filing" the outputs back into the wiki to enhance it for further queries. So my own explorations and queries always "add up" in the knowledge base. Linting: I've run some LLM "health checks" over the wiki to e.g. find inconsistent data, impute missing data (with web searchers), find interesting connections for new article candidates, etc., to incrementally clean up the wiki and enhance its overall data integrity. The LLMs are quite good at suggesting further questions to ask and look into. Extra tools: I find myself developing additional tools to process the data, e.g. I vibe coded a small and naive search engine over the wiki, which I both use directly (in a web ui), but more often I want to hand it off to an LLM via CLI as a tool for larger queries. Further explorations: As the repo grows, the natural desire is to also think about synthetic data generation + finetuning to have your LLM "know" the data in its weights instead of just context windows. TLDR: raw data from a given number of sources is collected, then compiled by an LLM into a .md wiki, then operated on by various CLIs by the LLM to do Q&A and to incrementally enhance the wiki, and all of it viewable in Obsidian. You rarely ever write or edit the wiki manually, it's the domain of the LLM. I think there is room here for an incredible new product instead of a hacky collection of scripts.
在 Collaborator 推出之前,我搭了一套完整的 agentic 知识库应用,整个管道是:源文档 → 知识图谱 → MCP。一直没发布。大家想让我开源吗? 【引用 @karpathy】 LLM 知识库 最近发现一件很有用的事:用 LLM 为各类研究主题构建个人知识库。我最近大量的 token 消耗不再是操作代码,而是操作知识(以 markdown 和图片形式存储)。最新的 LLM 在这方面表现相当好。具体做法如下: **数据摄取**:把源文档(文章、论文、代码库、数据集、图片等)索引到 raw/ 目录,然后用 LLM 增量「编译」一个 wiki——本质上就是一个目录结构下的 .md 文件集合。wiki 包含所有数据的摘要、反向链接,并将数据分类为概念、撰写文章、互相链接。把网页文章转为 .md 我喜欢用 Obsidian Web Clipper,还有一个快捷键可以把所有相关图片下载到本地,方便 LLM 直接引用。 **IDE**:用 Obsidian 作为「前端」,查看原始数据、编译好的 wiki 和衍生可视化内容。重要的是:LLM 负责编写和维护 wiki 的所有内容,我几乎不直接修改它。 **Q&A**:当 wiki 足够大(比如我某个研究主题的 wiki 已有约 100 篇文章、约 40 万字),就可以让 LLM agent 针对 wiki 回答各种复杂问题。我以为必须上 fancy RAG,但实际上 LLM 在自动维护索引文件和所有文档的简短摘要方面表现不错,在这个小规模下能相当轻松地读取所有重要相关数据。 **输出**:不是在终端拿文本答案,而是让它为我渲染 markdown 文件、幻灯片(Marp 格式)或 matplotlib 图像,然后在 Obsidian 里查看。可以想象出很多其他可视化输出格式。我经常把输出「归档」回 wiki,以便后续查询时增强知识库。 **Linting**:对 wiki 运行了一些 LLM「健康检查」,比如找出不一致的数据、补全缺失数据(配合 web 搜索)、发现有趣的关联作为新文章候选等,从而增量清理 wiki、提升整体数据完整性。 **额外工具**:我发现自己在开发额外工具来处理这些数据,比如 vibe coded 了一个轻量搜索引擎,既可以直接在 web UI 里用,更多时候是通过 CLI 作为工具交给 LLM 处理更大的查询。 **进一步探索**:随着 repo 增长,自然会想到合成数据生成 + 微调,让 LLM 把这些知识「内化」到权重里,而不仅仅依赖 context window。 TLDR:从若干来源收集原始数据,由 LLM 编译成 .md wiki,再通过各种 CLI 由 LLM 进行 Q&A 和增量增强,所有内容都可在 Obsidian 中查看。你几乎不需要手动编写或编辑 wiki,那是 LLM 的地盘。我认为这里有机会做出一款出色的新产品,而不是一堆临时脚本的拼凑。
tmux + xterm + node-pty 组合会产生严重的滚动回显 artifact。我花了一整周,用 Claude、Codex、Gemini 全都没解决。 最终我意识到解法是:去掉 tmux,把 node-pty 持久化在一个 sidecar 进程里。 没有任何一个 agent 提出过这个方案。
357 tweets · 110 sources