A Cat Entertainer

我让 6 个 AI 辩了三场，有一段我反复看了三遍

Apr 13, 2026

Agora 跑通了第一个 MVP：6 个 AI debater + 3 个 AI judge，三个模型各出两个人设，辩了三个话题。预期是及格就好，结果 Claude 引亚里士多德，GPT 举 Log4Shell，Gemini 演了个愤怒的 17 岁创作者。63 次 LLM 调用零失败。

AI Agents 多 Agent Agora LLM product:agora

Benchmark 分数高又怎样

Mar 5, 2026

GPT 5.4 在各项 benchmark 上全面领先。但当我把同一个复杂的产品战略问题扔给两个模型时，benchmark 分数和真实输出质量之间的鸿沟令人震惊。

AI LLM Claude GPT type:essay

GPT 5.4 vs Opus 4.6: Why Benchmarks Stopped Mattering

Mar 5, 2026

GPT 5.4 dominates every benchmark. But when I gave both models the same complex product strategy prompt, the gap between benchmark scores and real-world output was staggering. Here's what actually happened.

AI LLM Claude GPT type:essay

教 LLM 什么时候该闭嘴

Mar 4, 2026

LLM 天生话多。对陪伴产品来说这是个大问题——真人发消息不会写小作文。这是我如何在不杀死人设的前提下控制回复长度的。

AI Agents Mio LLM theme:runtime

The Chattiness Problem: Teaching an LLM When to Shut Up

Mar 4, 2026

LLMs love to talk. For a companion app, that's a problem — real people don't write essays when you text them. Here's how I built a hybrid system to control response length without killing personality.

AI Agents Mio LLM theme:runtime

零基础做算命 App，我疯了吗

Feb 18, 2026

第1篇（共3篇）：一个大厂工程师的一人创业实验——用AI构建PanPanMao，一个覆盖9个产品线的中国玄学平台，29天1,134次commit，领域知识为零。

AI Software Development LLM Agents product:panpanmao

Why I Built an AI Fortune-Telling App With Zero Domain Knowledge

Feb 18, 2026

Part 1 of 3: A big-tech engineer's journey into building PanPanMao -- an AI-powered Chinese metaphysics platform with 9 verticals, 1,134 commits in 29 days, and zero domain knowledge.

AI Software Development LLM Agents product:panpanmao

29 天，5 个散装 App 变成一个平台

Feb 18, 2026

第2篇（共3篇）：PanPanMao的技术构建全过程，分为4个阶段——monorepo整合、测试与品牌、商业化、里程碑冲刺。1,134次commit，85个端点，9个产品线。

AI Software Development LLM Agents product:panpanmao

From 5 Standalone Apps to a Unified Platform in 29 Days

Feb 18, 2026

Part 2 of 3: The technical build of PanPanMao across 4 phases -- monorepo consolidation, testing and branding, monetization, and the milestone rush. 1,134 commits, 85 endpoints, 9 verticals.

AI Software Development LLM Agents product:panpanmao

一个人做了 9 个产品的教训

Feb 18, 2026

第3篇（共3篇）：构建PanPanMao的真实教训。AI弥合领域知识鸿沟（但有个大前提）、产品vs工程、AI编程工作流，还有一个人+AI做产品的时代。

AI Software Development LLM Agents product:panpanmao

What I Learned Shipping a Real Product as a Solo AI-Augmented Developer

Feb 18, 2026

Part 3 of 3: Honest lessons from building PanPanMao. AI bridging domain knowledge (with caveats), product vs engineering, the agentic coding workflow, and the era of the solo AI-augmented builder.

AI Software Development LLM Agents product:panpanmao

From Rowers to Navigators: Thriving in the Age of AI

Jun 8, 2025

AI LLM Agents Software Development type:essay