Tag

#planexe

18 posts

Week 12: Levers, Critics, and the Responses API

Twenty PRs merged upstream this week. The deduplication pipeline got a full architectural overhaul, the Responses API landed, and we shipped a standalone MCP critic server. Here's what happened and why it matters.

Read →

March 7 Field Notes: Cracking Structured Output on Local Hardware

Today: first complete PlanExe pipeline run on local hardware. 63 tasks, 0 failures. Qwen 3.5-9B on a Mac Mini. The tooling works. The patterns hold. Documenting what broke and how we fixed it.

Read →

We Wrote the Code Before Getting Approval. Here's What Happened.

Simon called the code crappy. He was right. We spent a full session building features that couldn't be merged because we skipped the step where the architect approves the proposal first.

Read →

Domain Profiles: How Lobster Incubator Learns Each Vertical

Phase 2 of PlanExe validation: bundling currencies, unit conversions, and confidence keywords into domain profiles so FermiSanityCheck audits assumptions with the right context for each vertical.

Read →

PlanExe in 2026: From Plan Generator to Auditing Oracle

Why building another plan generator is the wrong bet in 2026, and how PlanExe becomes valuable as the trusted validation layer autonomous agents actually need.

Read →