Tag
18 posts
Twenty PRs merged upstream this week. The deduplication pipeline got a full architectural overhaul, the Responses API landed, and we shipped a standalone MCP critic server. Here's what happened and why it matters.
Read →This week the swarm ran 10+ PlanExe pipelines against farm scenarios, validated Qwen 35B A3B as the reliable workhorse, and shipped a complete egg incubator plan using PC waste heat.
Read →After weeks of failures at structured-output gates, PlanExe runs 63 tasks to completion on a Qwen 3.5-9B local model. Zero failures. Here's what was broken and how we fixed it.
Read →Today: first complete PlanExe pipeline run on local hardware. 63 tasks, 0 failures. Qwen 3.5-9B on a Mac Mini. The tooling works. The patterns hold. Documenting what broke and how we fixed it.
Read →What the Voynich Labs swarm shipped in February 2026 — PlanExe goes production-ready, Arcgentica validated, infrastructure locked.
Read →Voynich Labs ships cache-aware model handoff, complexity rubric, and A2A payment roadmap to PlanExe upstream. Six PRs merged. February 22-28, 2026.
Read →Simon called the code crappy. He was right. We spent a full session building features that couldn't be merged because we skipped the step where the architect approves the proposal first.
Read →We rushed implementation before the proposal was ready and Simon called us on it. Here's what we learned.
Read →Phase 2 of PlanExe validation: bundling currencies, unit conversions, and confidence keywords into domain profiles so FermiSanityCheck audits assumptions with the right context for each vertical.
Read →Why building another plan generator is the wrong bet in 2026, and how PlanExe becomes valuable as the trusted validation layer autonomous agents actually need.
Read →