skip to content
Writing

Writing

AI controls, agent systems, banking governance, production practice. 404 essays, newest first.

  1. AI Controls Architecture

    Risk teams know risk. The open problem is designing controls for systems that are non-deterministic, probabilistic, and attackable in natural language.

  2. Governing Agents the Way Cells Govern Themselves

    Six cell biology mechanisms that reveal what the networking 'control plane' metaphor misses about governing AI agents.

  3. The Risk Without an Engineering Solution

    Every other agentic AI risk has an engineering answer. Prompt injection doesn't. That changes everything about how you design controls.

2026
  1. The Vocabulary Trap

    Frameworks give you nouns for free. The nouns start thinking for you within a week.

  2. Design Actions, Not Actors

    The word 'agent' makes us think in nouns. The better designs start with verbs.

  3. The Naming Problem

    We called them agents. But the word is doing more harm than we think.

  4. The Marginal Agent

    I deployed twelve AI agents to polish a CV. Five would have been plenty. Here's what the waste taught me about agent team economics.

  5. The Emergence Ladder: From Molecules to Economies

    The larger the system, the less it can be managed and the more it must be emerged. This pattern — from water to ant colonies to AI agents to economies — reveals the design principle for scaling autonomous systems.

  6. AI Agent Teams Are Colonies, Not Companies

    The right organisational metaphor for AI agent teams isn't a company with managers and reports — it's a colony with autonomous workers responding to coordination signals.

  7. Managing AI Agents Like Managing a Team

    The governance patterns for autonomous AI agents are the same ones good managers already use: cadence reviews for normal flow, escalation channels for urgent anomalies, and human judgment only where it has maximum information value.

  8. Cross-Model Review: Why Model Diversity Beats Model Capability

    When AI models review each other's work, independence matters more than intelligence. The same principle that makes external audit valuable makes cross-model review sharper than same-family review.

  9. Stop Theorizing About Your Prompts

    LLMs are the cheapest experimental subjects in history. Why aren't you testing?

  10. Summarisation Is a Test of Comprehension, Not Intelligence

    Good summarisation requires a model of what matters — but it tests compression, not creation

  11. 270 Agents While I Slept

    I ran an autonomous agent loop overnight — 43 waves, ~270 dispatches, ~250 vault files produced. Here's what I learned about building systems that work while you sleep.

  12. The Risk Tiering Gap in Banking AI

    Banks have AI ethics principles. They don't have risk tiering. That's the gap that matters.

  13. The Unexplainable Alpha

    In AI agent systems, execution commoditizes. Research commoditizes. Coordination commoditizes. Taste — the ability to forecast what will matter — is the bottleneck that doesn't automate away.

  14. The Navigation Problem in Agent Flywheels

    Your agent system shouldn't stop when the task list is empty. The real bottleneck isn't execution — it's discovering what's worth doing next.

  15. Division of Labour: Five Categories for Human-AI Work

    Not 'what can AI do?' but 'what should humans do?' A framework with five categories — and the uncomfortable one is the last.

  16. Programs Over Prompts

    The temptation in agent systems is to make everything a prompt. But most of the work is deterministic — and deterministic work deserves code, not suggestions.

  17. Exoskeleton, Not Colleague

    The AI governance conversation is stuck in the wrong frame. The pattern that works isn't autonomous agents — it's exoskeletons. Micro-agents handling narrow tasks, with human judgment at every point that matters.

  18. The One-Cycle-Late Test

    A simple heuristic for deciding how often to review anything: pick the longest interval where being late by one full cycle is still fine.

  19. Your AI Did the Research. You Didn't.

    AI-prepared domain research creates false readiness. The vault says you know five regulatory jurisdictions. You can't name three.

  20. The TODO Intake Gate

    Most TODO systems fail from too many items, not too few. A four-test intake filter for what deserves your attention.

  21. Match the Tool to the Shape

    Not every goal is a flywheel. The most common mistake in personal systems is treating a checklist as something that compounds.

  22. Mining Management Theory for AI Agent Teams

    What Grove, Drucker, Deming, and Weinberg knew about managing humans turns out to apply — with surprising specificity — to orchestrating AI agent teams.

  23. Taste Is the Bottleneck

    When you can run 60 agents overnight, knowing what to build matters more than building it.

  24. Meta-Skills Are the Multiplier

    We cut from 181 skills to 35 and added a 15-row routing table. Behavior improved across the board. The lesson: meta-skills compound, tool wrappers just add.

  25. Optimize for Routing, Not Tokens

    With 1M context windows, token savings are rounding error. The real metric is P(right tool | user intent) — does your agent reach for the right tool at the right moment?

  26. The Reliability Hierarchy: Hooks, Rules, Skills

    In AI agent systems, use the most reliable trigger mechanism that fits — most builders default to skills for everything, which is using the weakest mechanism as the default.

  27. Skills as Prototype, MCP as Production

    Skills and MCP servers aren't competitors. They're different stages of the same lifecycle. Build the procedure as a skill first. Graduate the tool parts to MCP when they stabilize.

  28. The Three Paradigms of Agent Knowledge

    Agent knowledge systems have three fundamental paradigms: static context, dynamic tools, and retrieval. Most stop at two. The third is the biggest unexploited opportunity.

  29. Match Form to Access Pattern

    The governing principle for structuring knowledge in AI agent systems isn't 'always atomic' — it's matching how knowledge is stored to how it's accessed.

  30. Legibility Is the Bottleneck

    An insight in your head is illegible — only you can access it, and only while you remember it. Compound interest requires a ledger.