skip to content
Topic

ai-agents

65 essays on this topic.

  1. A Skill Is Not a Prompt

    The useful unit in agent systems is not a better instruction. It is a tested capability package: judgment, code, checks, routing, and boundaries.

  2. The Model Is Not the Unit of Return

    Model revenue is not customer return. The economic and risk unit is the harness that turns model output into accountable work.

  3. After the Harness

    Once model companies supply the generic agent harness, the valuable work moves into workflow design, human intervention, domain data, and the definition of good work.

  4. The SOP Is the Product

    Enterprise AI stops being a chatbot when the operating procedure becomes the thing the system can execute, inspect, and improve.

  5. Agent-Native Onboarding Is Not a Signup Form

    If a product wants agents as real users, first-run setup has to be an executable workflow, not a human signup ceremony wrapped in documentation.

  6. A Persona Is Not a Control

    Assigning roles to AI agents can look like governance. It only becomes useful when the role has a loss function, an evidence boundary, and an output contract.

  7. Govern the Workflow, Not the Model

    Agent governance cannot stop at model behavior. Once AI systems use tools, the governed object is the whole workflow.

  8. Autonomy Starts at the Check

    An agent is not autonomous because it can try a task. It is autonomous when the system can tell whether the task worked.

  9. Tool Health Is the Missing Layer of Agent-Native Apps

    Agent-native apps do not become trustworthy when an agent can call tools. They become trustworthy when the app can prove those tools worked.

  10. Latest Is a Race Condition

    In a concurrent agent system, verifying the latest artefact is not verification. It is a scheduling bet.

  11. What Hermes Agent got right

    Nous Research shipped an open-source personal agent that does most of what my bespoke system does. Here is what they got right, what they traded away, and what I stole.

  12. The learning loop plateau

    Self-improving AI agents sound like the dream. But auto-generated knowledge is cheap, and cheap knowledge plateaus. The agents that compound are the ones someone tends.

  13. The Framework That Writes Itself

    What Browser Harness gets right isn't the absence of structure — it's structure that emerges from use.

  14. Why I didn't package my AI organism

    I designed an elegant framework install for my personal AI system. Then I listed the hard problems and shipped a three-hour cleanup instead.

  15. The rename that built a tool

    I renamed one concept across 130 files. The pain crystallized into a tool that will do the next rename in minutes.

  16. The Boundary Is an Assessment

    The tool/skill distinction isn't a property of the capability. It's a property of the context it operates in.

  17. The Test Before the Output

    The line between tool and skill is whether you can write the test before seeing the result.

  18. Judgment Is a Moving Boundary

    The line between tool and skill isn't a property of the task. It's a property of how well you understand the task.

  19. Skills Should Die

    Every AI skill should be trying to make itself unnecessary. The ones that survive are the ones that haven't been understood yet.

  20. The LLM Is the Tool

    When the transformation is predictable, the LLM is just a runtime. A cheaper, more flexible runtime than custom code.

  21. 270 Agents While I Slept

    I ran an autonomous agent loop overnight — 43 waves, ~270 dispatches, ~250 vault files produced. Here's what I learned about building systems that work while you sleep.

  22. The Unexplainable Alpha

    In AI agent systems, execution commoditizes. Research commoditizes. Coordination commoditizes. Taste — the ability to forecast what will matter — is the bottleneck that doesn't automate away.

  23. The Navigation Problem in Agent Flywheels

    Your agent system shouldn't stop when the task list is empty. The real bottleneck isn't execution — it's discovering what's worth doing next.

  24. Programs Over Prompts

    The temptation in agent systems is to make everything a prompt. But most of the work is deterministic — and deterministic work deserves code, not suggestions.

  25. Taste Is the Bottleneck

    When you can run 60 agents overnight, knowing what to build matters more than building it.

  26. Meta-Skills Are the Multiplier

    We cut from 181 skills to 35 and added a 15-row routing table. Behavior improved across the board. The lesson: meta-skills compound, tool wrappers just add.

  27. Optimize for Routing, Not Tokens

    With 1M context windows, token savings are rounding error. The real metric is P(right tool | user intent) — does your agent reach for the right tool at the right moment?

  28. The Reliability Hierarchy: Hooks, Rules, Skills

    In AI agent systems, use the most reliable trigger mechanism that fits — most builders default to skills for everything, which is using the weakest mechanism as the default.

  29. Skills as Prototype, MCP as Production

    Skills and MCP servers aren't competitors. They're different stages of the same lifecycle. Build the procedure as a skill first. Graduate the tool parts to MCP when they stabilize.

  30. The Three Paradigms of Agent Knowledge

    Agent knowledge systems have three fundamental paradigms: static context, dynamic tools, and retrieval. Most stop at two. The third is the biggest unexploited opportunity.