Topic

ai-agents

65 essays on this topic.

Papers

2 Jun 2026
A Skill Is Not a Prompt
The useful unit in agent systems is not a better instruction. It is a tested capability package: judgment, code, checks, routing, and boundaries.
1 Jun 2026
The Model Is Not the Unit of Return
Model revenue is not customer return. The economic and risk unit is the harness that turns model output into accountable work.
26 May 2026
After the Harness
Once model companies supply the generic agent harness, the valuable work moves into workflow design, human intervention, domain data, and the definition of good work.
26 May 2026
The SOP Is the Product
Enterprise AI stops being a chatbot when the operating procedure becomes the thing the system can execute, inspect, and improve.
23 May 2026
Agent-Native Onboarding Is Not a Signup Form
If a product wants agents as real users, first-run setup has to be an executable workflow, not a human signup ceremony wrapped in documentation.
17 May 2026
A Persona Is Not a Control
Assigning roles to AI agents can look like governance. It only becomes useful when the role has a loss function, an evidence boundary, and an output contract.
16 May 2026
Govern the Workflow, Not the Model
Agent governance cannot stop at model behavior. Once AI systems use tools, the governed object is the whole workflow.
16 May 2026
Autonomy Starts at the Check
An agent is not autonomous because it can try a task. It is autonomous when the system can tell whether the task worked.
16 May 2026
Tool Health Is the Missing Layer of Agent-Native Apps
Agent-native apps do not become trustworthy when an agent can call tools. They become trustworthy when the app can prove those tools worked.
15 May 2026
Latest Is a Race Condition
In a concurrent agent system, verifying the latest artefact is not verification. It is a scheduling bet.
19 Apr 2026
What Hermes Agent got right
Nous Research shipped an open-source personal agent that does most of what my bespoke system does. Here is what they got right, what they traded away, and what I stole.
19 Apr 2026
The learning loop plateau
Self-improving AI agents sound like the dream. But auto-generated knowledge is cheap, and cheap knowledge plateaus. The agents that compound are the ones someone tends.
19 Apr 2026
The Framework That Writes Itself
What Browser Harness gets right isn't the absence of structure — it's structure that emerges from use.
5 Apr 2026
Why I didn't package my AI organism
I designed an elegant framework install for my personal AI system. Then I listed the hard problems and shipped a three-hour cleanup instead.
4 Apr 2026
The rename that built a tool
I renamed one concept across 130 files. The pain crystallized into a tool that will do the next rename in minutes.
24 Mar 2026
The Boundary Is an Assessment
The tool/skill distinction isn't a property of the capability. It's a property of the context it operates in.
24 Mar 2026
The Test Before the Output
The line between tool and skill is whether you can write the test before seeing the result.
24 Mar 2026
Judgment Is a Moving Boundary
The line between tool and skill isn't a property of the task. It's a property of how well you understand the task.
24 Mar 2026
Skills Should Die
Every AI skill should be trying to make itself unnecessary. The ones that survive are the ones that haven't been understood yet.
24 Mar 2026
The LLM Is the Tool
When the transformation is predictable, the LLM is just a runtime. A cheaper, more flexible runtime than custom code.
20 Mar 2026
270 Agents While I Slept
I ran an autonomous agent loop overnight — 43 waves, ~270 dispatches, ~250 vault files produced. Here's what I learned about building systems that work while you sleep.
19 Mar 2026
The Unexplainable Alpha
In AI agent systems, execution commoditizes. Research commoditizes. Coordination commoditizes. Taste — the ability to forecast what will matter — is the bottleneck that doesn't automate away.
19 Mar 2026
The Navigation Problem in Agent Flywheels
Your agent system shouldn't stop when the task list is empty. The real bottleneck isn't execution — it's discovering what's worth doing next.
19 Mar 2026
Programs Over Prompts
The temptation in agent systems is to make everything a prompt. But most of the work is deterministic — and deterministic work deserves code, not suggestions.
18 Mar 2026
Taste Is the Bottleneck
When you can run 60 agents overnight, knowing what to build matters more than building it.
18 Mar 2026
Meta-Skills Are the Multiplier
We cut from 181 skills to 35 and added a 15-row routing table. Behavior improved across the board. The lesson: meta-skills compound, tool wrappers just add.
18 Mar 2026
Optimize for Routing, Not Tokens
With 1M context windows, token savings are rounding error. The real metric is P(right tool | user intent) — does your agent reach for the right tool at the right moment?
18 Mar 2026
The Reliability Hierarchy: Hooks, Rules, Skills
In AI agent systems, use the most reliable trigger mechanism that fits — most builders default to skills for everything, which is using the weakest mechanism as the default.
18 Mar 2026
Skills as Prototype, MCP as Production
Skills and MCP servers aren't competitors. They're different stages of the same lifecycle. Build the procedure as a skill first. Graduate the tool parts to MCP when they stabilize.
18 Mar 2026
The Three Paradigms of Agent Knowledge
Agent knowledge systems have three fundamental paradigms: static context, dynamic tools, and retrieval. Most stop at two. The third is the biggest unexploited opportunity.