Topic

ai-agents

65 essays on this topic.

Papers

18 Mar 2026
Match Form to Access Pattern
The governing principle for structuring knowledge in AI agent systems isn't 'always atomic' — it's matching how knowledge is stored to how it's accessed.
18 Mar 2026
Play Within the Design
Every AI coding platform has mechanisms designed for specific purposes. Using them as intended beats clever hacks — and the reason is deeper than cleanliness.
18 Mar 2026
Stealing from Peers: A Truth-Seeking Discipline
Most people scan competitors for positioning. I scan them for transferable patterns — and route each steal to every domain it applies to.
18 Mar 2026
The Boring Future of AI Agents
The real arrival of AI agents isn't spectacular. It's when you stop noticing.
17 Mar 2026
The Treadmill and the Loop
Getting ahead of AI best practices is a treadmill. The durable skill is testing assumptions faster than they expire.
17 Mar 2026
Personas Exploit a Blind Spot in LLM-as-Judge Evaluation
Persona prompting generates the exact type of hallucination that automated LLM judges reward as 'depth.' Two experiments, blind evaluation, and a fact-check that flipped the finding.
16 Mar 2026
The Persona Paradox in AI Agent Teams
Personas hurt for structured tasks, help for judgment-heavy tasks. Two experiments, blind evaluation, frontier models. The distinction is task-dependent, not binary.
15 Mar 2026
The Debate Round Is Where Value Lives
Independent parallel reviews produce overlapping findings. The cross-critique round produces resolution. That's where multi-agent value actually emerges.
14 Mar 2026
Planning Needs Eyes
A 3-pass AI planning pipeline caught 0 out of 6 design issues. The same planning done in-session with tool access caught 2.5. Planning isn't a prompt problem — it's a tools problem.
14 Mar 2026
Put the Rule Where It Fires
Documenting a rule is half a loop. The rule only works when it fires at the moment of decision — not when it sits in a file nobody reads.
14 Mar 2026
What Human Memory Teaches AI Agents (and What It Doesn't)
A calculator doesn't simulate forgetting — it manages its context budget. What to cherry-pick from cognitive science for AI agent memory, and what to leave behind.
14 Mar 2026
The MTEB Leader Barely Beats a Free Model on Agent Memory
I benchmarked 10 memory backends and multiple embedding models on actual agent memory retrieval. The results challenge common assumptions about what matters.
14 Mar 2026
The Agent Governance Gap Is Already Here
Agentic AI isn't a future governance problem — it arrived ungoverned, and this week saw the first enforcement action.
14 Mar 2026
Your Agent Pays the Cold-Start Tax Every Morning
Agent memory isn't knowledge management. It's performance infrastructure — and the gap between a stateless agent and one that accumulates context is measurable.
14 Mar 2026
Progressive Trust: How to Give AI Agents Autonomy Without Gambling
The debate about AI agent autonomy is wrong. It's not a binary choice — it's a graduated trust system with observability.
11 Mar 2026
This Year's DeepSeek
An open-source AI agent framework became the fastest-growing project in GitHub history — mostly in China. The pattern is the same as last year. So is the security panic.
11 Mar 2026
Enterprise AI Has a Plumbing Problem, Not a Model Problem
Most enterprises are optimising the wrong variable. The gap between 5% and 40% agent adoption won't be closed by better models.
11 Mar 2026
From Chatbots to Event Loops
The shift from agents you summon to agents that watch. Enterprise AI workflows are becoming continuous loops — and the failure modes are different.
11 Mar 2026
What MCP Actually Changes for Enterprise AI
Not better function calling — decoupling. When tools expose MCP servers, any agent can compose any system freely. The heterogeneity problem becomes a configuration problem.
11 Mar 2026
Language Is the Medium, Not the Purpose
We called them language models and spent years confused about why they could reason. The name stuck to the interface, not the mechanism.
7 Mar 2026
When Intelligence Becomes Infrastructure
What changes when LLMs stop being the special thing and become just another software component? The answer is: everything about how you build.
7 Mar 2026
Expansion, Not Speedup
The real ROI of AI coding isn't doing the same work faster. It's doing work that wasn't worth doing before.
7 Mar 2026
Skills as Files
The simplest agent architecture might already be the right one: give the agent a file explaining how to do something, and let it read when needed.
7 Mar 2026
The Trust Spectrum
Peter Steinberger stopped reviewing AI-generated code entirely. That works for indie software. In regulated environments, it can't. Here's how to think about where you sit.
7 Mar 2026
Traces Are the New Debugger
When behaviour emerges from both code and model responses, reading source files isn't enough. You debug by examining execution traces.
7 Mar 2026
Rules Decay, Hooks Don't
The difference between writing down a rule and making the system enforce it — illustrated by a 15-line hook.
6 Mar 2026
Agentic Engineering: Why Less Is More
Tool enthusiasm is often net-negative. Context pollution degrades performance faster than features improve it. The principles that actually work.
6 Mar 2026
CLIs Enforce Structure and Save Tokens — Not Just Discipline
The instinct to add a rule to a skill file is usually the wrong abstraction. A CLI wrapper enforces at the tool level: zero deliberation, zero token cost.
6 Mar 2026
Software Engineering Principles for AI Instruction Files
LLM instruction files are code. They have the same failure modes — with one interesting twist that changes everything.
6 Mar 2026
Agent-First CLI Design: TTY Detection as Philosophy
The primary user of my CLI tools isn't me anymore. Designing for that changes everything about how output should work.