skip to content
Topic

ai-agents

65 essays on this topic.

  1. Match Form to Access Pattern

    The governing principle for structuring knowledge in AI agent systems isn't 'always atomic' — it's matching how knowledge is stored to how it's accessed.

  2. Play Within the Design

    Every AI coding platform has mechanisms designed for specific purposes. Using them as intended beats clever hacks — and the reason is deeper than cleanliness.

  3. Stealing from Peers: A Truth-Seeking Discipline

    Most people scan competitors for positioning. I scan them for transferable patterns — and route each steal to every domain it applies to.

  4. The Boring Future of AI Agents

    The real arrival of AI agents isn't spectacular. It's when you stop noticing.

  5. The Treadmill and the Loop

    Getting ahead of AI best practices is a treadmill. The durable skill is testing assumptions faster than they expire.

  6. Personas Exploit a Blind Spot in LLM-as-Judge Evaluation

    Persona prompting generates the exact type of hallucination that automated LLM judges reward as 'depth.' Two experiments, blind evaluation, and a fact-check that flipped the finding.

  7. The Persona Paradox in AI Agent Teams

    Personas hurt for structured tasks, help for judgment-heavy tasks. Two experiments, blind evaluation, frontier models. The distinction is task-dependent, not binary.

  8. The Debate Round Is Where Value Lives

    Independent parallel reviews produce overlapping findings. The cross-critique round produces resolution. That's where multi-agent value actually emerges.

  9. Planning Needs Eyes

    A 3-pass AI planning pipeline caught 0 out of 6 design issues. The same planning done in-session with tool access caught 2.5. Planning isn't a prompt problem — it's a tools problem.

  10. Put the Rule Where It Fires

    Documenting a rule is half a loop. The rule only works when it fires at the moment of decision — not when it sits in a file nobody reads.

  11. What Human Memory Teaches AI Agents (and What It Doesn't)

    A calculator doesn't simulate forgetting — it manages its context budget. What to cherry-pick from cognitive science for AI agent memory, and what to leave behind.

  12. The MTEB Leader Barely Beats a Free Model on Agent Memory

    I benchmarked 10 memory backends and multiple embedding models on actual agent memory retrieval. The results challenge common assumptions about what matters.

  13. The Agent Governance Gap Is Already Here

    Agentic AI isn't a future governance problem — it arrived ungoverned, and this week saw the first enforcement action.

  14. Your Agent Pays the Cold-Start Tax Every Morning

    Agent memory isn't knowledge management. It's performance infrastructure — and the gap between a stateless agent and one that accumulates context is measurable.

  15. Progressive Trust: How to Give AI Agents Autonomy Without Gambling

    The debate about AI agent autonomy is wrong. It's not a binary choice — it's a graduated trust system with observability.

  16. This Year's DeepSeek

    An open-source AI agent framework became the fastest-growing project in GitHub history — mostly in China. The pattern is the same as last year. So is the security panic.

  17. Enterprise AI Has a Plumbing Problem, Not a Model Problem

    Most enterprises are optimising the wrong variable. The gap between 5% and 40% agent adoption won't be closed by better models.

  18. From Chatbots to Event Loops

    The shift from agents you summon to agents that watch. Enterprise AI workflows are becoming continuous loops — and the failure modes are different.

  19. What MCP Actually Changes for Enterprise AI

    Not better function calling — decoupling. When tools expose MCP servers, any agent can compose any system freely. The heterogeneity problem becomes a configuration problem.

  20. Language Is the Medium, Not the Purpose

    We called them language models and spent years confused about why they could reason. The name stuck to the interface, not the mechanism.

  21. When Intelligence Becomes Infrastructure

    What changes when LLMs stop being the special thing and become just another software component? The answer is: everything about how you build.

  22. Expansion, Not Speedup

    The real ROI of AI coding isn't doing the same work faster. It's doing work that wasn't worth doing before.

  23. Skills as Files

    The simplest agent architecture might already be the right one: give the agent a file explaining how to do something, and let it read when needed.

  24. The Trust Spectrum

    Peter Steinberger stopped reviewing AI-generated code entirely. That works for indie software. In regulated environments, it can't. Here's how to think about where you sit.

  25. Traces Are the New Debugger

    When behaviour emerges from both code and model responses, reading source files isn't enough. You debug by examining execution traces.

  26. Rules Decay, Hooks Don't

    The difference between writing down a rule and making the system enforce it — illustrated by a 15-line hook.

  27. Agentic Engineering: Why Less Is More

    Tool enthusiasm is often net-negative. Context pollution degrades performance faster than features improve it. The principles that actually work.

  28. CLIs Enforce Structure and Save Tokens — Not Just Discipline

    The instinct to add a rule to a skill file is usually the wrong abstraction. A CLI wrapper enforces at the tool level: zero deliberation, zero token cost.

  29. Software Engineering Principles for AI Instruction Files

    LLM instruction files are code. They have the same failure modes — with one interesting twist that changes everything.

  30. Agent-First CLI Design: TTY Detection as Philosophy

    The primary user of my CLI tools isn't me anymore. Designing for that changes everything about how output should work.