Posts
- Rules Decay, Hooks Don't
The difference between writing down a rule and making the system enforce it — illustrated by a 15-line hook.
- Agentic Engineering: Why Less Is More
Tool enthusiasm is often net-negative. Context pollution degrades performance faster than features improve it. The principles that actually work.
- I Made the AI Remind Me of My Own Blind Spots
I kept missing things at the end of AI sessions. So I stopped relying on willpower and systematised the nudge instead.
- CLIs Enforce Structure and Save Tokens — Not Just Discipline
Adding a rule to a skill file is usually the wrong abstraction. A CLI wrapper enforces at the tool level: zero deliberation, zero token cost.
- Three AI Governance Blind Spots No Framework Covers
Most AI governance frameworks are technically focused risk checklists. Three structural risks are missing from almost all of them.
- AI Evals: Why Teams Build Metrics Before They've Read a Trace
Most teams build evaluators before reading a single trace. The sequence that actually works is the opposite: observe, categorise, then measure.
- AI Vendor Selection Is Now a Values Decision
OpenAI took the Pentagon contract Anthropic refused. Your AI vendor just became a political statement — and enterprise procurement hasn't caught up.
- Software Engineering Principles for AI Instruction Files
LLM instruction files are code. They have the same failure modes — with one interesting twist that changes everything.
- Backtest vs Operational Validation: The Control You Think You Have
A model control that's never fired in production isn't a control — it's a hypothesis. The gap between backtest and operational validation is invisible until someone asks.
- AI Succeeds, Economy Breaks: The Displacement Loop Nobody Models
The standard AI economic models account for wage effects and retraining timelines. They don't model the feedback loop where successful AI deployment shrinks the customer base that purchases AI-enabled products.
- The Upstream Constraint Pattern
In digital transformation, the bottleneck is almost always upstream of where the pain is felt. Mox is the cleanest case study.
- Why AI Assistants Make Us Dumber (And What Governance Should Do About It)
The cognitive offloading problem is real. The governance response mostly isn't. There's a specific mechanism at work, and it has a specific fix.
- The Kutta Condition of AI: Engineering Ships Before Theory Catches Up
Aircraft flew for decades before anyone could explain why wings worked. AI is in the same position. The engineering is ahead of the theory.
- The Failure Mode of AI Advice Isn't Hallucination
The failure mode of AI advice isn't hallucination. It's that it agrees with you. Here's the architecture that fixes it.
- Agent-First CLI Design: TTY Detection as Philosophy
The primary user of my CLI tools isn't me anymore. Designing for that changes everything about how output should work.
- Building My Own Consulting Toolkit Before Day One
Most consultants arrive at a new firm and learn their tools from colleagues. I tried something different.
- Three Crates Before Lunch
I published three Rust CLI tools to crates.io before noon — none existed at breakfast. The interesting part isn't the speed. It's that the bottleneck moved.
- Taste Requires Stakes
AI can simulate aesthetic judgment with impressive fluency. What it cannot simulate is the consequence of being wrong.
- Banks Have an AX Problem They Don't Know About Yet
Banks are building AI agents to call their APIs. Those APIs weren't designed for agent callers. The mismatch is subtle, consequential, and almost nobody is talking about it.
- When to Build vs. When to Wait: The Recurrence Rule for AI Tooling
Most AI tooling debates are actually recurrence debates. The question isn't whether to build — it's how many times you'll need it.
- What Surprised Me Studying for the GARP Responsible AI in Finance Exam
I expected the hard parts to be the technical sections. They weren't. The governance sections were harder, and more useful.
- The AGI Question Nobody Is Asking Correctly
Sequoia says AGI is here. Dan Shipper says we're not there yet. They're both right — they're measuring different things. The question that actually matters is Sholto Douglas's "nines of reliability."
- Three Things AML AI Models Still Get Wrong in 2026
The models aren't the problem. The operating models are. Three structural failures in AML AI, drawn from years of building these systems inside a bank.
- The AI Job Title Illusion
Two job ads. Same bank. Same week. Same title pattern. Completely different jobs. The AI hiring market has a labelling problem.
- Five Archetypes of AI-Era Business Defensibility
When AI models commoditise, the moat isn't the model. It's the infrastructure AI must flow through but can't replace. Five archetypes of what that looks like.
- Per-Token Pricing Is the 'Megapixels' of AI
We're optimising for the wrong number — and the history of consumer electronics suggests we'll figure this out eventually.
- Skills as Behavioral Nudges: The Lightweight Alternative to Fine-Tuning
We fine-tune models with gradient descent. We nudge agents with skill files. Same goal, radically different cost.
- The Contract Pattern: Hard Gates for AI Agents
AI agents know how to start a task. They don't always know when to stop. The contract pattern is the architectural fix.
- The Real Reason Mox Won (and What It Means for AI Transformation)
Mox didn't win because they hired better designers. They won because they had no legacy to fight. The pattern applies directly to AI transformation.
- AI Governance Category Error: Routing vs. Compliance
Your AI governance framework is a routing spreadsheet pretending to be a compliance programme. Regulators will spot the difference.