Topic

ai

187 essays on this topic.

Papers

13 Mar 2026
The Bootstrap Problem in AI Tooling
You need the tool to build the tool. The answer is: build the dumb version first, use it once, then have it build its replacement.
13 Mar 2026
Why Nobody Builds Cross-Vendor AI Orchestration
Every AI lab builds single-vendor orchestration. The cross-vendor layer is a gap — and it's a gap for a reason.
13 Mar 2026
The Orchestration Layer Is Knowledge, Not Code
Multi-agent AI orchestration frameworks are commodity. The competitive advantage is knowing which agent to use when, what breaks, and how to recover.
13 Mar 2026
Is Insight an Illusion
When pattern-matching feels like wisdom, what are we actually experiencing?
13 Mar 2026
The Fluency Trap
When AI conversations feel insightful because the language model is good at producing insight-shaped text
13 Mar 2026
Why Nobody Benchmarks Memory
The things that matter most in production are the things that get benchmarked least
13 Mar 2026
The Byproduct Trap
When the paper becomes more interesting than the answer you set out to find
13 Mar 2026
AI Agents Need Notebooks, Not Just Memories
The missing layer in enterprise AI isn't smarter models — it's structured memory that humans can actually review.
12 Mar 2026
Cross-Cutting Is Just Another Word for Optional
In AI agent architecture, calling something a 'cross-cutting concern' without naming an owner and a gate is just a polite way of saying nobody owns it.
12 Mar 2026
The second pass finds more
When red-teaming a document with multiple AI models, the second review — run on the edited version — consistently finds more than the first. Here's why, and what it means for how many rounds to run.
11 Mar 2026
RAG Solved the Wrong Problem
The retrieval pipeline was built for systems that couldn't reason about their own information needs. Agents can.
11 Mar 2026
This Year's DeepSeek
An open-source AI agent framework became the fastest-growing project in GitHub history — mostly in China. The pattern is the same as last year. So is the security panic.
11 Mar 2026
The Accidental Life OS
I spent an afternoon researching AI tools for personal life management. The conclusion was that I should stop looking.
11 Mar 2026
The $1 Billion Bet Against LLMs
One of the architects of modern deep learning just raised $1B on the thesis that token prediction can't reach real reasoning. Here's what he's proposing instead — and why it matters even if he's wrong.
11 Mar 2026
The First Datapoint
An AI agent ran unsupervised for two days and found twenty improvements to another model's training. Not an AGI claim. A rate claim.
11 Mar 2026
From Chatbots to Event Loops
The shift from agents you summon to agents that watch. Enterprise AI workflows are becoming continuous loops — and the failure modes are different.
11 Mar 2026
What MCP Actually Changes for Enterprise AI
Not better function calling — decoupling. When tools expose MCP servers, any agent can compose any system freely. The heterogeneity problem becomes a configuration problem.
11 Mar 2026
Language Is the Medium, Not the Purpose
We called them language models and spent years confused about why they could reason. The name stuck to the interface, not the mechanism.
11 Mar 2026
LLMs Are Better at Editing Than Writing
Ask an AI to write from scratch and you get the average of the corpus. Give it something rough and it amplifies what's already there. The workflow implications are significant.
11 Mar 2026
What It Actually Feels Like to Use AI for 80% of Your Work
Not productivity. Something stranger — the cognitive texture of days when the bottleneck shifts from execution to articulation.
10 Mar 2026
The Calibration Trap
The comfort trap is about effort. This one is about epistemics — and it's harder to see.
10 Mar 2026
The Comfort Trap
The right test for any AI interaction isn't 'did it help me?' but 'am I more capable after it?'
10 Mar 2026
The Personalised System Era
AI coding agents didn't just make developers faster. They changed who gets to have a bespoke system.
10 Mar 2026
Let the OS Schedule, Let Your Tool Dispatch
The moment I stopped building scheduling into my tools, everything got simpler.
9 Mar 2026
Benchmark Your Research Stack
Running 10 real queries through 5 tools revealed that theoretical routing rules have systematic gaps — and the surprises were more useful than the confirmations.
9 Mar 2026
Eliminate the Reminder, Don't Schedule It
When you catch yourself setting a reminder to check something later, that's usually a signal that a tool is failing to report what it should.
9 Mar 2026
You Are the Bottleneck in Your Own Agentic Workflow
Adding more AI tools doesn't help if you're still the bus between them.
8 Mar 2026
Where Rules Live
The difference between a rule that works and a rule that doesn't is usually not the content of the rule — it's where it lives.
8 Mar 2026
When Better Is Worse
Upgrading to a more capable model made my tool sixty times slower. The lesson isn't about models — it's about the difference between capability and fit.
8 Mar 2026
The Experiment Loop Without the GPU
Andrej Karpathy's autoresearch project is being read as a demo of what H100s can do overnight. It's actually a discipline for doing rigorous work on anything measurable.