designtheagent.com · a planning canvas & coach
Most agentic AI pilots are lost before a line of code is written, in the assumptions nobody put on paper.
Agentic AI design is org design. A one-page canvas for the nine decisions you make before you build, and a coach that reviews it before you commit.
The diagnosis
The cause is not in the technology. An un-designed agent is not un-designed; it is designed by accident, in code, by whoever shipped first. The briefing nobody wrote. The boundaries nobody drew. The oversight nobody owns. A person you hire restrains themselves, remembers, and can be held to account; an agent brings none of it. Every design decision you skip to close that gap is a debt, and model capability is the interest rate.
Capable, articulate, probabilistic. It generates the next plausible word, brilliant the way a new hire is brilliant on day one.
The part you control. Five things, each already familiar from managing people:
A web search, a database query, an email sent. The worker issues a command, the harness catches it, the system executes. An LLM in a loop, with tools.
The Agent Operating Model
Three layers sit inside every agentic system, and each supplies one term of the same equation. The worker brings cognition. The harness imposes control. The tools grant reach. Most failures trace to the same blind spot: a worker chosen with care, wrapped in a harness and a governance regime that were never designed at all.
power = cognition × control × reach
It is a product, not a sum. A brilliant worker with wide reach and no control isn't powerful; it is a liability. Add a calendar and the worker becomes a scheduler. Give it the authority to move money and it becomes a fiduciary actor. The more reach you grant, the more control the harness has to hold.
The canvas
One page, three colour groups, nine cells: the Agent Operating Model turned into choices you make on paper, before anyone writes a line of code. This is the canvas itself. Start with Purpose. Always.
What business outcome must this agent ultimately achieve?
Which workflow will it improve, and which metrics will show it worked?
01Who uses it directly, and who is affected by what it does?
02What accuracy, explainability, speed, and cost standards must it meet?
04What role should it play, and where should its autonomy stop? (assist, advise, draft, decide, execute)
03What information is available to support its work, and which sources should it trust (e.g. policies, cases, records, reports, communications)?
05Which tools or systems can it read from, write to, or act through (e.g. email, calendar, file storage, databases, enterprise systems, APIs)?
06What must it always do, and what must it never do, even when asked?
07What should it remember, what must it forget, and how will it improve over time?
08Who oversees it, when must it hand off to a person, and who owns the outcome?
09Want to run it in the room? Download the A1 canvas →
Kirin Holdings gave its executive committee a thirteenth participant: CoreMate, an AI adviser that stress-tests proposals before the room decides. See the whole agent on one canvas, reconstructed from public sources and run through the coach.
Two canvases, two outcomes
An un-designed agent is designed by accident. A designed one is designed on purpose. The answers ran in opposite directions.

The compass set to the wrong north.
A little over a year later, it narrowed what the agent was allowed to own and hired people back for the conversations that needed them.
Bloomberg · May 2025 ↗
Every cell filled.
A&O Shearman cut contract-review time sharply, then co-developed with Harvey and redesigned its business model around the agent.
A&O Shearman · April 2025 ↗The canvas coach
Fill the nine cells, then let a Claude-backed coach press the gaps a good workshop facilitator would: per cell, plus cross-cell contradictions and an overall readiness read.
How to run it
The canvas is built to be worked, not read. Answer its nine questions in half a day now, or discover them the hard way after you ship. In real cases, that has meant months of rework, a public reversal, or an apology to a federal judge.
Put it on an A1 sheet. Gather product, engineering, design, legal or risk, and someone who does the work the agent will touch.
Anchor on the workflow and the people it serves before anyone discusses what the AI does. Teams that start in Capability design a clever agent in search of a job.
Place the agent on the autonomy spectrum, then decide which tools, knowledge and rules that rung requires. Autonomy is the pivot.
What must never happen? What persists, what is forgotten? Who owns the outcome? The cells teams skip are where the failures cluster.
The empty cells and the contradictions are the deliverable. Surface them now, in a room, for the price of an afternoon.
For the first time, you have to write it all down before the worker shows up. And the better the model becomes, the more that control is worth. Start on the canvas.