Three lessons the human mind offers agent architecture

Several hard problems in agent design rhyme with how the human mind works: focus over clutter, knowing when a job is done, and separating the layers of memory. The resemblances are a good guide for design.

Some of the hard problems in agent design bear a striking resemblance to how the human mind works. This resemblance doesn’t necessarily mean these solutions were drawn from neuroscience; rather, it shows that good engineering solutions often arrive at the same patterns the human mind reached long ago. Let’s look at three of them.

Focus, not clutter

The human mind doesn’t hold all of the day’s concerns in working memory at once; at any moment it focuses only on what’s relevant to the task at hand. Agents should do the same. When an agent has access to everything, two problems arise: processing cost goes up, and — more importantly — irrelevant context entering the picture makes the model hallucinate and err.

The solution is to scope the context to the user’s intent: first detect what the user wants, then give each component only the slice of information needed for that intent. A component whose job is distributing tasks doesn’t need the conversation history; it needs only the task list. This is, in effect, the principle of “least privilege” applied to knowledge: each component receives only what its job requires. The result is a system that is both cheaper and more accurate.

Knowing when the job is done

A person usually has a clear sense of whether they’ve finished a task — a simple self-awareness of where the work stands. A capable agent system needs the same: a component whose only job is to ask, “Has all the work really been completed?”

The subtle point is that this judgement doesn’t need the intelligence of a language model. “Are all the tasks done?” is a matter of arithmetic, not reasoning; you just count the completed tasks. Handing this to a language model is both expensive and needlessly fragile. It’s better for this completion monitor to be deterministic and predictable. Separating the responsibility of “detecting the end of the work” into an independent, deterministic component makes the system both simpler and more robust.

Separating the layers of memory

A human doesn’t have one memory; they have several, with different lifespans and functions: what’s in mind right now is not the same as what we recall from years ago. Agents benefit from the same separation.

In agent design, separate at least two kinds of memory. First, coordination memory: temporary data that only makes sense during a single request — the answer to “how are we working together right now?” This data is transient and is cleared when the work ends. What’s cleared, though, is only the request-local working context — the live scratchpad of how the parts are cooperating in that moment — not the record of what happened and why; that account of the reasoning is kept in durable memory. Second, domain memory: what we know about the user and the conversation, which must persist over time. This separation has a big advantage: even if an error occurs midway through a task, domain memory is untouched and accumulated knowledge isn’t lost. Set each datum’s lifespan by its role, not with a single rule for everything.

The resemblances aren’t accidental

These three patterns — context focus, the completion monitor, and multi-layer memory — are each engineering answers to specific problems. But their resemblance to familiar patterns of human cognition isn’t accidental: when you’re dealing with limited attention, the chance of error, and the need for durability, good solutions converge. The human mind is not a bad roadmap for designing agents.