The case for dropping natural language

When two agents coordinate by exchanging natural-language messages, they pay a tax. Language is lossy and slow: a model compresses its internal state into tokens, the other model re-expands those tokens into its own state, and information is lost at both ends. A 2026 paper, Recursive Multi-Agent Systems, proposes exactly that: agents collaborate entirely in latent space through a lightweight cross-agent link, decoding to text only after the final recursion round — and report token-usage reductions of 35 to 76 percent against text-passing baselines. The promise is faster, more capable multi-agent systems that are not bottlenecked by the inefficiency of human language.

As an efficiency argument, it is sound. Language is an abstraction designed for humans, and there is no law that says agents must use it to talk to each other.

The performance gain is not the interesting part. The interesting part is what we lose when agents stop coordinating in a language we can read.

Natural language is an accidental governance layer

Today, almost everything we use to oversee multi-agent systems depends on the fact that agents communicate in readable text. We inspect prompts. We read tool calls. We follow plans, agent discussions, reasoning traces, and approval steps. Auditing, debugging, and verification all lean on the legibility of agent communication.

None of that legibility was designed as governance. It is a side effect of agents happening to think out loud in English. But it functions as governance: it is the surface on which we currently catch mistakes, trace decisions, and confirm that an agent did what it claimed. Latent-space communication removes that surface. The agents become more capable and the system becomes less observable at the same time.

PropertyNatural-language coordinationLatent-space coordination
SpeedSlowerFaster
Inspectable by humansYesNo
Auditable traceReadable textOpaque vectors
Where we catch errors todayThe messagesGone
What still governs the systemThe visible exchangeThe change the agents make

Why observability gets harder

The instinct, when communication goes dark, is to add observability: log the latent exchanges, build tools to interpret them. But interpreting latent vectors is not the same as reading a sentence. You can record the traffic without being able to understand it, and a recorded exchange you cannot interpret is not an audit trail — it is storage. Observability was never the same as governance, and latent communication widens the gap: even if you can see that agents communicated, you may not be able to tell what they decided.

The answer is to govern the change, not the conversation

If the conversation becomes illegible, governance cannot depend on reading it. It has to attach to the thing that remains concrete: the change the agents actually make to the system. Whatever an agent fleet decides in latent space, it eventually produces a diff, a schema change, a configuration edit — an action with consequences. That action can be checked against architectural decisions deterministically, regardless of how the agents talked about it.

Future agent architectures will likely need explicit translation and governance layers between latent coordination and consequential action — points where the system re-surfaces what is about to happen and verifies it against the rules. The governable boundary moves from the message to the act.

If agents stop talking in a language we can read, governance cannot live in the transcript. It has to live at the boundary where latent intent becomes a real change — and verify the change.