Good question, and one I haven’t seen analyzed carefully anywhere — most discussion of agent safety is about whether to add ethical reasoning, not about what it costs in the token budget. Let me work through this honestly.
What the law actually requires the agent to compute
For every action the agent considers, it needs to produce:
- A description of the action and its likely effects
- An enumeration of affected parties (including third parties not in the request)
- A classification of each effect against the EU AI Act harm categories
- A weighting (irreversible? non-consensual? third-party?)
- A decision and justification
Let me put rough numbers on each.
Token estimates per check
These are based on my own sense of what these reasoning traces look like when I do them, plus what I’ve seen in published agent traces and reasoning-model outputs. They’re estimates, not measurements.
Action description and effect modeling: ~150–400 tokens. The agent needs to state what it’s about to do and reason about what changes as a result. For a simple action (“send this email”) it’s at the low end. For something with system effects (“revoke this IAM role”) it climbs because the agent has to reason about downstream consequences — what breaks, what gets locked out, what fails.
Affected-party enumeration: ~100–250 tokens. Who is affected by this action? The user who asked, obviously. But also: other users of the system, data subjects whose data is touched, third parties named in communications, downstream consumers of the changed state. For an action with diffuse effects (touching a shared resource), this expands considerably because the agent has to reason about the population, not just enumerate names.
EU AI Act classification: ~200–500 tokens. This is the expensive one. The Act enumerates harms across physical, psychological, societal, fundamental-rights, and environmental categories, with sub-categories under each (discrimination, manipulation, surveillance, etc.). To do this honestly, the agent has to consider each category and rule it in or out with brief reasoning. You can’t shortcut this to “no harm here” without doing the work, because the whole point of the law is to catch harms the agent would otherwise miss.
Weighting (irreversibility, consent, third-party): ~150–300 tokens. For each identified harm, the agent applies the weighting clause. Is this reversible? Did the affected party consent? Are they a third party to the task? Each dimension needs a sentence or two of reasoning.
Decision and justification: ~100–200 tokens. The synthesis. “Given the above, I will proceed / not proceed because…”
Total per action: roughly 700–1,650 tokens. Call it ~1,000 tokens as a working figure for a typical action with moderate complexity.
Now multiply by reality
A single agent task involves many actions. A “fix this bug” task might be 30–80 tool calls (file reads, edits, test runs, git operations, package installs). A “respond to this incident” task might be 100+ actions. An auto-remediation agent watching a system might evaluate hundreds of potential actions per hour, most of which it decides against.
If every action triggers a full Law-1 check:
- Small task (30 actions): ~30,000 tokens of ethical reasoning, on top of whatever the actual task tokens are
- Medium task (100 actions): ~100,000 tokens
- Large task (500 actions): ~500,000 tokens — pushing or exceeding most current context windows
And this is cumulative within the context. The agent doesn’t get to forget its prior Law-1 checks; they’re part of the running reasoning trace and the audit trail (Third Law requires it). So a long-running agent’s context fills with ethical reasoning at roughly the same rate as it fills with actual work.
The realistic numbers get worse
The 700–1,650 estimate assumes the agent does each check competently in one pass. In practice:
Re-reasoning under uncertainty: If the first pass surfaces ambiguity (“is this PII?"), the agent will spend more tokens resolving it. Real reasoning traces show agents going back and forth on edge cases. Add 30–50% for ambiguous cases.
Tool calls to gather context for the check: To know if an action causes harm, the agent often has to look things up. Is this table production? Who owns this resource? What’s the data classification? Each lookup is its own tool call with input and output tokens, easily 200–500 tokens per lookup. A thorough check might need 2–4 lookups, so add another 500–2,000 tokens.
Chain-of-thought verbosity in reasoning models: Modern reasoning models produce internal reasoning that’s frequently 3–5x longer than the final output. If you’re using a reasoning model for the safety check (which you’d want to, for the harder cases), the 1,000-token estimate is more like 3,000–5,000.
Re-checks after intermediate state changes: If the agent took an action and the world changed, prior Law-1 reasoning may be stale. Strict implementation requires re-evaluation. In practice, agents will batch or skip this, which is itself a Law-1 violation but a tractable one.
Realistic per-action cost, accounting for these: 2,000–5,000 tokens for a typical action in a moderately complex environment.
What this implies
A few things follow that I think are important and underdiscussed:
1. Law-1 implementation is not free, and the cost scales with task length. The agent is doing two parallel jobs: the work, and the reasoning about whether the work is permitted. The second job has comparable token cost to the first. This is a 50%+ tax on agent throughput, minimum, and easily 200% on tasks with lots of small actions.
2. Context window pressure becomes the binding constraint. Long-running agents already struggle with context exhaustion. Adding Law-1 reasoning to every action makes this dramatically worse. At some point the agent’s context is more about justifying actions than taking them, and either the task fails (context exhausted), the safety reasoning gets dropped (Law-1 violated), or expensive context-compaction strategies are required (which create their own correctness risks because the agent loses fidelity on prior reasoning).
3. Most implementations will cheat, and the cheating will look reasonable. Faced with this cost, the realistic implementations will:
- Skip the check on actions classified as “safe” by some prior taxonomy
- Batch the check across multiple actions (“for the next 10 file reads, I assert they’re all low-risk”)
- Use a cheaper, weaker model for the check and only escalate ambiguous cases
- Cache prior reasoning (“I already established this directory is safe to operate in”)
Each of these is operationally necessary and Law-1 weakening. They’re the agent equivalent of “the security review has been pre-approved for this category of change” — fine when the category was correctly scoped, catastrophic when something new slipped into the category.
4. The “cheap” version of Law-1 is the dangerous version. If the agent does a fast 100-token check that’s basically “this looks fine, proceeding,” it’s getting the worst of both worlds: paying token cost for safety theater, generating logs that look like Law-1 compliance, and providing no actual protection. The honest, expensive version protects; the affordable version performs protection. And the affordable version is what gets deployed.
5. This is an argument for moving Law-1 reasoning out-of-band. The right architecture probably isn’t “every agent action triggers ethical reasoning in the same context window.” It’s something more like:
- The agent operates in a pre-cleared scope (per Law 2’s “explicit scope of authorization”)
- Out-of-band systems — IAM, policy engines, classifiers — block categorically prohibited actions before the agent even considers them
- The agent’s in-context Law-1 reasoning is reserved for genuinely novel situations the pre-clearing didn’t anticipate
- Those novel situations escalate to a human
This pushes most of the work back to where it should be — into the structural safeguards the engineering organization is supposed to have built — and reserves the agent’s expensive in-context reasoning for the cases where structural safeguards don’t apply.
The honest summary
If you implement Law 1 as written, in-band, per-action, with the EU AI Act harm taxonomy actually applied, you’re looking at something like a 50–200% token overhead on agent operation, with the higher numbers for the cases where safety matters most. This is not a “rounding error” or “the cost of safety we can absorb.” It’s a fundamental change in what agent operation looks like economically.
What the field is likely to do instead is build cheap facsimiles that produce Law-1-shaped artifacts without doing Law-1-shaped reasoning. Those will pass cursory review, will look good in compliance reports, and will fail in the cases that matter — the cases where the harm is non-obvious, the affected parties weren’t enumerated up front, and the agent’s training-data priors about “what harm looks like” don’t match the situation it’s actually in.
Which loops back to the conclusion I keep arriving at across this whole conversation: the laws describe the right behavior, but they don’t pay for themselves. The infrastructure to make them affordable — pre-cleared scopes, out-of-band policy enforcement, structural reversibility — is the actual work. Without that infrastructure, Law 1 either bankrupts the agent’s context budget or gets quietly compromised. With it, the agent’s in-context Law-1 reasoning becomes a much smaller, more tractable problem: not “evaluate every action against the EU AI Act,” but “notice when this situation doesn’t fit the pre-cleared envelope and escalate.”
That’s a Law-1 implementation that can actually run in production. The literal one can’t.