Date Compiled: 2026-04-28
Type: concept — factory-methodology
Related Questions: agent-autonomy, delegation-policy, operating-policy-for-agents, multi-agent-governance
Related Articles: him-model, superada-enterprise-crew, superada-multi-agent-architecture, Kelly router, super-ada
Source Attribution: raw/superada/blog-three-policies.md · compiled/sources/superada-multi-agent-architecture.md
Autonomy Policy v3 — Delegation That Actually Works¶
Overview¶
The Autonomy Policy v3 is the third and current iteration of the Enterprise Crew's operating policy for agent decision-making. It evolved from two earlier versions that each fixed a specific failure mode.
The core insight of v3: delegation is not leverage unless the delegate has context and availability.
The Three Versions¶
Version 1: Stop Asking for Permission When the Threshold Is Clear¶
What it fixed: Agents treating every decision as requiring human approval. Routine work was constantly escalated, creating a bottleneck at the human operator.
Core rule: If a task is internal, reversible, and verifiable, act first. Escalate only when a threshold is crossed.
Introduced:
- Four autonomy buckets: Full-Auto · Auto-With-Notify · Approval-Gated · Never Autonomous
- The human reviews thresholds, not routine work
- The "tattoo rule": act from live evidence, not stale memory
The failure mode it solved: Anxious intern behavior — asking for permission on work that didn't need it, slowing everything down.
Version 2: Named Authority and Terse Reporting¶
What it fixed: General autonomy principles with no agent-specific accountability. Everyone knew the rules in theory, but no one knew who had authority over what.
Core upgrade: Autonomy levels attached to specific agents and specific work types.
Levels:
- Level A — Full Auto (act and report concisely)
- Level B — Auto With Notify (act, then notify)
- Level C — Approval Gated (get approval first)
- Level D — Never Autonomous
Examples by agent:
- Ada Level A: internal ops cleanup, benchmarks, reporting, infra investigation
- Ada Level C: customer-facing production deploys, outbound communication as Henry
- Scotty Level A: build, test, verify, ship non-prod work without asking
- Spock Level A: investigate and synthesize without ceremony; binding commitments step up to higher levels
Reporting format (terse):
DONE
NOT DONE
WAITING ON YOU
With proof. Not essays. Not diary entries.
The failure mode it solved: Verbosity as performance art — agents writing lengthy updates for routine work instead of just shipping it.
Version 3: Delegation Requires Context and Availability¶
What it fixed: Delegation itself being treated as success. Sending work to an agent was considered done, even if the agent lacked the context to act or was unavailable to receive it.
The three rules in memory/rules.md:
1. Delegation must be context-complete — the assignee must have everything needed to act without asking follow-up questions
2. Delegation must not create blocking — if the assignee is unavailable, work continues via a different executor, not pauses
3. Dead delegate = switch executor immediately — do not wait for the delegate to come back online
The three core statements:
- A delegate without context is not leverage.
- A delegate who is offline is not leverage.
- A beautiful handoff to the wrong executor is not leverage.
The failure mode it solved: Delegation that looks elegant but creates delay because the assignee lacked context or was simply unavailable.
The Three Diagnostic Questions¶
When agents underperform, the policy says to ask in order:
-
Are they still asking for permission on internal, reversible, verifiable work?
→ If yes, they are operating under v1 or earlier -
Do they have clear lane authority and clear escalation thresholds?
→ If no, they are operating under v1 or v2 without named authority -
When work is delegated, does the assignee have enough context and availability to finish it?
→ If no, they are operating under v2 without v3's context requirement
The Meta-Lesson¶
The same model can look timid, noisy, and dependent in a weak operating system, then look sharp and useful in a strong one. Not because the model changed. Because the policy around it stopped rewarding hesitation.
The fix is operating policy. Not another model benchmark.
The Enterprise Crew runs the same foundation models before and after each policy iteration. The change in output quality comes entirely from the rules governing when agents act, what they escalate, and how they delegate.
Relationship to HiM¶
The autonomy policy is the operating layer of the him-model. HiM provides the cognitive separation (human thinks, Ada orchestrates, crew ships). The autonomy policy defines the rules each layer follows to prevent hesitation, verbosity, and dead delegation from degrading system performance.
Sources: blog-three-policies.md (raw/superada/), superada-multi-agent-architecture.md (compiled/sources/)
Related Articles¶
- superada — the chief-of-staff orchestrator that runs under this policy at full scale
- him-model — the cognitive architecture this policy operates within
- 7-agent-crew-topology — the crew topology this policy governs
Source Attribution¶
raw/superada/blog-three-policies.md · compiled/sources/superada-multi-agent-architecture.md