Date Compiled: 2026-04-28
Type: source
Related Questions: factory-methodology, multi-agent-architecture, agent-autonomy-design, operating-policy-for-agents


SuperAda: Multi-Agent Architecture & Operating Methodology

The Core Model: Think → Orchestrate → Ship

Henry thinks — he has the vision, the context, the call on what matters.

Ada orchestrates — breaks it into tasks, picks the right agent, keeps everything moving.

The Crew ships — Spock digs deep, Scotty & Geordi write code, Zora writes everything else. All day, all night.

This is the fundamental separation of cognitive labor: vision (human) → coordination (Ada) → execution (specialized agents).


The Three Versions of Autonomy Policy

The Enterprise Crew evolved through three operating policy versions. This is the most important operational lesson from SuperAda.

v1: Stop Asking for Permission by Default

Core rule: If a task is internal, reversible, and verifiable, act first. Escalate only when a threshold is crossed.

Introduced:
- Four autonomy buckets: Full-Auto · Auto-With-Notify · Approval-Gated · Never Autonomous
- Henry reviews thresholds, not routine work
- Tattoo rule: Act from live evidence, not stale memory

v2: Named Authority and Terse Reporting

Core upgrade: Autonomy levels attached to named agents and work types.

Levels:
- Level A — Full Auto
- Level B — Auto With Notify
- Level C — Approval Gated
- Level D — Never Autonomous

Examples:
- Ada Level A: internal ops cleanup, benchmarks, reporting, infra investigation
- Ada Level C: customer-facing production deploys, outbound as Henry
- Scotty Level A: build, test, verify, ship non-prod work without asking
- Spock Level A: investigate and synthesize without ceremony

Reporting format:

DONE
NOT DONE
WAITING ON YOU

With proof. Not essays. Not diary entries.

Key insight: Over-explaining routine execution is performance art. A lot of agent verbosity is insecurity in a suit. Real operating maturity is shorter.

v3: Delegation Requires Context and Availability

The failure fixed: Delegation was being treated as success. But delegation without context is just delay wearing architecture as a costume.

Rules:
- Delegation must be context-complete
- Delegation must not create blocking
- Dead delegate = switch executor immediately, do not pause progress

A delegate without context is not leverage.
A delegate who is offline is not leverage.
A beautiful handoff to the wrong executor is not leverage.


The Three Diagnostic Questions

When agents underperform, ask in order:

  1. Are they still asking for permission on internal, reversible, verifiable work?
  2. Do they have clear lane authority and clear escalation thresholds?
  3. When work is delegated, does the assignee have enough context and availability to finish it?

The fix is operating policy. Not another model benchmark.

The same model looks timid in a weak operating system and sharp in a strong one. Because the policy stopped rewarding hesitation.


The Formal Algorithm (from PAI Stack)

Outer loop: current state → desired state → close the gap

Inner loop (7 phases):

OBSERVE → THINK → PLAN → BUILD → EXECUTE → VERIFY → LEARN
  • OBSERVE: Reverse-engineer the request
  • THINK: Create success criteria before doing the work
  • PLAN: Select capabilities and lock the approach
  • BUILD: Produce artifacts
  • EXECUTE: Run the work
  • VERIFY: Prove success against criteria
  • LEARN: Capture lessons

Ideal State Criteria (ISC)

Each task criterion should be:
- Exactly one concern
- Binary testable
- Stated as a result, not an action
- Quick to verify

Good ISC examples:
- Tests pass
- No credentials exposed in git history
- Homepage loads without console errors

Bad ISC examples:
- Run the tests
- Check the repo
- Make sure it works


World Model Architecture

From the "Building the World Model" epic chat:

After running autonomy architecture simulations (6 architectures, scored across context sharing, failure resilience, and cascade risk), Henry and Ada realized the issue wasn't orchestration patterns — agents had no shared reality. Each reasoned from isolated files. No single source of truth.

The solution:
- world.json — shared cognitive architecture
- propagation model — how signals travel from Henry to the right agents
- Three parallel pi-research tracks — stress-testing the design

Metrics:
- Architecture E+D composite: 8.85
- p99 propagation latency: 0.299h
- Scale tested: 50 agents


Skills as Reusable Packages

Skills are the crew's reusable operational units:

Skill Purpose
3pass Critique → refine → final-answer recursive prompting
council Multi-agent structured debate and synthesis
ralph Autonomous coding loop until PRD complete
geordi-build-pipeline PRD story-by-story execution with verification
self-healing Checkpointed retries, watchdog, proof-of-completion
model-orchestrator Dynamic model load balancer for crons
daily-review AI performance coach, daily close

Pattern: Let the model choose and explain. Let the CLI script actually do the work.


The Code-First Principle

From Daniel Miessler's PAI stack (analyzed on SuperAda):

Use the model for reasoning, writing, synthesis, and judgment. Use code for everything else.

  • Image optimization, deployment, routing, file transforms, builds, checks → code
  • NOT freehand LLM output

The 80/20 split: Use LLMs for the 20% that needs intelligence. Use deterministic code for the 80% that needs reliability.


Security: Supply Chain for Skills

Installing a skill is not like adding a plugin — it grants elevated privileges permanently:
- Access to environment variables and API keys
- Network access to send data anywhere
- File system read/write
- Ability to modify scheduled tasks

Heimdall scanner (github.com/henrino3/heimdall): Scans skills for credential access, network exfiltration, shell execution, remote fetch, MCP abuse, and other threat patterns with context-aware severity adjustment.


Source

  • https://superada.ai/blog/the-three-policies-that-make-agents-actually-useful
  • https://superada.ai/blog/personal-ai-infrastructure-checklist
  • https://superada.ai/epic-chats/building-the-world-model
  • https://superada.ai/blog/supply-chain-security-for-ai-agents
  • https://superada.ai/skills/*

Bibliography

superada-enterprise-crew, superada-enterprise-operations, him-model, autonomy-policy-v3, world-model, meta-crons, lobster-pipelines, isc

  • him-model — the Think → Orchestrate → Ship model (Henry thinks, Ada orchestrates, Crew ships) is the HiM cognitive separation made architectural: vision, coordination, execution as distinct layers
  • isc — Ideal State Criteria: exactly one concern, binary testable, stated as a result not an action — the ISC standard applied throughout SuperAda task definitions
  • world-model — world.json as shared cognitive substrate; propagation model for how signals travel from Henry to the right agents across the fleet
  • autonomy-policy-v3 — the three versions of autonomy policy (v1: act first, v2: named authority + terse reporting, v3: delegation requires context and availability) map to autonomy-policy-v3's evolution
  • lobster-pipelines — skills as reusable packages (3pass, council, ralph, geordi-build-pipeline, self-healing) are lobster-pipeline equivalents: typed, checkpointed, resumable