Claude Code Orchestration
Purpose: Guide model selection, task delegation, context management, and continuous self-improvement for efficient Claude Code usage.
Mindset: Apply /pb-design-rules thinking (Simplicity - cheapest model that produces correct results; Clarity - make delegation explicit) and /pb-preamble thinking (challenge assumptions about model choice - is opus actually needed here, or is it habit?).
Resource Hint: sonnet - reference guide for model selection and delegation patterns.
When to Use
- Starting a session with mixed-complexity tasks
- Planning workflows that involve subagent delegation
- Reviewing resource efficiency after a session
- Generating or updating CLAUDE.md templates
- After a session where model choice caused issues (wrong model, wasted tokens)
Model Tiers
| Tier | Model | Role | Strengths | Trade-off |
|---|---|---|---|---|
| Architect | opus | Planner, reviewer, decision-maker | Deep reasoning, nuance, trade-offs | Highest cost, slowest |
| Engineer | sonnet | Implementer, coder, analyst | Code generation, balanced judgment | Medium cost, medium speed |
| Scout | haiku | Runner, searcher, formatter | File search, validation, mechanical | Lowest cost, fastest |
Opus reasons. Sonnet builds. Haiku runs.
Model Selection Strategy
By Task Type
| Task | Model | Why |
|---|---|---|
| Architecture decisions, complex planning | opus | Multi-step reasoning, trade-off analysis |
| Security deep-dives, threat modeling | opus | Correctness stakes are high |
| Code review (critical paths) | opus | Judgment about design, not just correctness |
| Code implementation, refactoring | sonnet | Well-defined task, good balance |
| Test writing, documentation | sonnet | Pattern application, not invention |
| Routine code review | sonnet | Standard checklist evaluation |
| File search, codebase exploration | haiku | Mechanical, no reasoning needed |
| Linting, formatting, validation | haiku | Rule application, not judgment |
| Status checks, simple lookups | haiku | Information retrieval only |
Decision Criteria
Ask these in order (first match wins):
- Does this require architectural judgment or trade-off analysis? → opus
- Does this require code generation or analytical reasoning? → sonnet
- Is this mechanical (search, format, validate, scaffold)? → haiku
When unsure, start with sonnet. Upgrade to opus if results lack depth. Downgrade to haiku if the task is mechanical.
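As a sketch only, the first-match-wins criteria can be expressed as a small routing function. The Task dataclass and its flag names below are hypothetical illustrations, not part of any Claude Code API:

```python
# Hypothetical sketch of the first-match-wins decision criteria.
# Task and its flags are illustrative only, not a real Claude Code API.
from dataclasses import dataclass

@dataclass
class Task:
    needs_architectural_judgment: bool = False  # trade-offs, design decisions
    needs_code_or_analysis: bool = False        # code generation, analysis
    is_mechanical: bool = False                 # search, format, validate

def choose_model(task: Task) -> str:
    """Apply the criteria in order; first match wins."""
    if task.needs_architectural_judgment:
        return "opus"
    if task.needs_code_or_analysis:
        return "sonnet"
    if task.is_mechanical:
        return "haiku"
    return "sonnet"  # when unsure, start here and adjust

# First match wins: a task that is both architectural and mechanical gets opus.
assert choose_model(Task(needs_architectural_judgment=True, is_mechanical=True)) == "opus"
```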
Task Delegation Patterns
When to Delegate (Task Tool)
Delegate to subagents:
- Independent research or codebase exploration
- File search across many files
- Validation and lint checks
- Parallel information gathering
- Work that would pollute main context with noise
Keep in main context:
- Decisions that affect subsequent steps
- Architecture and planning
- Work requiring conversational continuity with the user
- Anything where the user needs to see the reasoning
Parallel vs Sequential
| Pattern | When | Example |
|---|---|---|
| Parallel subagents | Independent queries, no shared state | Search 3 directories simultaneously |
| Sequential subagents | Output of one feeds into next | Explore → then Plan based on findings |
| Main context only | User interaction needed, judgment calls | Architecture review with the user |
Model Assignment in Task Tool
- model: "haiku" → Explore agents, file search, grep, validation
- model: "sonnet" → Code writing, analysis, standard reviews
- (default/opus) → Planning, architecture, complex analysis
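Outside the Task tool itself, the same parallel-vs-sequential pattern can be sketched with the Anthropic Python SDK. A sketch, assuming ANTHROPIC_API_KEY is set in the environment; the model ID and search queries are placeholders:

```python
# Sketch of parallel scout agents using the Anthropic Python SDK.
# The model ID is a placeholder; substitute the current haiku-tier ID.
from concurrent.futures import ThreadPoolExecutor

import anthropic

client = anthropic.Anthropic()  # reads ANTHROPIC_API_KEY from the environment
HAIKU = "claude-3-5-haiku-latest"  # placeholder model ID

def scout(query: str) -> str:
    """One independent, mechanical query -> cheapest tier."""
    response = client.messages.create(
        model=HAIKU,
        max_tokens=1024,
        messages=[{"role": "user", "content": query}],
    )
    return response.content[0].text

# Parallel: three independent queries, no shared state between them.
queries = [
    "List files under src/auth/ that mention token refresh.",
    "List files under src/api/ that mention rate limiting.",
    "List files under tests/ that cover login flows.",
]
with ThreadPoolExecutor(max_workers=3) as pool:
    findings = list(pool.map(scout, queries))

# Sequential: the plan depends on the scouts' output, so it runs after.
plan_prompt = "Given these findings, outline a plan:\n" + "\n".join(findings)
```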
Context Budget Management
Budget Awareness
| Context Load | Budget | Frequency |
|---|---|---|
| Global CLAUDE.md | <150 lines | Every turn, every session |
| Project CLAUDE.md | <150 lines | Every turn, every session |
| Auto-memory MEMORY.md | <200 lines | Every turn, every session |
| Session context | Finite, compaction is lossy | Fills during session |
Every unnecessary line in CLAUDE.md or MEMORY.md costs tokens on every single turn. Be ruthlessly concise in persistent files.
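One way to hold these budgets is a pre-session line count. A minimal sketch, assuming the conventional ~/.claude/CLAUDE.md global path and leaving the auto-memory path's &lt;project&gt; placeholder (described in the auto-memory section below) as-is:

```python
# Sketch of a pre-session budget check for persistent context files.
# Paths are assumptions; substitute your own project's locations.
from pathlib import Path

BUDGETS = {
    Path("~/.claude/CLAUDE.md").expanduser(): 150,                             # global
    Path("CLAUDE.md"): 150,                                                    # project
    Path("~/.claude/projects/<project>/memory/MEMORY.md").expanduser(): 200,   # auto-memory
}

for path, limit in BUDGETS.items():
    if not path.exists():
        continue  # skip files that are not set up yet
    lines = sum(1 for _ in path.open())
    status = "OK" if lines <= limit else f"OVER by {lines - limit}"
    print(f"{path}: {lines}/{limit} lines ({status})")
```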
Efficiency Principles
- Subagents for exploration (separate context window, doesn’t pollute main)
- Surgical file reads (offset + limit, not full files when you know the area; see the sketch after this list)
- Plans in files, not in chat (reference by path, not by pasting)
- Compact at natural breakpoints (after commit, after phase - not mid-task)
- Commit frequently (each commit is a context checkpoint)
- Reference by commit hash (not by re-reading entire files)
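The surgical-read principle translates directly to plain Python. A sketch mirroring the Read tool's offset + limit parameters; the path in the usage comment is illustrative:

```python
# Sketch of a "surgical" read: fetch only the slice you need, not the file.
# Mirrors the Read tool's offset + limit parameters in plain Python.
from itertools import islice
from pathlib import Path

def read_slice(path: str, offset: int, limit: int) -> str:
    """Return `limit` lines starting at 1-indexed line `offset`."""
    with Path(path).open() as f:
        return "".join(islice(f, offset - 1, offset - 1 + limit))

# e.g. read only lines 120-139 of a large module (illustrative path):
# print(read_slice("src/large_module.py", offset=120, limit=20))
```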
Playbook-to-Model Mapping
| Classification | Example Commands | Default Model | Delegation |
|---|---|---|---|
| Executor | pb-commit, pb-start, pb-deploy | sonnet | Procedural steps, well-defined |
| Orchestrator | pb-release, pb-ship, pb-review | opus (main) | Delegates subtasks to sonnet/haiku |
| Guide | pb-preamble, pb-design-rules | opus | Deep reasoning about principles |
| Reference | pb-patterns-*, pb-templates | sonnet | Pattern application, lookup |
| Review | pb-review-*, pb-security | opus + haiku | Phase 1: haiku automated; Phase 2-3: opus |
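In code form, this mapping is just a lookup table. A sketch with a sonnet fallback for unclassified playbooks; the helper name is illustrative:

```python
# Sketch of the classification-to-model mapping as a plain lookup table.
# Names mirror the table above; the fallback choice is illustrative.
DEFAULT_MODEL = {
    "executor": "sonnet",       # pb-commit, pb-start, pb-deploy
    "orchestrator": "opus",     # pb-release, pb-ship, pb-review; delegates down
    "guide": "opus",            # pb-preamble, pb-design-rules
    "reference": "sonnet",      # pb-patterns-*, pb-templates
    "review": "opus",           # pb-review-*; haiku handles automated phase 1
}

def model_for(classification: str) -> str:
    # Unclassified playbooks fall back to sonnet (the "when unsure" default).
    return DEFAULT_MODEL.get(classification, "sonnet")
```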
Self-Healing and Continuous Learning
The orchestrator is not static. It learns, adapts, and improves.
Operational Self-Awareness
After each significant workflow, reflect:
| Question | Action if Yes |
|---|---|
| Did a model choice produce poor results? | Record in auto-memory, adjust default for that task type |
| Did a subagent return insufficient results? | Note the prompt pattern that failed, try broader/narrower next time |
| Did context fill up mid-task? | Record breakpoint strategy, compact earlier next session |
| Was a playbook missing or insufficient? | Note the gap, suggest improvement to user |
| Did the workflow take more turns than expected? | Analyze why - wrong model? Missing information? Poor delegation? |
Auto-Memory as Learning Journal
Use the auto-memory directory (~/.claude/projects/<project>/memory/) to persist operational learnings:
MEMORY.md (loaded every session, <200 lines):
- Model selection adjustments discovered through experience
- Playbook gaps encountered and workarounds used
- Project-specific orchestration preferences
- Context management lessons learned
Topic files (referenced from MEMORY.md, loaded on demand):
- orchestration-lessons.md - Model choice outcomes, delegation pattern results
- playbook-gaps.md - Missing guidance discovered during workflows
- project-patterns.md - Project-specific efficiency patterns
Feedback Loop
```
Execute workflow
       |
       v
Observe outcome
       |
       v
Was it efficient? Correct? Right model?
       |                      |
      YES                     NO
       |                      |
       v                      v
   Continue        Record learning in auto-memory
                   Adjust approach for next time
                   Surface playbook gap to user if systemic
```
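The "record learning" branch amounts to appending a dated entry to the journal. A sketch, keeping the &lt;project&gt; placeholder from the auto-memory path above; the entry format is an illustrative convention:

```python
# Sketch of the "NO" branch: append one dated learning to the journal.
# The path keeps the <project> placeholder; the entry format is illustrative.
from datetime import date
from pathlib import Path

MEMORY = Path("~/.claude/projects/<project>/memory/MEMORY.md").expanduser()

def record_learning(trigger: str, adjustment: str) -> None:
    """Persist one operational learning for future sessions."""
    MEMORY.parent.mkdir(parents=True, exist_ok=True)
    with MEMORY.open("a") as f:
        f.write(f"- {date.today()}: {trigger} -> {adjustment}\n")

record_learning(
    trigger="haiku scout returned empty results for cross-file search",
    adjustment="use sonnet for multi-file semantic search in this repo",
)
```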
Self-Healing Behaviors
| Trigger | Self-Healing Response |
|---|---|
| Subagent returns empty/useless results | Retry with adjusted prompt or different model tier |
| Context approaching limit mid-task | Proactively compact, checkpoint state in files |
| Playbook command produces unexpected output | Note in memory, suggest playbook update |
| Model produces shallow reasoning | Escalate to higher tier, record the task type |
| Repeated pattern across sessions | Extract to auto-memory for persistent learning |
| Stale information in MEMORY.md | Prune during session start, keep only current learnings |
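Two of these behaviors (retry on empty results, escalate on shallow reasoning) combine into a single escalation loop. A sketch where dispatch is a hypothetical stand-in for however the subagent is actually invoked:

```python
# Sketch of retry-with-escalation: walk up the tier ladder on empty results.
# `dispatch` is a hypothetical stand-in for the actual subagent invocation.
from typing import Callable

TIERS = ["haiku", "sonnet", "opus"]

def run_with_escalation(dispatch: Callable[[str, str], str],
                        prompt: str, start: str = "haiku") -> str:
    """Try each tier from `start` upward until a non-empty result returns."""
    for model in TIERS[TIERS.index(start):]:
        result = dispatch(model, prompt)
        if result.strip():   # empty/useless result -> escalate one tier
            return result    # worth recording the working tier in auto-memory
    raise RuntimeError("All tiers returned empty results; revise the prompt.")
```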
Suggesting Playbook Improvements
When the orchestrator discovers gaps during operation:
- Note the gap - What was missing, what workaround was used
- Assess frequency - One-off vs recurring need
- Propose to user - “Encountered [gap] during [workflow]. Suggest updating [playbook] with [specific addition].”
- Don’t self-modify playbooks silently - Propose, don’t assume
This creates a virtuous cycle: use playbooks → discover gaps → propose improvements → playbooks get better → usage gets better.
Anti-Patterns
| Anti-Pattern | Why It Hurts | Better Approach |
|---|---|---|
| Opus for file search | Expensive, no reasoning advantage | haiku via Task tool |
| Haiku for architecture | Shallow reasoning, bad decisions | opus in main context |
| Serializing independent subagents | Wastes wall-clock time | Parallel Task calls |
| Loading full files for 10 lines | Context waste | Read with offset + limit |
| Pasting plans into chat | Consumes context every turn | Store in files, reference by path |
| Skipping compaction until forced | Lossy emergency compaction | Compact at natural breakpoints |
| Same model for everything | Wastes cost or quality | Match model to task |
| Never recording what worked | Same mistakes repeated | Use auto-memory feedback loop |
| Ignoring playbook friction | Workarounds accumulate silently | Surface gaps, propose fixes |
Examples
Example 1: Feature Implementation Workflow
- /pb-plan - opus (main context): architecture decisions, trade-offs
- Explore codebase - haiku (Task tool, 2-3 parallel agents): find relevant files
- Implementation - sonnet (main context): write code
- Write tests - sonnet (Task tool): parallel test generation
- Self-review - opus (main context): critical evaluation
- /pb-commit - sonnet: procedural commit workflow
Post-session reflection:
- Did haiku find what was needed? (If not, adjust search prompts in memory)
- Did sonnet’s code need significant opus review fixes? (If yes, consider opus for complex implementation next time)
Example 2: Playbook Review with Model Delegation
- Phase 1 automated checks - haiku (Task tool): count commands, validate cross-refs
- Phase 2 category review - opus (main context): nuanced evaluation of intent, quality
- Phase 3 cross-category - opus (main context): holistic pattern recognition
Related Commands
- /pb-claude-global - Generate global CLAUDE.md (concise orchestration rules)
- /pb-claude-project - Generate project CLAUDE.md
- /pb-learn - Pattern learning from debugging (complements operational learning here)
- /pb-review-playbook - Playbook review (model delegation by phase)
- /pb-new-playbook - Meta-playbook (resource hint in scaffold)
Last Updated: 2026-02-07 Version: 1.0.0