Agents & Parallelism — agentplaybook.ai

#039 Claude Code

Each Claude Code subagent gets its own fresh 200k context window — not a shared slice

Each subagent spawned via the Task tool starts with a blank context. Four parallel subagents give you 800k tokens of simultaneous attention. Context multiplication is the primary value — not just task delegation.

"Each subagent will have its own context window, so this is a neat way to gain additional context window for large codebases."

↗ Source

#040 Claude Code

Subagents don't inherit parent skills — list them explicitly in the skills: frontmatter

Without the skills: field in agent YAML frontmatter, the subagent discovers skills on-demand, wasting turns and tokens on context loading before doing any actual work.

"Use the skills field to inject skill content into a subagent's context at startup… Subagents don't inherit skills from the parent conversation; you must list them explicitly."

↗ Source

#041 Claude Code

Session-scoped permissions don't transfer to subagents — use PreToolUse hooks to auto-approve

Interactive permissions granted during a session don't propagate to spawned subagents. Without PreToolUse hooks auto-approving tools in the subagent definition, they interrupt repeatedly.

"Subagents do not automatically inherit parent agent permissions. To avoid repeated prompts, use PreToolUse hooks to auto-approve specific tools."

↗ Source

#042 Claude Code

Task tool parallelism caps at 10 — batches run atomically, no rolling queue drain

Claude Code has a hard 10-concurrent-subagent limit. The entire current batch must complete before the next batch starts. Structure task decomposition to optimize for batches of ≤10 independent units.

"The parallelism level is capped at 10… It will wait until all the tasks in the current batch completed before starting the next batch."

↗ Source

#043 Claude Code

Press Ctrl+B while a subagent runs to background it and keep working immediately

When Claude spawns a blocking subagent, Ctrl+B moves it to the background. Monitor with /tasks. This turns Claude Code from turn-based into a parallel development environment without any config change.

"When Claude spawns a sub-agent, press Ctrl+B to move it to the background."

↗ Source

#044 Claude Code

CLAUDE_CODE_SUBAGENT_MODEL runs all subagents on a cheaper model

Set this env var to run all subagents on Haiku while your main session uses Sonnet or Opus. Subagents do focused, bounded work and rarely need the frontier model.

"Set CLAUDE_CODE_SUBAGENT_MODEL to control which model your sub-agents run on. This is one of the most impactful optimizations for cost and speed."

↗ Source

#045 Claude Code

isolation: worktree in agent frontmatter runs it in a temporary git worktree

Setting this creates a fresh git worktree for the agent's work. Changes don't affect your working branch unless explicitly merged. Essential for risky automated refactors — the worktree is disposable.

"isolation: Set to worktree to run in a temporary git worktree"

↗ Source

#046 Claude Code

Only the subagent's final message returns to the parent — all intermediate work stays isolated

Every file read, tool call, and reasoning step inside a subagent stays within its context window. Design subagents to return concise, structured summaries — verbose final outputs bloat the parent context.

"Intermediate tool calls and results stay inside the subagent; only its final message returns to the parent."

↗ Source

#047 Claude Code

Resuming the same session in two terminals interleaves both — use --fork-session

If you open the same session ID in two terminals, messages interleave and the conversation becomes unusable. For parallel work from the same starting point, always use --fork-session.

"If you resume the same session in multiple terminals, both terminals write to the same session file. Messages get interleaved. For parallel work from the same starting point, use --fork-session."

↗ Source

#048 Codex

Codex only spawns subagents when you explicitly ask — never automatic delegation

Unlike some frameworks that auto-delegate, Codex requires explicit instruction to use subagent workflows. Each subagent does its own model and tool work — always opt-in per-task.

"Codex only spawns subagents when you explicitly ask it to. Because each subagent does its own model and tool work, subagent workflows consume more tokens."

↗ Source

#049 Codex

Custom Codex agents can have per-agent model, reasoning effort, AND sandbox mode

Each custom agent in config.toml specifies its own model, model_reasoning_effort, and sandbox_mode. A read-only mapper uses a mini model. A debugger uses the full frontier model.

"name = 'code_mapper' … model = 'gpt-5.4-mini' model_reasoning_effort = 'medium' sandbox_mode = 'read-only'"

↗ Source

#050 Codex

Subagent activity visible in app and CLI — IDE Extension support still pending

The IDE Extension does not yet show subagent output inline. If you're using VSCode/Cursor and expecting to monitor subagents inline, you're looking at the wrong surface — use the app or CLI.

"Subagent activity is currently surfaced in the Codex app and CLI. Visibility in the IDE Extension is coming soon."

↗ Source

#051 Codex

sqlite_home controls where Codex stores agent job state and exported results

Redirect to a shared volume in team environments so agent job outputs are inspectable by other team members or downstream pipeline steps.

"sqlite_home controls where Codex stores the SQLite-backed state used for agent jobs and their exported results."

↗ Source

#052 Codex

Run Codex CLI as an MCP server so the OpenAI Agents SDK can call it as a tool

Codex CLI can start as a long-running MCP server process. The Agents SDK calls it as a tool — enabling multi-agent orchestration where a GPT-5 orchestrator delegates file editing and code execution to Codex.

"Initializing Codex CLI as an MCP Server: How to run Codex as a long-running MCP process."

↗ Source

#053 Codex

Gate multi-agent handoffs on artifact existence — not on model assertions

The orchestrator should verify that required output files physically exist before advancing to the next agent. Don't trust 'I finished' — verify /design/design_spec.md exists. File existence is the tamper-proof gate.

"Wait for the Designer to produce /design/design_spec.md. Verify that file exists before proceeding. Do not advance to the next handoff until the required files for that step are present."

↗ Source

#054 Codex

codex cloud exec --attempts 3 generates best-of-N solutions in parallel

The --attempts flag (1–4) tells Codex cloud to generate multiple independent solutions. For ambiguous tasks where the first solution might not be optimal, best-of-N improves quality at modest extra cost.

"Add --attempts (1–4) to request best-of-N runs when you want Codex cloud to generate more than one solution."

↗ Source

#055 Both

Keep the orchestrator narrow (read + route) and give execution agents wide tool access

The orchestrator plans and delegates with minimal tool permissions. Execution subagents have the full tool surface for their domain. This prevents an orchestrator bug from causing broad damage.

"Make the orchestrator responsible for global planning, delegation, and state. Keep its tool permissions narrow — mostly 'read and route.'"

↗ Source

#056 Codex

Codex SDK thread model — call thread.run() again to continue, no history re-sending

The Codex TypeScript SDK uses persistent threads. Call thread.run('next prompt') to continue without re-sending history. Resume past threads by thread ID. Far more efficient than reconstructing full history.

"Call run() again to continue on the same thread, or resume a past thread by providing a thread ID."

↗ Source

#057 Codex

max_nesting_depth caps subagent recursion — default is 1, root session is depth 0

Controls how many levels of subagents can spawn subagents. Default max = 1. Increase carefully — each depth level multiplies token cost and makes debugging exponentially harder.

"Maximum nesting depth allowed for spawned agent threads (root sessions start at depth 0; default: 1)."

↗ Source

#058 Codex

Context pollution and context rot are the two named failure modes of long agent chains

Context pollution = irrelevant data filling context early. Context rot = context becoming stale and misleading later. Solution: isolated context windows per subagent and curated summary-only handoffs.

"For the concepts and tradeoffs behind subagent workflows, including context pollution, context rot, and model-selection guidance, see Subagent concepts."

↗ Source