Daily Newsletter

Tool Showing 20 tips
#039 Claude Code
Each Claude Code subagent gets its own fresh 200k context window β€” not a shared slice
Each subagent spawned via the Task tool starts with a blank context. Four parallel subagents give you 800k tokens of simultaneous attention. Context multiplication is the primary value β€” not just task delegation.
"Each subagent will have its own context window, so this is a neat way to gain additional context window for large codebases."
β†— Source
#040 Claude Code
Subagents don't inherit parent skills β€” list them explicitly in the skills: frontmatter
Without the skills: field in agent YAML frontmatter, the subagent discovers skills on-demand, wasting turns and tokens on context loading before doing any actual work.
"Use the skills field to inject skill content into a subagent's context at startup… Subagents don't inherit skills from the parent conversation; you must list them explicitly."
β†— Source
#041 Claude Code
Session-scoped permissions don't transfer to subagents β€” use PreToolUse hooks to auto-approve
Interactive permissions granted during a session don't propagate to spawned subagents. Without PreToolUse hooks auto-approving tools in the subagent definition, they interrupt repeatedly.
"Subagents do not automatically inherit parent agent permissions. To avoid repeated prompts, use PreToolUse hooks to auto-approve specific tools."
β†— Source
#042 Claude Code
Task tool parallelism caps at 10 β€” batches run atomically, no rolling queue drain
Claude Code has a hard 10-concurrent-subagent limit. The entire current batch must complete before the next batch starts. Structure task decomposition to optimize for batches of ≀10 independent units.
"The parallelism level is capped at 10… It will wait until all the tasks in the current batch completed before starting the next batch."
β†— Source
#043 Claude Code
Press Ctrl+B while a subagent runs to background it and keep working immediately
When Claude spawns a blocking subagent, Ctrl+B moves it to the background. Monitor with /tasks. This turns Claude Code from turn-based into a parallel development environment without any config change.
"When Claude spawns a sub-agent, press Ctrl+B to move it to the background."
β†— Source
#044 Claude Code
CLAUDE_CODE_SUBAGENT_MODEL runs all subagents on a cheaper model
Set this env var to run all subagents on Haiku while your main session uses Sonnet or Opus. Subagents do focused, bounded work and rarely need the frontier model.
"Set CLAUDE_CODE_SUBAGENT_MODEL to control which model your sub-agents run on. This is one of the most impactful optimizations for cost and speed."
β†— Source
#045 Claude Code
isolation: worktree in agent frontmatter runs it in a temporary git worktree
Setting this creates a fresh git worktree for the agent's work. Changes don't affect your working branch unless explicitly merged. Essential for risky automated refactors β€” the worktree is disposable.
"isolation: Set to worktree to run in a temporary git worktree"
β†— Source
#046 Claude Code
Only the subagent's final message returns to the parent β€” all intermediate work stays isolated
Every file read, tool call, and reasoning step inside a subagent stays within its context window. Design subagents to return concise, structured summaries β€” verbose final outputs bloat the parent context.
"Intermediate tool calls and results stay inside the subagent; only its final message returns to the parent."
β†— Source
#047 Claude Code
Resuming the same session in two terminals interleaves both β€” use --fork-session
If you open the same session ID in two terminals, messages interleave and the conversation becomes unusable. For parallel work from the same starting point, always use --fork-session.
"If you resume the same session in multiple terminals, both terminals write to the same session file. Messages get interleaved. For parallel work from the same starting point, use --fork-session."
β†— Source
#048 Codex
Codex only spawns subagents when you explicitly ask β€” never automatic delegation
Unlike some frameworks that auto-delegate, Codex requires explicit instruction to use subagent workflows. Each subagent does its own model and tool work β€” always opt-in per-task.
"Codex only spawns subagents when you explicitly ask it to. Because each subagent does its own model and tool work, subagent workflows consume more tokens."
β†— Source
#049 Codex
Custom Codex agents can have per-agent model, reasoning effort, AND sandbox mode
Each custom agent in config.toml specifies its own model, model_reasoning_effort, and sandbox_mode. A read-only mapper uses a mini model. A debugger uses the full frontier model.
"name = 'code_mapper' … model = 'gpt-5.4-mini' model_reasoning_effort = 'medium' sandbox_mode = 'read-only'"
β†— Source
#050 Codex
Subagent activity visible in app and CLI β€” IDE Extension support still pending
The IDE Extension does not yet show subagent output inline. If you're using VSCode/Cursor and expecting to monitor subagents inline, you're looking at the wrong surface β€” use the app or CLI.
"Subagent activity is currently surfaced in the Codex app and CLI. Visibility in the IDE Extension is coming soon."
β†— Source
#051 Codex
sqlite_home controls where Codex stores agent job state and exported results
Redirect to a shared volume in team environments so agent job outputs are inspectable by other team members or downstream pipeline steps.
"sqlite_home controls where Codex stores the SQLite-backed state used for agent jobs and their exported results."
β†— Source
#052 Codex
Run Codex CLI as an MCP server so the OpenAI Agents SDK can call it as a tool
Codex CLI can start as a long-running MCP server process. The Agents SDK calls it as a tool β€” enabling multi-agent orchestration where a GPT-5 orchestrator delegates file editing and code execution to Codex.
"Initializing Codex CLI as an MCP Server: How to run Codex as a long-running MCP process."
β†— Source
#053 Codex
Gate multi-agent handoffs on artifact existence β€” not on model assertions
The orchestrator should verify that required output files physically exist before advancing to the next agent. Don't trust 'I finished' β€” verify /design/design_spec.md exists. File existence is the tamper-proof gate.
"Wait for the Designer to produce /design/design_spec.md. Verify that file exists before proceeding. Do not advance to the next handoff until the required files for that step are present."
β†— Source
#054 Codex
codex cloud exec --attempts 3 generates best-of-N solutions in parallel
The --attempts flag (1–4) tells Codex cloud to generate multiple independent solutions. For ambiguous tasks where the first solution might not be optimal, best-of-N improves quality at modest extra cost.
"Add --attempts (1–4) to request best-of-N runs when you want Codex cloud to generate more than one solution."
β†— Source
#055 Both
Keep the orchestrator narrow (read + route) and give execution agents wide tool access
The orchestrator plans and delegates with minimal tool permissions. Execution subagents have the full tool surface for their domain. This prevents an orchestrator bug from causing broad damage.
"Make the orchestrator responsible for global planning, delegation, and state. Keep its tool permissions narrow β€” mostly 'read and route.'"
β†— Source
#056 Codex
Codex SDK thread model β€” call thread.run() again to continue, no history re-sending
The Codex TypeScript SDK uses persistent threads. Call thread.run('next prompt') to continue without re-sending history. Resume past threads by thread ID. Far more efficient than reconstructing full history.
"Call run() again to continue on the same thread, or resume a past thread by providing a thread ID."
β†— Source
#057 Codex
max_nesting_depth caps subagent recursion β€” default is 1, root session is depth 0
Controls how many levels of subagents can spawn subagents. Default max = 1. Increase carefully β€” each depth level multiplies token cost and makes debugging exponentially harder.
"Maximum nesting depth allowed for spawned agent threads (root sessions start at depth 0; default: 1)."
β†— Source
#058 Codex
Context pollution and context rot are the two named failure modes of long agent chains
Context pollution = irrelevant data filling context early. Context rot = context becoming stale and misleading later. Solution: isolated context windows per subagent and curated summary-only handoffs.
"For the concepts and tradeoffs behind subagent workflows, including context pollution, context rot, and model-selection guidance, see Subagent concepts."
β†— Source