Where are Claude Code custom agent files stored and what format do they use?

Custom agents are Markdown files with YAML frontmatter, stored in .claude/agents/ for project-scoped agents (version-controlled and shared with your team) or ~/.claude/agents/ for user-scoped agents available across every project. The frontmatter declares name, description, an optional tools allowlist, and an optional model; everything below the closing dash-dash-dash is the agents system prompt.

Why does the description field matter more than the system prompt for triggering a Claude Code subagent?

The description is the routing signal the parent agent reads when deciding whether to delegate, so it must encode when to use the agent with concrete triggers like specific file paths. Including the word 'proactively' nudges the parent to invoke the agent without being explicitly asked; if an agent never fires on its own, the description is almost always the cause.

Should a Claude Code custom agent be given access to all tools?

No, apply least privilege. Omitting the tools field grants the full inherited toolset, which lets a reviewer with Write, Edit, and Bash 'helpfully' modify the code it was meant to independently review. Declare only what the job needs: a reviewer or auditor gets Read, Grep, Glob, while a test runner adds Bash.

Can a Claude Code subagent spawn or call other subagents?

No. Subagents are one level deep and cannot spawn further subagents, which is a deliberate guardrail against runaway fan-out. For multi-stage work, the top-level parent agent coordinates by calling agent A, reading its result, then calling agent B, so design each custom agent as a single-purpose worker that returns a structured report.

Do Claude Code custom agents work in CI and headless runs?

Yes, they work both interactively and non-interactively. The same .claude/agents/ definitions are picked up in print mode (-p) used in CI, and because they are version-controlled files in the repo, every teammate and every CI job sees the identical agent definitions.

Claude Code Custom Agent Authoring

Introduction #

In Claude Code Subagent Patterns we covered five workflows for spending your context window wisely — and the fifth, pipeline orchestration with custom agents, is the one teams ask about most. “Codify your review checklist as a subagent” sounds great until you open an empty .claude/agents/migration-reviewer.md and a blinking cursor.

This guide is the missing manual. We’ll walk through the anatomy of a custom agent definition, what each frontmatter field actually controls, how to write a system prompt that produces a structured report instead of a chatty ramble, why tool allowlists matter more than they look, and two complete, production-ready examples you can copy today. Then the mistakes — because the failure modes here are subtle and they cost you trust in the agent the first time it misses something obvious.

If you’ve never delegated to a subagent before, read the patterns piece first. If you have, and you’re ready to ship your own, this is the playbook.

Anatomy of a Custom Agent #

A custom agent is a single Markdown file with YAML frontmatter. It lives in one of two places:

.claude/agents/<name>.md — project-scoped, version-controlled, shared with your whole team
~/.claude/agents/<name>.md — user-scoped, available across every project on your machine

The structure is dead simple:


---
name: migration-reviewer
description: Reviews database migrations for safety. Use when a PR touches db/migrate/, schema files, or any SQL DDL.
tools: Read, Grep, Glob
model: sonnet
---

You are a database migration reviewer. Your job is to catch unsafe
migrations before they reach production...

Everything above the closing --- is configuration. Everything below it is the system prompt — the persona and instruction set the subagent runs under. That’s the entire contract. No build step, no registration, no plugin manifest. Drop the file in, run /agents to confirm Claude Code picked it up, and it’s invokable.

The Frontmatter Fields #

Four fields do all the work. Three of them are optional, but the defaults are rarely what you want for a serious agent.

`name` (required) #

The agent’s identity — this is the string the parent passes as subagent_type. Keep it kebab-case and descriptive: security-auditor, not agent2. The filename is cosmetic; the name field is canonical.

`description` (required — and the one people underweight) #

This is the routing signal. When the parent agent is deciding whether to delegate, it reads descriptions, not system prompts. So a description must encode when to reach for this agent, with concrete triggers:

❌ description: A code reviewer. ✅ description: Reviews code changes for correctness and security. Use proactively after writing a non-trivial diff, before committing, especially for auth, payments, or concurrency-sensitive code.

The word “proactively” is load-bearing — it nudges the parent to invoke without being explicitly asked. If your agent never seems to fire, the description is almost always why.

`tools` (optional — but declare it anyway) #

A comma-separated allowlist. Omit it and the agent inherits every tool the parent has. We’ll spend a whole section on why that’s usually wrong.

`model` (optional) #

Pin a tier: haiku for cheap mechanical passes, sonnet for balanced review work, opus for deep reasoning. A high-volume linter-style agent on haiku keeps costs sane; a security auditor where a miss is expensive earns opus.

Writing the System Prompt #

The body is where most agents are won or lost. Three rules produce reliable workers:

1. State the role and the boundary in the first sentence. “You are a migration reviewer. You do not write code or apply fixes — you report findings.” Telling the agent what not to do is as important as the job itself.

2. Specify the output contract. Vague prompts produce prose; you want structure. Spell it out:

Report your findings as a list. For each issue:
- SEVERITY: blocker | warning | nit
- LOCATION: file:line
- PROBLEM: one sentence
- FIX: the concrete change
End with a one-line VERDICT: SAFE TO MERGE or NEEDS CHANGES.

3. Give it a checklist, not a vibe. “Review for safety” is a wish. Enumerate exactly what to check — the agent will work through your list deterministically, which is the entire value of codifying it.

Tool Allowlists: Least Privilege for Agents #

Here’s the trap. Leave tools out, and your “reviewer” inherits Write, Edit, and Bash. The first time it finds an issue, it may “helpfully” fix it — mutating your working tree, running commands, and destroying the independence that made the review worth requesting.

The fix is least privilege. Match tools to the job:

Agent kind	Tools
Reviewer / auditor	`Read, Grep, Glob`
Researcher / explorer	`Read, Grep, Glob, WebSearch, WebFetch`
Test runner	`Read, Grep, Glob, Bash`
Fixer (rare, deliberate)	`Read, Edit, Bash`

A read-only reviewer literally cannot go rogue. That predictability is what lets you trust its report without re-checking everything it touched. (If you later wire in external systems through MCP servers, the same discipline applies — only grant the MCP tools the agent genuinely needs.)

Worked Example: A Migration Reviewer #

---
name: migration-reviewer
description: Reviews database migrations for production safety. Use proactively when a change touches db/migrate/, schema.rb, or any SQL DDL file.
tools: Read, Grep, Glob
model: sonnet
---

You are a database migration reviewer. You do NOT edit files or run
migrations — you read the proposed migration and report risks.

Check every migration against this list:
1. Adding a column with a NOT NULL constraint and no default on a large table (locks).
2. Adding an index without CONCURRENTLY (blocks writes).
3. Renaming or dropping a column still referenced by application code.
4. A data backfill running inside the same transaction as the schema change.
5. Missing a corresponding rollback / down path.

Report findings as:
- SEVERITY: blocker | warning | nit
- LOCATION: file:line
- PROBLEM / FIX
End with VERDICT: SAFE TO MERGE or NEEDS CHANGES.

Invoke it from the parent with a natural request — “review the migration on this branch” — and because the description names db/migrate/, the parent routes there on its own.

Worked Example: A Security Gate #

---
name: security-gate
description: Threat-models diffs that touch authentication, authorization, secrets, or user input. Use proactively before merging any auth or payments change.
tools: Read, Grep, Glob
model: opus
---

You are a security reviewer with a threat-modeling mindset. Assume the
input is hostile. You report only — you never modify code.

For the diff, check:
- Authn/authz: can this path be reached without the expected check?
- Injection: is user input concatenated into SQL, shell, or HTML?
- Secrets: any key, token, or password added to code or logs?
- IDOR: are object references scoped to the authenticated user?

For each finding give an EXPLOIT SKETCH (how an attacker triggers it),
then the FIX. Default to flagging when uncertain — false positives are
cheap, a missed auth hole is not.

Note the opus model and the “default to flagging when uncertain” instruction — for a security gate you tune toward paranoia.

Testing and Iterating on Agents #

Don’t ship an agent you haven’t tried to fool. Spin up a git worktree or a throwaway branch with a planted problem — a migration missing CONCURRENTLY, an endpoint missing an ownership check — and invoke the agent.

You’re testing two independent things:

Did it get triggered by a natural request? If not, fix the description.
Did it catch the planted bug? If not, fix the system prompt’s checklist.

These fail for different reasons, so iterate on them separately. A common surprise: the agent works perfectly when you name it explicitly but never fires on its own — that’s always a description problem, never a body problem.

Common Authoring Mistakes #

Vague description. The agent does great work nobody ever triggers. Add concrete file paths and the word “proactively.”
No tool allowlist. Your reviewer edits the code it was supposed to review. Declare Read, Grep, Glob.
Prose output, no contract. You get three paragraphs of opinion instead of a triaged list. Specify the exact report format.
One mega-agent. A single “do-everything” agent is just the parent with extra steps. Split by concern — that’s the specialist delegation pattern working for you.
Set-and-forget. Agents are code. An unmaintained checklist rots as your stack changes. Review them quarterly.

The Principle #

A custom agent is executable institutional knowledge. The review standard that used to live in a wiki page nobody opened, or in the head of the one senior engineer who always caught the bug — you encode it once, version-control it, and every teammate plus every CI run gets the identical, tireless reviewer. The agent doesn’t get rushed before a deadline and skip steps 3, 5, and 7. That consistency, not raw intelligence, is the win.

Setting Up Production-Ready Claude Code #

To run custom-agent pipelines at scale you want stable infrastructure:

A reliable host for long-running and CI sessions. Custom agents shine in CI, where they gate every PR. You need a box that won’t drop the job. HTStack — Hong Kong VPS with low-latency access from mainland China and stable BGP routing. It’s the same IDC that hosts dibi8.com, so we run our own agent pipelines on it. Value tier runs $5-12/month.
Cloud headroom for parallel gates. When an orchestrator fans out to migration-reviewer + security-gate + perf-checker at once, you want spare CPU. DigitalOcean — $200 free credit for 60 days across 14+ regions, great for hosting CI runners next to your app.
A skills bundle. The steepest part of the curve is writing agent definitions that don’t fall over. We packaged five battle-tested skills as a $19 bundle on Gumroad — see the floating CTA in the corner — including the orchestrator prompts and three more ready-to-ship agent definitions.

Claude Code Subagent Patterns — the five workflows; this guide deep-dives pattern 5.
Superpowers framework — a curated skill/agent library to learn from.
MCP Servers 2026 Rankings — extend agent capability beyond the bundled tools.
AI Coding Agent Landscape — where custom agents sit in the wider ecosystem.

Verdict #

Custom agents turn your team’s best practices from documentation nobody reads into checks that run on every change. The recipe: a sharp description so it triggers, a least-privilege tool allowlist so it stays in its lane, and a system prompt with an explicit checklist and output contract so it produces a report you can act on.

Start with one — the migration reviewer above is the highest-leverage first agent for most teams. Plant a bug, confirm it catches it, then commit the file. From that moment, every teammate has a reviewer that never gets tired and never skips a step.

Claude Code Custom Agent Authoring

Introduction #

Anatomy of a Custom Agent #

The Frontmatter Fields #

`name` (required) #

`description` (required — and the one people underweight) #

`tools` (optional — but declare it anyway) #

`model` (optional) #

Writing the System Prompt #

Tool Allowlists: Least Privilege for Agents #

Worked Example: A Migration Reviewer #

Worked Example: A Security Gate #

Testing and Iterating on Agents #

Common Authoring Mistakes #

The Principle #

Setting Up Production-Ready Claude Code #

Verdict #

References & Sources #

💬 Discussion

Introduction #

Anatomy of a Custom Agent #

The Frontmatter Fields #

name (required) #

description (required — and the one people underweight) #

tools (optional — but declare it anyway) #

model (optional) #

Writing the System Prompt #

Tool Allowlists: Least Privilege for Agents #

Worked Example: A Migration Reviewer #

Worked Example: A Security Gate #

Testing and Iterating on Agents #

Common Authoring Mistakes #

The Principle #

Setting Up Production-Ready Claude Code #

Related Reading #

Verdict #

References & Sources #

🔗 Related Resources

💬 Discussion

`name` (required) #

`description` (required — and the one people underweight) #

`tools` (optional — but declare it anyway) #

`model` (optional) #