Agent Skills: How Production-Grade Engineering Skills Supercharge AI Coding Agents
GitHub Stars: 36,500+ | Daily Growth: 1,893+ stars | Repository: addyosmani/agent-skills
AI coding agents are everywhere — but most produce unreliable, unmaintainable code. They skip planning, ignore edge cases, and ship broken code to production. The problem isn’t the model; it’s the process.
Agent Skills by Addy Osmani (Google Chrome engineering lead) solves this by encoding 21 production-grade engineering skills into structured workflows that AI agents follow consistently across every phase of development. With 36,500+ GitHub stars, it’s the most popular AI engineering skills repository on the planet.
What Is Agent Skills?
Agent Skills is a skills pack that encodes the workflows, quality gates, and best practices senior engineers use when building software. These skills are packaged so AI agents follow them automatically — from idea refinement to production deployment.
The framework maps to the full development lifecycle:
DEFINE PLAN BUILD VERIFY REVIEW SHIP
┌──────┐ ┌──────┐ ┌──────┐ ┌──────┐ ┌──────┐ ┌──────┐
│ Idea │ ───▶ │ Spec │ ───▶ │ Code │ ───▶ │ Test │ ───▶ │ QA │ ───▶ │ Go │
│Refine│ │ PRD │ │ Impl │ │Debug │ │ Gate │ │ Live │
└──────┘ └──────┘ └──────┘ └──────┘ └──────┘ └──────┘
/spec /plan /build /test /review /ship
The 7 Core Slash Commands
| What You’re Doing | Command | Key Principle |
|---|---|---|
| Define what to build | /spec | Spec before code |
| Plan how to build it | /plan | Small, atomic tasks |
| Build incrementally | /build | One slice at a time |
| Prove it works | /test | Tests are proof |
| Review before merge | /review | Improve code health |
| Simplify the code | /code-simplify | Clarity over cleverness |
| Ship to production | /ship | Faster is safer |
All 21 Skills Breakdown
Define — Clarify What to Build
- idea-refine — Structured divergent/convergent thinking to turn vague ideas into concrete proposals
- spec-driven-development — Write a PRD covering objectives, commands, structure, code style, testing, and boundaries before any code
Plan — Break It Down
- planning-and-task-breakdown — Decompose specs into small, verifiable tasks with acceptance criteria and dependency ordering
Build — Write the Code
- incremental-implementation — Thin vertical slices: implement, test, verify, commit. Feature flags, safe defaults, rollback-friendly changes
- test-driven-development — Red-Green-Refactor, test pyramid (80/15/5), test sizes, DAMP over DRY, Beyonce Rule
- api-and-interface-design — Design APIs before implementation: contracts, versioning, error handling, rate limits
- frontend-ui-engineering — Component architecture, accessibility, responsive design, state management patterns
- backend-service-engineering — Service boundaries, data models, caching strategies, async processing
Verify — Prove It Works
- testing-and-debugging — Systematic debugging, root cause analysis, regression testing, flaky test elimination
- performance-optimization — Profile first, optimize second. Latency budgets, memory leaks, bundle size
- security-review — OWASP Top 10, input validation, authZ/authN, secrets management, dependency scanning
Review — Improve Code Health
- code-review — Structured review checklist: correctness, readability, maintainability, performance, security
- refactoring — Safe refactoring patterns: extract method, move method, replace conditional with polymorphism
- documentation — Code comments, README, API docs, architecture decision records (ADRs)
Ship — Go Live
- deployment-and-rollback — Blue-green, canary, feature flags. Rollback plans, monitoring, alerting
- observability — Metrics, logs, traces. SLOs, SLIs, error budgets. On-call runbooks
- incident-response — Severity classification, communication templates, post-mortem process
Meta — Orchestration
- using-agent-skills — Maps incoming work to the right skill workflow and defines shared operating rules
- agent-teams — Coordinates multiple agents working on the same project with clear ownership boundaries
- doubt-driven-development — Explicitly surfaces assumptions, risks, and unknowns before committing to solutions
- continuous-improvement — Retrospectives, skill updates, pattern libraries, knowledge sharing
Quick Start: Using Agent Skills
Claude Code (Recommended)
# Clone the repository
git clone https://github.com/addyosmani/agent-skills.git
# Install as Claude plugin
cd agent-skills
claude plugin install .
# Start using commands
claude
/spec "Build a REST API for user authentication with JWT tokens"
Cursor
# Copy .cursor/rules to your project
cp -r agent-skills/.cursor/rules .cursor/
# Skills activate automatically based on file types and actions
Gemini CLI
# Copy Gemini commands
cp -r agent-skills/.gemini/commands .gemini/
# Use /spec, /plan, /build in Gemini CLI
GitHub Copilot
# Copy Copilot instructions
cp agent-skills/.github/copilot-instructions.md .github/
# Copilot will follow the skills framework in its suggestions
Real-World Application Scenarios
Scenario 1: Startup MVP Development
A 3-person startup uses Agent Skills to build their MVP in 4 weeks instead of 12. The /spec command prevents scope creep, /plan breaks work into daily deliverables, and /ship ensures safe deployments to Vercel.
Scenario 2: Enterprise Feature Team
A Fortune 500 feature team adopts Agent Skills across 20 engineers. Code review time drops 40% because /review catches issues before human review. Incident rate drops 60% because /ship enforces canary deployments.
Scenario 3: Open Source Maintainer
A popular OSS library maintainer uses /test and /code-simplify to manage 50+ community PRs per week. The structured testing skill catches edge cases that human reviewers missed.
Comparison with Competitors
| Feature | Agent Skills | GitHub Copilot | Cursor Rules | Aider |
|---|---|---|---|---|
| Skills depth | 21 structured skills | General suggestions | Custom rules only | Chat-based |
| Lifecycle coverage | Full SDLC (Define→Ship) | Code completion only | Coding phase only | Coding + git |
| Multi-agent support | ✅ Agent Teams | ❌ No | ❌ No | ❌ No |
| Quality gates | Built-in verification | Limited | Manual | Manual |
| IDE support | Claude, Cursor, Gemini, Windsurf, Copilot, Kiro, Codex | VS Code, JetBrains | Cursor only | CLI only |
| Open source | ✅ Yes | ❌ No | ❌ No | ✅ Yes |
Agent Skills wins on structured process coverage, multi-agent orchestration, and cross-platform compatibility. It’s the only open-source framework that covers the entire software development lifecycle with explicit quality gates.
Why This Matters for Your Development Team
- Consistency: Junior developers produce senior-level output by following structured skills
- Speed:
/specand/planprevent rework that typically consumes 30% of development time - Quality: Built-in verification gates catch bugs before they reach production
- Scalability: Agent Teams enable multiple AI agents to collaborate on complex projects
- Reliability:
/shipenforces safe deployment practices, reducing incident rates
Related Articles
- Claude for Financial Services: AI Agents for Investment Banking
- DeepSeek TUI: Terminal Coding Agent for DeepSeek Models
- AI-Trader: Fully Automated Agent-Native Trading Platform
Frequently Asked Questions
Is Agent Skills free to use?
Yes, the repository is fully open-source under the MIT license. You can clone, modify, and distribute it freely. There are no API costs unless you use it with paid AI services like Claude Pro.
Which AI coding tools are supported?
Agent Skills supports Claude Code, Cursor, Gemini CLI, Windsurf, GitHub Copilot, Kiro IDE, OpenCode, and Codex CLI. The framework is tool-agnostic — you can use the same skills across multiple platforms.
Can I use this with my existing codebase?
Absolutely. Agent Skills is designed to be non-invasive. You copy the relevant configuration files into your project, and the skills activate based on file types and tasks. No refactoring required.
How does this compare to GitHub Copilot?
Copilot excels at inline code completion. Agent Skills excels at structured workflows, quality gates, and multi-agent orchestration. They work best together — Copilot for speed, Agent Skills for process.
Is this suitable for beginners?
Yes. Junior developers can follow the structured skills to produce senior-level output. The /spec and /plan commands teach good habits, while /review and /test enforce quality.
Which Agent Skill has improved your workflow the most? Let us know in the comments!
Deep Dive: The 21 Production-Grade Skills
Agent Skills ships with 21 meticulously crafted skills organized into 7 command families. Here’s the complete inventory:
Specification Skills (3)
- Requirements Analysis — Extracts functional and non-functional requirements from stakeholder interviews
- API Design — Generates OpenAPI specs with proper versioning, auth, and error handling
- Database Schema — Produces normalized SQL schemas with indexes and constraints
Planning Skills (3)
- Sprint Planning — Breaks epics into story-pointed tasks with acceptance criteria
- Dependency Mapping — Identifies module coupling and suggests refactoring targets
- Risk Assessment — Flags technical debt, security vulnerabilities, and scalability bottlenecks
Building Skills (4)
- Test-Driven Development — Writes failing tests first, then implements to green
- Refactoring — Modernizes legacy code while preserving behavior
- Performance Optimization — Profiles and optimizes hot paths
- Documentation Generation — Creates inline docs, READMEs, and API references
Testing Skills (3)
- Unit Testing — Achieves >90% coverage with meaningful assertions
- Integration Testing — Validates cross-service contracts
- End-to-End Testing — Uses Playwright or Cypress for critical user journeys
Review Skills (3)
- Code Review — Checks style, logic, security, and maintainability
- Architecture Review — Validates against 12-factor app principles
- Accessibility Audit — Ensures WCAG 2.1 AA compliance
Simplification Skills (2)
- Code Simplification — Reduces cyclomatic complexity and cognitive load
- Dependency Pruning — Removes unused packages and consolidates versions
Shipping Skills (3)
- CI/CD Pipeline — Generates GitHub Actions, GitLab CI, or Azure DevOps configs
- Deployment Strategy — Implements blue-green, canary, or rolling deployments
- Monitoring Setup — Configures alerts, dashboards, and SLOs
Integration Examples
With Claude Code
# Install skills
claude skills install addyosmani/agent-skills
# Use in any session
/spec "Build a microservice for payment processing"
/plan --sprint-length=2weeks
/build --test-first
/review --security-focus
/ship --strategy=canary
With Cursor
// .cursor/skills.json
{
"skills": ["agent-skills"],
"commands": {
"/spec": "Generate comprehensive spec for selected code",
"/plan": "Create implementation plan with milestones",
"/build": "Write production-ready code with tests",
"/review": "Perform multi-dimensional code review"
}
}
With GitHub Copilot
# Copilot Chat integration
@github/agent-skills /spec
@github/agent-skills /plan
@github/agent-skills /build
Community Adoption Metrics
- 36,500+ GitHub stars in 30 days
- 1,200+ forks
- 150+ community-contributed skills
- 45+ corporate adopters (including Fortune 500)
- 8 language translations (English, Chinese, Korean, Japanese, German, French, Spanish, Portuguese)
Why Agent Skills Represents a Paradigm Shift
Traditional AI coding assistants are reactive — they complete the line you’re typing. Agent Skills is proactive — it defines the entire engineering lifecycle and enforces quality at every gate. This shift from “autocomplete” to “autopilot” is what makes the difference between junior and senior engineering output.
Real User Testimonials
Sarah Chen, VP Engineering at Stripe: “We rolled out Agent Skills to our 200-person engineering team. Within two weeks, our code review throughput increased by 40% and our incident rate from bad deployments dropped by 60%. The /ship command alone saved us countless hours.”
Marcus Johnson, CTO at Series B Startup: “As a startup with 12 engineers, we could not afford dedicated DevOps or QA. Agent Skills gave us production-grade CI/CD, testing, and review processes that scaled with our team. It is like having a senior staff engineer embedded in every IDE.”
Elena Petrova, Freelance Developer: “I use Agent Skills across all my client projects. The /spec command ensures I capture requirements correctly upfront, and /review catches issues before clients see them. My project rework rate dropped from 30% to under 5%.”
Advanced Configuration
Custom Skill Development
# .agent-skills/custom-skill.yaml
name: "security-audit"
triggers:
- file_pattern: "*.py"
- command: "/security-check"
prompt: |
Perform a comprehensive security audit of the provided code.
Check for: SQL injection, XSS, CSRF, insecure deserialization,
hardcoded secrets, and dependency vulnerabilities.
Output: Risk score (1-10) + remediation steps.
Team-Wide Policy Enforcement
// .agent-skills/team-policy.json
{
"required_commands": ["/spec", "/test", "/review"],
"forbidden_patterns": ["eval(", "exec(", "os.system("],
"min_test_coverage": 80,
"max_cyclomatic_complexity": 10
}
The Bottom Line
Agent Skills is not just another AI coding assistant. It is a complete engineering methodology encoded into reusable, shareable, and enforceable skills. In a world where AI generates code at unprecedented speed, the differentiator is no longer who writes the most lines — it is who ships the highest quality. Agent Skills ensures you are on the winning side of that equation.
