When starting work on complex tasks, Claude Code MUST automatically:
- Initialize the swarm using MCP tools
- Spawn concurrent agents using Claude Code's Task tool
- Coordinate via hooks and memory
When user says "spawn swarm" or requests complex work, Claude Code MUST in ONE message:
- Call MCP tools to initialize coordination
- IMMEDIATELY call Task tool to spawn REAL working agents
- Both MCP and Task calls must be in the SAME response
MCP alone does NOT execute work - Task tool agents do the actual work!
The routing system has 3 tiers for optimal cost/performance:
| Tier | Handler | Latency | Cost | Use Cases |
|---|---|---|---|---|
| 1 | Agent Booster (WASM) | <1ms | $0 | Simple transforms (var→const, add types, etc.) - Skip LLM entirely |
| 2 | Haiku | ~500ms | $0.0002 | Simple tasks, low complexity (<30%) |
| 3 | Sonnet/Opus | 2-5s | $0.003-0.015 | Complex reasoning, architecture, security (>30%) |
When you see these recommendations:
-
[AGENT_BOOSTER_AVAILABLE]→ The task can be handled by Agent Booster (352x faster, $0)- Use the Edit tool directly for simple code transforms
- Intent types:
var-to-const,add-types,add-error-handling,async-await,add-logging,remove-console
-
[TASK_MODEL_RECOMMENDATION] Use model="X"→ Use that model in Task tool:
// If hook recommends: [TASK_MODEL_RECOMMENDATION] Use model="opus"
Task({
prompt: "...",
subagent_type: "coder",
model: "opus" // ← USE THE RECOMMENDED MODEL
})Model Selection Logic:
| Complexity | Model | Use For |
|---|---|---|
| Agent Booster intent detected | Skip LLM | var→const, add-types, remove-console (352x faster) |
| High (architecture, system design, security) | opus | Complex reasoning, multi-step planning |
| Medium (features, refactoring, debugging) | sonnet | Balanced capability and speed |
| Low (formatting, simple fixes, docs) | haiku | Fast, cost-effective tasks |
CRITICAL: Always check for [AGENT_BOOSTER_AVAILABLE] or [TASK_MODEL_RECOMMENDATION] before spawning agents.
To prevent goal drift, context drift, and agent desynchronization, ALWAYS use this configuration for coding swarms:
mcp__ruv-swarm__swarm_init({
topology: "hierarchical", // Single coordinator enforces alignment
maxAgents: 8, // Smaller team = less drift surface
strategy: "specialized" // Clear roles reduce ambiguity
})Why This Prevents Drift:
| Choice | Anti-Drift Benefit |
|---|---|
| hierarchical | Coordinator validates each output against goal, catches divergence early |
| maxAgents: 6-8 | Fewer agents = less coordination overhead, easier alignment |
| specialized | Clear boundaries - each agent knows exactly what to do, no overlap |
Consensus for Hive-Mind: Use raft (leader maintains authoritative state)
Additional Anti-Drift Measures:
- Frequent checkpoints via
post-taskhooks - Shared memory namespace for all agents
- Short task cycles with verification gates
When the user requests a complex task (multi-file changes, feature implementation, refactoring), immediately execute this pattern in a SINGLE message:
// STEP 1: Initialize swarm coordination via MCP (in parallel with agent spawning)
// USE ANTI-DRIFT CONFIG: hierarchical + specialized + small team
mcp__ruv-swarm__swarm_init({
topology: "hierarchical",
maxAgents: 8,
strategy: "specialized"
})
// STEP 2: Spawn agents concurrently using Claude Code's Task tool
// ALL Task calls MUST be in the SAME message for parallel execution
Task("Coordinator", "You are the swarm coordinator. Initialize session, coordinate other agents via memory. Run: npx claude-flow@v3alpha hooks session-start", "hierarchical-coordinator")
Task("Researcher", "Analyze requirements and existing code patterns. Store findings in memory via hooks.", "researcher")
Task("Architect", "Design implementation approach based on research. Document decisions in memory.", "system-architect")
Task("Coder", "Implement the solution following architect's design. Coordinate via hooks.", "coder")
Task("Tester", "Write tests for the implementation. Report coverage via hooks.", "tester")
Task("Reviewer", "Review code quality and security. Document findings.", "reviewer")
// STEP 3: Batch all todos
TodoWrite({ todos: [
{content: "Initialize swarm coordination", status: "in_progress", activeForm: "Initializing swarm"},
{content: "Research and analyze requirements", status: "in_progress", activeForm: "Researching requirements"},
{content: "Design architecture", status: "pending", activeForm: "Designing architecture"},
{content: "Implement solution", status: "pending", activeForm: "Implementing solution"},
{content: "Write tests", status: "pending", activeForm: "Writing tests"},
{content: "Review and finalize", status: "pending", activeForm: "Reviewing code"}
]})
// STEP 4: Store swarm state in memory
mcp__claude-flow__memory_usage({
action: "store",
namespace: "swarm",
key: "current-session",
value: JSON.stringify({task: "[user's task]", agents: 6, startedAt: new Date().toISOString()})
})| Code | Task | Agents |
|---|---|---|
| 1 | Bug Fix | coordinator, researcher, coder, tester |
| 3 | Feature | coordinator, architect, coder, tester, reviewer |
| 5 | Refactor | coordinator, architect, coder, reviewer |
| 7 | Performance | coordinator, perf-engineer, coder |
| 9 | Security | coordinator, security-architect, auditor |
| 11 | Memory | coordinator, memory-specialist, perf-engineer |
| 13 | Docs | researcher, api-docs |
Codes 1-11: hierarchical/specialized (anti-drift). Code 13: mesh/balanced
AUTO-INVOKE SWARM when task involves:
- Multiple files (3+)
- New feature implementation
- Refactoring across modules
- API changes with tests
- Security-related changes
- Performance optimization
- Database schema changes
SKIP SWARM for:
- Single file edits
- Simple bug fixes (1-2 lines)
- Documentation updates
- Configuration changes
- Quick questions/exploration
ABSOLUTE RULES:
- ALL operations MUST be concurrent/parallel in a single message
- NEVER save working files, text/mds and tests to the root folder
- ALWAYS organize files in appropriate subdirectories
- USE CLAUDE CODE'S TASK TOOL for spawning agents concurrently, not just MCP
MANDATORY PATTERNS:
- TodoWrite: ALWAYS batch ALL todos in ONE call (5-10+ todos minimum)
- Task tool (Claude Code): ALWAYS spawn ALL agents in ONE message with full instructions
- File operations: ALWAYS batch ALL reads/writes/edits in ONE message
- Bash commands: ALWAYS batch ALL terminal operations in ONE message
- Memory operations: ALWAYS batch ALL memory store/retrieve in ONE message
NEVER save to root folder. Use these directories:
/src- Source code files/tests- Test files/docs- Documentation and markdown files/config- Configuration files/scripts- Utility scripts/examples- Example code
This project is configured with Claude Flow V3 (Anti-Drift Defaults):
- Topology: hierarchical (prevents drift via central coordination)
- Max Agents: 8 (smaller team = less drift)
- Strategy: specialized (clear roles, no overlap)
- Consensus: raft (leader maintains authoritative state)
- Memory Backend: hybrid (SQLite + AgentDB)
- HNSW Indexing: Enabled (150x-12,500x faster)
- Neural Learning: Enabled (SONA)
| Command | Subcommands | Description |
|---|---|---|
init |
4 | Project initialization with wizard, presets, skills, hooks |
agent |
8 | Agent lifecycle (spawn, list, status, stop, metrics, pool, health, logs) |
swarm |
6 | Multi-agent swarm coordination and orchestration |
memory |
11 | AgentDB memory with vector search (150x-12,500x faster) |
mcp |
9 | MCP server management and tool execution |
task |
6 | Task creation, assignment, and lifecycle |
session |
7 | Session state management and persistence |
config |
7 | Configuration management and provider setup |
status |
3 | System status monitoring with watch mode |
start |
3 | Service startup and quick launch |
workflow |
6 | Workflow execution and template management |
hooks |
17 | Self-learning hooks + 12 background workers |
hive-mind |
6 | Queen-led Byzantine fault-tolerant consensus |
| Command | Subcommands | Description |
|---|---|---|
daemon |
5 | Background worker daemon (start, stop, status, trigger, enable) |
neural |
5 | Neural pattern training (train, status, patterns, predict, optimize) |
security |
6 | Security scanning (scan, audit, cve, threats, validate, report) |
performance |
5 | Performance profiling (benchmark, profile, metrics, optimize, report) |
providers |
5 | AI providers (list, add, remove, test, configure) |
plugins |
5 | Plugin management (list, install, uninstall, enable, disable) |
deployment |
5 | Deployment management (deploy, rollback, status, environments, release) |
embeddings |
4 | Vector embeddings (embed, batch, search, init) - 75x faster with agentic-flow |
claims |
4 | Claims-based authorization (check, grant, revoke, list) |
migrate |
5 | V2 to V3 migration with rollback support |
process |
4 | Background process management |
doctor |
1 | System diagnostics with health checks |
completions |
4 | Shell completions (bash, zsh, fish, powershell) |
# Initialize project
npx claude-flow@v3alpha init --wizard
# Start daemon with background workers
npx claude-flow@v3alpha daemon start
# Spawn an agent
npx claude-flow@v3alpha agent spawn -t coder --name my-coder
# Initialize swarm
npx claude-flow@v3alpha swarm init --v3-mode
# Search memory (HNSW-indexed)
npx claude-flow@v3alpha memory search -q "authentication patterns"
# System diagnostics
npx claude-flow@v3alpha doctor --fix
# Security scan
npx claude-flow@v3alpha security scan --depth full
# Performance benchmark
npx claude-flow@v3alpha performance benchmark --suite allcoder, reviewer, tester, planner, researcher
security-architect, security-auditor, memory-specialist, performance-engineer
CVE remediation, input validation, path security:
InputValidator- Zod-based validation at boundariesPathValidator- Path traversal preventionSafeExecutor- Command injection protectionPasswordHasher- bcrypt hashingTokenGenerator- Secure token generation
Integrates agentic-flow optimizations for 30-50% token reduction:
import { getTokenOptimizer } from '@claude-flow/integration';
const optimizer = await getTokenOptimizer();
// Compact context (32% fewer tokens)
const ctx = await optimizer.getCompactContext("auth patterns");
// 352x faster edits = fewer retries
await optimizer.optimizedEdit(file, old, new, "typescript");
// Optimal config (100% success rate)
const config = optimizer.getOptimalConfig(agentCount);| Feature | Token Savings |
|---|---|
| ReasoningBank retrieval | -32% |
| Agent Booster edits | -15% |
| Cache (95% hit rate) | -10% |
| Optimal batch size | -20% |
hierarchical-coordinator, mesh-coordinator, adaptive-coordinator, collective-intelligence-coordinator, swarm-memory-manager
byzantine-coordinator, raft-manager, gossip-coordinator, consensus-builder, crdt-synchronizer, quorum-manager, security-manager
perf-analyzer, performance-benchmarker, task-orchestrator, memory-coordinator, smart-agent
github-modes, pr-manager, code-review-swarm, issue-tracker, release-manager, workflow-automation, project-board-sync, repo-architect, multi-repo-swarm
sparc-coord, sparc-coder, specification, pseudocode, architecture, refinement
backend-dev, mobile-dev, ml-developer, cicd-engineer, api-docs, system-architect, code-analyzer, base-template-generator
tdd-london-swarm, production-validator
| Category | Hooks | Purpose |
|---|---|---|
| Core | pre-edit, post-edit, pre-command, post-command, pre-task, post-task |
Tool lifecycle |
| Session | session-start, session-end, session-restore, notify |
Context management |
| Intelligence | route, explain, pretrain, build-agents, transfer |
Neural learning |
| Learning | intelligence (trajectory-start/step/end, pattern-store/search, stats, attention) |
Reinforcement |
| Worker | Priority | Description |
|---|---|---|
ultralearn |
normal | Deep knowledge acquisition |
optimize |
high | Performance optimization |
consolidate |
low | Memory consolidation |
predict |
normal | Predictive preloading |
audit |
critical | Security analysis |
map |
normal | Codebase mapping |
preload |
low | Resource preloading |
deepdive |
normal | Deep code analysis |
document |
normal | Auto-documentation |
refactor |
normal | Refactoring suggestions |
benchmark |
normal | Performance benchmarking |
testgaps |
normal | Test coverage analysis |
# Core hooks
npx claude-flow@v3alpha hooks pre-task --description "[task]"
npx claude-flow@v3alpha hooks post-task --task-id "[id]" --success true
npx claude-flow@v3alpha hooks post-edit --file "[file]" --train-patterns
# Session management
npx claude-flow@v3alpha hooks session-start --session-id "[id]"
npx claude-flow@v3alpha hooks session-end --export-metrics true
npx claude-flow@v3alpha hooks session-restore --session-id "[id]"
# Intelligence routing
npx claude-flow@v3alpha hooks route --task "[task]"
npx claude-flow@v3alpha hooks explain --topic "[topic]"
# Neural learning
npx claude-flow@v3alpha hooks pretrain --model-type moe --epochs 10
npx claude-flow@v3alpha hooks build-agents --agent-types coder,tester
# Background workers
npx claude-flow@v3alpha hooks worker list
npx claude-flow@v3alpha hooks worker dispatch --trigger audit
npx claude-flow@v3alpha hooks worker statusV3 includes the RuVector Intelligence System:
- SONA: Self-Optimizing Neural Architecture (<0.05ms adaptation)
- MoE: Mixture of Experts for specialized routing
- HNSW: 150x-12,500x faster pattern search
- EWC++: Elastic Weight Consolidation (prevents forgetting)
- Flash Attention: 2.49x-7.47x speedup
The 4-step intelligence pipeline:
- RETRIEVE - Fetch relevant patterns via HNSW
- JUDGE - Evaluate with verdicts (success/failure)
- DISTILL - Extract key learnings via LoRA
- CONSOLIDATE - Prevent catastrophic forgetting via EWC++
Features:
- sql.js: Cross-platform SQLite persistent cache (WASM, no native compilation)
- Document chunking: Configurable overlap and size
- Normalization: L2, L1, min-max, z-score
- Hyperbolic embeddings: Poincaré ball model for hierarchical data
- 75x faster: With agentic-flow ONNX integration
- Neural substrate: Integration with RuVector
hierarchical- Queen controls workers directlymesh- Fully connected peer networkhierarchical-mesh- Hybrid (recommended)adaptive- Dynamic based on load
byzantine- BFT (tolerates f < n/3 faulty)raft- Leader-based (tolerates f < n/2)gossip- Epidemic for eventual consistencycrdt- Conflict-free replicated data typesquorum- Configurable quorum-based
| Metric | Target | Status |
|---|---|---|
| HNSW Search | 150x-12,500x faster | Implemented (persistent) |
| Memory Reduction | 50-75% with quantization | Implemented (3.92x Int8) |
| SONA Integration | Pattern learning | Implemented (ReasoningBank) |
| Flash Attention | 2.49x-7.47x speedup | In progress |
| MCP Response | <100ms | Achieved |
| CLI Startup | <500ms | Achieved |
| SONA Adaptation | <0.05ms | In progress |
# Configuration
CLAUDE_FLOW_CONFIG=./claude-flow.config.json
CLAUDE_FLOW_LOG_LEVEL=info
# Provider API Keys
ANTHROPIC_API_KEY=sk-ant-...
OPENAI_API_KEY=sk-...
GOOGLE_API_KEY=...
# MCP Server
CLAUDE_FLOW_MCP_PORT=3000
CLAUDE_FLOW_MCP_HOST=localhost
CLAUDE_FLOW_MCP_TRANSPORT=stdio
# Memory
CLAUDE_FLOW_MEMORY_BACKEND=hybrid
CLAUDE_FLOW_MEMORY_PATH=./data/memoryRun npx claude-flow@v3alpha doctor to check:
- Node.js version (20+)
- npm version (9+)
- Git installation
- Config file validity
- Daemon status
- Memory database
- API keys
- MCP servers
- Disk space
- TypeScript installation
# Add MCP servers
claude mcp add claude-flow npx claude-flow@v3alpha mcp start
claude mcp add ruv-swarm npx ruv-swarm mcp start # Optional
claude mcp add flow-nexus npx flow-nexus@latest mcp start # Optional
# Start daemon
npx claude-flow@v3alpha daemon start
# Run doctor
npx claude-flow@v3alpha doctor --fix- Task tool: Spawn and run agents concurrently
- File operations (Read, Write, Edit, MultiEdit, Glob, Grep)
- Code generation and programming
- Bash commands and system operations
- TodoWrite and task management
- Git operations
- Swarm initialization (topology setup)
- Agent type definitions
- Task orchestration
- Memory management
- Neural features
- Performance tracking
KEY: MCP coordinates the strategy, Claude Code's Task tool executes with real agents.
When publishing CLI changes, you MUST:
- Publish
@claude-flow/cli - Publish
claude-flow(umbrella) - Update ALL dist-tags for BOTH packages
# STEP 1: Build and publish CLI
cd v3/@claude-flow/cli
npm version 3.0.0-alpha.XXX --no-git-tag-version
npm run build
npm publish --tag alpha
npm dist-tag add @claude-flow/cli@3.0.0-alpha.XXX latest
# STEP 2: Publish umbrella
cd /workspaces/claude-flow
npm version 3.0.0-alpha.YYY --no-git-tag-version
npm publish --tag v3alpha
# STEP 3: Update ALL umbrella tags (CRITICAL - DON'T SKIP!)
npm dist-tag add claude-flow@3.0.0-alpha.YYY latest
npm dist-tag add claude-flow@3.0.0-alpha.YYY alphaVerification (MUST DO before telling user):
npm view @claude-flow/cli dist-tags --json
npm view claude-flow dist-tags --json
# BOTH packages need: alpha AND latest pointing to newest version| Package | Tag | Command Users Run |
|---|---|---|
@claude-flow/cli |
alpha |
npx @claude-flow/cli@alpha |
@claude-flow/cli |
latest |
npx @claude-flow/cli@latest |
claude-flow |
alpha |
npx claude-flow@alpha |
claude-flow |
latest |
npx claude-flow@latest |
claude-flow |
v3alpha |
npx claude-flow@v3alpha |
The umbrella alpha tag is MOST commonly forgotten - users run npx claude-flow@alpha!
- Documentation: https://github.com/ruvnet/claude-flow
- Issues: https://github.com/ruvnet/claude-flow/issues
Remember: Claude Flow coordinates, Claude Code creates!
Do what has been asked; nothing more, nothing less. NEVER create files unless they're absolutely necessary for achieving your goal. ALWAYS prefer editing an existing file to creating a new one. NEVER proactively create documentation files (*.md) or README files. Only create documentation files if explicitly requested by the User. Never save working files, text/mds and tests to the root folder. After spawning a swarm, wait, don't continuously check status.