Forge CLI — Software Craftsmanship, Automated

Capabilities

The full toolkit for autonomous development

Every practice that makes software reliable — automated, enforced, and built into the loop.

🧪

TDD Enforcement

Red-Green-Refactor cycle tracked and enforced. Tests are written before code, with regression detection on refactor.

🤖

Multi-Agent Teams

6 specialized agents — Architect, Implementer, Tester, Reviewer, Security, Documenter — automatically matched to tasks.

🛡

Security Scanning

Secret detection, SAST vulnerability scanning, and dependency audits run on every iteration. Blocks on critical findings.

✅

Quality Gates + Auto-Fix

5-gate pipeline: tests, coverage, security, lint, and commit validation. When gates fail, Claude automatically fixes the issues and re-runs.

📝

Conventional Commits

Every TDD phase produces a commit: test: for red, feat: for green, refactor: for cleanup. Clean git history.

⚡

Circuit Breaker

Nygard pattern detects stagnation — repeated failures, no progress, test regressions — and stops the loop before wasting tokens.

📊

Live Dashboard

Real-time Ink TUI with cost tracking, TDD phase pipeline, progress bar, Claude stream output, and toggleable detail overlay. Press d for deep metrics, q to quit safely.

💰

Cost Tracking

Real-time API cost monitoring — total spend, per-task cost, per-phase breakdown, and average cost per call. Stay in control of your token budget.

🧠

Smart Task Ordering

Tasks are sorted by priority and dependency depth. Foundational tasks with the most dependents are built first — not random order.

⏳

Rate Limit Resilience

Hits a rate limit? Forge shows a countdown modal, waits for the exact reset time, and resumes automatically. No task failures, no wasted retries.

🔄

Session Continuity

Resume interrupted runs. Task completion persists to disk. Context-exhausted sessions rotate automatically. Pick up where you left off.

🧑‍💻

Human-in-the-Loop

Task stuck? Forge pauses and asks you: retry with guidance, defer to later, skip, or abort. Your hint is injected into the next attempt.

⏭

Deferred Tasks

Skip a task for now, work on others, come back later. Deferred tasks deprioritize behind pending work and retry with a fresh count.

📐

Spec-Kit Integration

Use GitHub's spec-kit for planning (specify, plan, tasks), Forge for execution. Best of both worlds.

Documentation

Everything you need to get started

From first install to full autonomous loops — step by step.

Quick Start

Get from zero to running in under a minute. You need Node.js ≥ 20 and Claude Code CLI installed.

$ npm install -g @redgreen-labs/forge-cli

# Run inside your existing project directory
$ cd my-project
$ forge init

# Import your requirements document
$ forge import requirements.md

# Start the autonomous development loop
$ forge run --iterations 20

# Check progress anytime
$ forge status

That's it. Forge reads your requirements, builds a task dependency graph, and starts executing — test-first, with security scanning and quality gates on every iteration.

Commands

Command	Description
forge init	Initialize project, auto-detect workspaces and language
forge import <file>	Import a PRD (Markdown or JSON), scan and auto-decompose
forge run	Start the autonomous development loop
forge status	Show session progress and quality metrics
forge report	Generate a project health report (terminal, JSON, or HTML)
forge decompose	Decompose large tasks into smaller TDD-friendly subtasks
forge agents	List available agent roles and their tools

`forge init`

Option	Description
-n, --name <name>	Project name
-i, --interactive	Guided PRD creation with questions
-f, --force	Overwrite existing .forge directory
--no-scan	Skip workspace auto-detection
-v, --verbose	Show detailed scan output

`forge import`

Option	Description
-v, --verbose	Show detailed scan output
--no-scan	Skip codebase scan for existing implementations
--no-decompose	Skip automatic decomposition of large tasks

`forge run`

Option	Description
-n, --iterations <n>	Maximum iterations (default: 50)
--resume	Resume from previous run, skipping completed tasks
--no-tui	Disable live TUI (plain text output)
-v, --verbose	Show detailed executor output
--solo	Single agent mode (no team rotation)
--dry-run	Simulate execution without running Claude

`forge status`

Option	Description
--json	Output as JSON
-w, --watch	Refresh status every few seconds
--interval <seconds>	Watch interval in seconds (default: 3)

`forge report`

Option	Description
-f, --format <type>	Output format: terminal, html, or json (default: terminal)

`forge decompose`

Option	Description
--threshold <n>	Complexity threshold 1-10 (tasks above this are decomposed)
--max-subtasks <n>	Max subtasks per parent task
--dry-run	Show which tasks would be decomposed without calling Claude
-v, --verbose	Show detailed output

Task Sources

Forge supports three task formats, auto-detected in priority order:

1. Spec-Kit (recommended)

Use GitHub's spec-kit for planning, then let Forge execute:

# Generate specs with spec-kit
$ npx spec-kit specify
$ npx spec-kit plan
$ npx spec-kit tasks

# Forge auto-detects specs/tasks.md
$ forge run

Forge reads from the specs/ directory:

File	Purpose
specs/tasks.md	Task list with T-IDs, phases, dependencies
specs/constitution.md	Project principles — injected into agent prompts
specs/spec.md	Detailed requirements — injected into agent prompts
specs/plan.md	Architecture decisions — injected into agent prompts

Spec-kit task format supports markers: [P] = parallelizable, [US1] = user story ref, (depends on T001) = dependency.

2. Forge PRD (JSON)

$ forge import requirements.md    # Parses to .forge/prd.json
$ forge run

3. Markdown Task List

# Place a tasks.md in .forge/
$ forge run

Live Dashboard

The TUI shows everything at a glance — phase, progress, cost, TDD pipeline, quality gates, and Claude's real-time output.

╭──────────────────── FORGE Development Loop ────────────────────╮ ┌──────────────────────────────────────────────────────────────────┐ Phase: IMPLEMENTING Tasks: 5/12 Iter: 7 Elapsed: 14:32 Cost: $1.24 Commits: 15 Files: 3 [████████████░░] 42% Task: Implement JWT auth middleware ✓RED → ●GREEN → ○REFACTOR → ○Gates (2 cycles) Gates: ✓tests ✓security ✓lint ✗coverage ├──────────────────────────────────────────────────────────────────┤ Claude Output ⚡ Writing src/auth/middleware.ts... Running npm test -- --reporter verbose Tests 42 passed (42) Done — Cost: $0.12 Duration: 45s ├──────────────────────────────────────────────────────────────────┤ [d] Dashboard [q] Quit ✓tests ✓sec ✓lint ✗cov └──────────────────────────────────────────────────────────────────┘

Keyboard Shortcuts

Key	Action
d	Toggle dashboard overlay (cost breakdown, coverage, security findings, code quality)
q	Quit with confirmation — gracefully aborts the running Claude process

Rate Limit Modal

When the API rate limit is hit, the dashboard shows a countdown modal:

┌──────────────────────────────────────┐ │ │ │ API Rate Limit Reached │ │ │ │ Waiting for rate limit to reset... │ │ │ │ 2h 34m 12s │ │ │ │ Resets at 3:45:00 PM │ │ Session will resume automatically │ │ │ └──────────────────────────────────────┘

Forge waits for the exact reset time from the API and resumes automatically. Rate limits don't count as task failures.

Human-in-the-Loop

When a task fails maxTaskFailures times (default 3), Forge pauses and shows an interactive prompt:

┌──────────────────────────────────────────────┐ │ Task Failed (3x) │ │ │ │ Automated UI tests (integration_test) │ │ Green phase failed: Process timed out │ │ │ │ What would you like to do? │ │ │ │ ▸ Retry with guidance — provide a hint │ │ Skip for now — defer to later │ │ Skip permanently — won't retry │ │ Abort session — stop forge │ │ │ │ Use ↑↓ arrows and Enter to select │ └──────────────────────────────────────────────┘

Retry with guidance — Type a hint (e.g., "Use widget tests, not integration tests"). Your guidance is injected into the next attempt's prompt.
Skip for now (defer) — The task moves to the back of the queue. Other tasks run first, then it retries with a fresh failure count.
Skip permanently — The task is marked as skipped and won't be retried.
Abort session — Stops Forge immediately.

In non-interactive mode (--no-tui), tasks are auto-skipped after maxTaskFailures.

Configuration

Create .forge/forge.config.json in your project root:

{ "maxIterations": 50, "maxCallsPerHour": 100, "timeoutMinutes": 15, "tdd": { "enabled": true, "requireFailingTestFirst": true, "commitPerPhase": true }, "coverage": { "lineThreshold": 80, "branchThreshold": 70 }, "security": { "enabled": true, "sast": true, "dependencyAudit": true, "secretScanning": true, "blockOnSeverity": "high" }, "agents": { "team": ["architect", "implementer", "tester", "reviewer"], "soloMode": false } }

Environment Variables

Override any config option via environment variables:

Variable	Effect
FORGE_MAX_ITERATIONS	Override max loop iterations
FORGE_MAX_CALLS_PER_HOUR	Override API rate limit
FORGE_TDD_ENABLED	Enable/disable TDD enforcement
FORGE_SECURITY_ENABLED	Enable/disable security scanning

Build softwarethe right way

AI that writes code is easy.AI that crafts software is hard.

Tests First, Always

Security Is Not Optional

Every Commit Tells a Story

Know When to Stop

The full toolkit for autonomous development

TDD Enforcement

Multi-Agent Teams

Security Scanning

Quality Gates + Auto-Fix

Conventional Commits

Circuit Breaker

Live Dashboard

Cost Tracking

Smart Task Ordering

Rate Limit Resilience

Session Continuity

Human-in-the-Loop

Deferred Tasks

Spec-Kit Integration

Each iteration, every time

Plan with spec-kit.Build with Forge.

Everything you need to get started

Quick Start

Commands

forge init

forge import

forge run

forge status

forge report

forge decompose

Task Sources

1. Spec-Kit (recommended)

2. Forge PRD (JSON)

3. Markdown Task List

Live Dashboard

Keyboard Shortcuts

Rate Limit Modal

Human-in-the-Loop

Configuration

Environment Variables

Start building the right way

Build software
the right way

AI that writes code is easy.
AI that crafts software is hard.

Plan with spec-kit.
Build with Forge.

`forge init`

`forge import`

`forge run`

`forge status`

`forge report`

`forge decompose`