SocketDev
diff --git a/.agents/skills/fleet-agent-ci/SKILL.md b/.agents/skills/fleet-agent-ci/SKILL.md
Lines changed: 55 additions & 0 deletions
diff --git a/.agents/skills/fleet-agent-ci/reference.md b/.agents/skills/fleet-agent-ci/reference.md
Lines changed: 60 additions & 0 deletions
diff --git a/.agents/skills/fleet-auditing-gha/SKILL.md b/.agents/skills/fleet-auditing-gha/SKILL.md
Lines changed: 121 additions & 0 deletions
@@ -0,0 +1,55 @@
+---
+name: fleet-agent-ci
+description: Run this repo's GitHub Actions workflows locally in Docker with Agent CI to validate changes before pushing. Use before opening or updating a PR, after editing a workflow YAML under .github/workflows, or whenever catching a CI failure locally beats waiting on a remote runner.
+user-invocable: true
+allowed-tools: Bash, Read, Edit
+model: claude-haiku-4-5
+context: fork
+---
+
+# agent-ci
+
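+Run the repo's CI pipeline locally before pushing. Assume CI was green before you started, so a failing workflow step most likely comes from your changes; environment problems (a stopped Docker daemon, a tool missing from the runner image) are the exceptions.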
+
+RedwoodJS wrote the upstream tool and skill (MIT, https://github.com/redwoodjs/agent-ci). The fleet pins `@redwoodjs/agent-ci` in the wheelhouse catalog and wires it as the `ci:local` package script (resolved via `node_modules/.bin`, never `pnpm exec`/`npx`). Edit only in `socket-wheelhouse/template/`; the cascade refreshes downstream copies.
+
+## Requirements
+
+- **Docker must be running** — each job runs in a container. On macOS the fleet uses **OrbStack** (`open -a OrbStack`; recommended over Docker Desktop). If the daemon is down, agent-ci fails fast with `couldn't use a Docker socket at /var/run/docker.sock … missing or a dangling symlink` and exit 1 — that's the daemon, not a workflow failure. Start the provider, confirm with `docker info`, re-run. No daemon and can't start one → fall back to `greening-ci` (push + watch remote).
+- **The dep is already installed** — `@redwoodjs/agent-ci` is a fleet devDependency (`catalog:`), provisioned by `pnpm install`.
+- **`--github-token` for remote reusable workflows** — every socket-\* repo's `ci.yml` calls a `SocketDev/socket-registry/.github/workflows/…` reusable workflow. agent-ci can't fetch it without a token; pass `--github-token` (no value → auto-resolves via `gh auth token`). Omitting it makes a remote-reusable CI silently fail to resolve.
+- **macOS jobs (`runs-on: macos-*`)** run in a throwaway VM and need `tart` + `sshpass` on an Apple Silicon host (`brew install cirruslabs/cli/tart hudochenkov/sshpass/sshpass`). Without both, macOS jobs are skipped with a reason — the rest of the run still proceeds.
+
+## Run
+
+The blessed entry is the canonical `ci:local` script — it already carries the full flag set (`--all --quiet --pause-on-failure --github-token`), and pnpm resolves the `agent-ci` binary from `node_modules/.bin` cross-platform:
+
+```bash
+pnpm run ci:local
+```
+
+`--all` runs the PR/push workflows for the current branch. `--quiet` suppresses the live renderer (pipe-safe). `--pause-on-failure` stops at the first failed step and holds the container open for `retry`. `--github-token` (bare → `gh auth token`) fetches the socket-registry reusable workflow every fleet `ci.yml` calls. Pipes are safe: when stdout is not a TTY the launcher detaches and the foreground process exits **77** the moment a step pauses, so `| tee log` and `> log.txt` work.
+
+There is no `--list` or dry-run flag — `run` executes. Args after the subcommand pass through, so a typo'd flag becomes a workflow arg rather than an error.
+
+To resolve the binary from a `.mts` script, use the fleet helper. (Package.json scripts don't need it; they resolve `node_modules/.bin` themselves.) Never shell out to `which`/`command -v`: both search the global PATH and can resolve the wrong binary. The `socket/no-which-for-local-bin` rule enforces this:
+
+```ts
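+import path from 'node:path'
+import { whichSync } from '@socketsecurity/lib-stable/bin/which'
+
+// Assumes the script runs from the repo root; search only the local bin dir.
+const nodeModulesBinDir = path.join(process.cwd(), 'node_modules', '.bin')
+const agentCi = whichSync('agent-ci', { path: nodeModulesBinDir, nothrow: true })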
+```
+
+## Fix and retry
+
+When a step fails, the run pauses; under `--json`, the `run.paused` event carries the exact `retry_cmd` to copy. Fix the code, then retry the paused runner instead of restarting the whole pipeline:
+
+```bash
+node_modules/.bin/agent-ci retry --name <runner-name>
+```
+
+Call the linked binary directly (the fleet form for an ad-hoc bin invocation, same as `node_modules/.bin/oxfmt` / `tsgo` in build scripts) — never `pnpm exec`/`npx`. Re-run from an earlier step with `--from-step <N>`. Repeat fix → retry until every job passes. Don't push to trigger remote CI when agent-ci can run it locally.
+
+## Reference
+
+- **Machine-readable `--json` event stream, the full requirements rationale, and the agent-ci-vs-remote-CI decision matrix**: see [reference.md](reference.md).
@@ -0,0 +1,60 @@
+# agent-ci reference
+
+## Contents
+
+- Machine-readable output (`--json`)
+- The exit-77 pause contract
+- Requirements rationale (Docker, remote reusable workflows, macOS jobs, runner image, install)
+- When to use agent-ci vs. remote CI
+- Command summary
+
+## Machine-readable output (`--json`)
+
+Add `--json` (or set `AGENT_CI_JSON=1`) to emit an NDJSON event stream on stdout — one JSON object per line. Use it for programmatic monitoring instead of grepping plaintext.
+
+Events:
+
+- `run.start` — carries `schemaVersion: 1` and `runId`.
+- `job.start`, `job.finish` — `status: passed | failed`.
+- `step.start`, `step.finish` — `status: passed | failed | skipped`.
+- `run.paused` — carries `runner` and `retry_cmd` (the exact command to resume).
+- `run.finish` — `status: passed | failed`.
+- `diagnostic` — non-fatal notices.
+
+`--json` is independent of `--quiet`. The diff renderer is auto-suppressed under `--json` so ANSI escapes don't collide with the stream.
+
+The robust agent loop: parse the stream, react to `run.paused` by fixing the failure on the runner named in `runner`, then run the `retry_cmd` the event carries. No plaintext parsing required.
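A minimal sketch of that loop's parsing step. The reference names the events and the `runner`/`retry_cmd` fields on `run.paused`, but it does not show the full schema, so the `event` discriminator key, the `findPausedRuns` helper, and the sample payload in the usage note are assumptions, not agent-ci's documented format.

```typescript
// Hypothetical event shape: the `event` key is an assumption; `runner` and
// `retry_cmd` are the fields the reference lists on `run.paused`.
interface AgentCiEvent {
  event?: string
  runner?: string
  retry_cmd?: string
}

interface PausedRun {
  runner: string
  retryCmd: string
}

// Parse an NDJSON stream (one JSON object per line) and collect paused runs.
function findPausedRuns(ndjson: string): PausedRun[] {
  const paused: PausedRun[] = []
  for (const line of ndjson.split('\n')) {
    const trimmed = line.trim()
    if (!trimmed) {
      continue
    }
    let parsed: AgentCiEvent | null
    try {
      parsed = JSON.parse(trimmed) as AgentCiEvent | null
    } catch {
      // Skip lines that are not valid JSON instead of aborting the loop.
      continue
    }
    if (parsed && parsed.event === 'run.paused' && parsed.runner && parsed.retry_cmd) {
      paused.push({ runner: parsed.runner, retryCmd: parsed.retry_cmd })
    }
  }
  return paused
}
```

Fed a stream containing one `run.paused` line, the helper returns a single entry whose `retryCmd` can be run verbatim after the fix.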
+
+## The exit-77 pause contract
+
+When stdout is not a TTY (piped, redirected, captured by a parent process), the launcher detaches the run. The foreground process exits **77** the instant a step pauses. This frees the pipe — `| tee`, `> log.txt`, command substitution — while the container stays paused in the background, ready for `retry`. Exit 77 means "paused, awaiting retry," not "failed."
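A wrapper that captures agent-ci's output has to treat exit 77 as its own state rather than as a failure. The sketch below shows one way to classify exit codes. Only 77 ("paused, awaiting retry") comes from the contract above; mapping 0 to "passed" and every other code to "failed" is an assumption, and `classifyExit` is a hypothetical helper name.

```typescript
type RunOutcome = 'passed' | 'paused' | 'failed'

// Exit 77 is the documented "paused, awaiting retry" signal.
// Assumption: 0 means success and any other code means failure.
function classifyExit(code: number): RunOutcome {
  if (code === 0) {
    return 'passed'
  }
  if (code === 77) {
    return 'paused'
  }
  return 'failed'
}
```

A caller would branch on `'paused'` to fix and `retry`, and reserve its error path for `'failed'`.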
+
+## Requirements rationale
+
+- **Docker.** agent-ci executes each workflow job inside a container, the same way GitHub's runners do. It connects via `AGENT_CI_DOCKER_HOST` (default `unix:///var/run/docker.sock`) — **not** the standard `DOCKER_HOST` (setting `DOCKER_HOST` makes agent-ci exit with a rename error; use `AGENT_CI_DOCKER_HOST` for a remote `ssh://`/`tcp://` daemon). Without a running daemon the run cannot start; it fails fast with a dangling-socket message and exit 1. On macOS the fleet provider is **OrbStack** (`open -a OrbStack`, then `docker info` to confirm). There is no degraded mode; if you can't start a daemon, use `greening-ci` (push and watch remote CI) instead.
+- **Remote reusable workflows.** A fleet `ci.yml` doesn't contain the jobs — it `uses:` a `SocketDev/socket-registry/.github/workflows/ci.yml@<sha>` reusable workflow. agent-ci fetches that over the network, which needs `--github-token` (bare flag → `gh auth token`, or `AGENT_CI_GITHUB_TOKEN`). Without it the reusable workflow can't resolve and the run can't assemble the job graph.
+- **macOS jobs.** `runs-on: macos-*` jobs run in a real throwaway macOS VM via `tart` (Apple Silicon only) with `sshpass`. Missing either tool, or on Linux/Intel, those jobs **skip with a reason** rather than failing the run; the Linux/container jobs still execute. VM concurrency caps at `AGENT_CI_MACOS_VM_CONCURRENCY` (default 2 — tart's free tier). Windows jobs (`runs-on: windows-*`) always skip (unsupported).
+- **Missing tools in the runner image.** Jobs run in `ghcr.io/actions/actions-runner:latest`, which ships node/git/curl/jq/unzip but **not** build toolchains, `python3`, or `xz`. A job failing on a missing tool isn't your code — add a `.github/agent-ci.Dockerfile` (`FROM ghcr.io/actions/actions-runner:latest` + `apt-get install`); agent-ci picks it up automatically and caches by content hash.
+- **Install.** `@redwoodjs/agent-ci` is a fleet devDependency declared as `catalog:` in every repo's `package.json`, pinned in the wheelhouse `pnpm-workspace.yaml` catalog. `pnpm install` provisions it. The published package is a self-contained Node CLI (`dist/cli.js`) — it has no platform-binary dependencies and its `ssh2` native build scripts are declined in the fleet's `allowBuilds`/`allowScripts` (the CLI runs without them).
+
+## When to use agent-ci vs. remote CI
+
+| Situation                                                                                                    | Use                                                                                                     |
+| ------------------------------------------------------------------------------------------------------------ | ------------------------------------------------------------------------------------------------------- |
+| Edited a workflow YAML (`.github/workflows/*.yml`)                                                           | agent-ci first — a malformed workflow fails the same locally and remotely, skipping the push/wait loop. |
+| Code change that only needs lint / typecheck / unit tests                                                    | `pnpm run check --all` — faster than spinning up containers for the pure-Node gates.                    |
+| Workflow does something the local scripts don't (matrix, container steps, action wiring, secrets-shaped env) | agent-ci.                                                                                               |
+| No Docker, or the failure needs an off-machine action (a deploy, a remote service)                           | push and use `greening-ci`.                                                                             |
+
+## Command summary
+
+| Command                                                              | Purpose                                                     |
+| -------------------------------------------------------------------- | ----------------------------------------------------------- |
+| `pnpm run ci:local`                                                  | Blessed entry: `agent-ci run --all --quiet --pause-on-failure --github-token` via `node_modules/.bin`. |
+| `node_modules/.bin/agent-ci run --all --pause-on-failure --github-token` | Run the branch's PR/push workflows; pause on first failure; fetch remote reusable workflows. |
+| `node_modules/.bin/agent-ci run --workflow <path>`                   | Run a single workflow file.                                 |
+| `node_modules/.bin/agent-ci retry --name <runner>`                   | Resume a paused runner after a fix.                         |
+| `node_modules/.bin/agent-ci retry --name <runner> --from-step <N>`   | Resume from an earlier step.                                |
+| `node_modules/.bin/agent-ci abort --name <runner>`                   | Tear down a paused runner without retrying.                 |
+
+Add `--quiet` to suppress the live renderer, `--json` for the NDJSON stream. Invoke the binary via `node_modules/.bin/agent-ci` or the `ci:local` script — never `pnpm exec`/`npx` (fleet tooling ban).
@@ -0,0 +1,121 @@
+---
+name: fleet-auditing-gha
+description: Audits a repo's GitHub Actions permissions + allowlist against the fleet baseline. Reports drift only. Fixes are manual in Settings → Actions because flipping these silently is unsafe. Use when a CI failure looks like "action X is not allowed to be used", when onboarding a new fleet repo, or as a periodic fleet-wide health check.
+user-invocable: true
+allowed-tools: Read, Grep, Glob, Bash(gh:*), Bash(node:*), Bash(jq:*)
+model: claude-haiku-4-5
+context: fork
+---
+
+# auditing-gha
+
+Diff a fleet repo's GitHub Actions repository-level settings against the canonical baseline. Read-only: surfaces what to change, doesn't change it.
+
+## When to use
+
+- **"action X is not allowed to be used" CI failure**: the allowlist is missing an entry, or the policy got flipped from `selected` to `local_only`.
+- **Onboarding a new fleet repo**: before the first CI run, confirm the new repo matches the baseline so the first push doesn't hit policy errors.
+- **Periodic fleet health check**: drift accumulates. Somebody adds a workflow that needs a new action and silently flips `verified_allowed` to `true` to make it work instead of adding the explicit pattern.
+
+## What the baseline checks
+
+| Setting (per repo)                 | Baseline                   | Why                                                                                                                                                                                                                                                               |
+| ---------------------------------- | -------------------------- | ----------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------- |
+| `enabled`                          | `true`                     | Per-repo override is on. **Note**: `enabled: false` does NOT mean Actions are off — it means the per-repo override is unset and org policy is the source of truth. To get drift-detection on a repo, opt in to per-repo settings + mirror the canonical baseline. |
+| `allowed_actions`                  | `'selected'`               | "Allow enterprise, and select non-enterprise, actions and reusable workflows" — the only mode where the explicit allowlist is the source of truth.                                                                                                                |
+| `github_owned_allowed`             | `false`                    | Don't blanket-allow `actions/*`. The canonical patterns list already names every github-owned action we need; unlisted ones must be explicit.                                                                                                                     |
+| `verified_allowed`                 | `false`                    | Marketplace "verified creator" is not implicit allow — every action must be on the canonical patterns list.                                                                                                                                                       |
+| `patterns_allowed ⊇ canonical set` | Each fleet pattern present | Each canonical entry is referenced by at least one socket-registry shared workflow; missing one breaks every consumer.                                                                                                                                            |
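The scalar rows of the table can be expressed as a small check. This is an illustrative sketch, not the audit runner's implementation: the `ActionsSettings` interface merges the fields named in the table into one hypothetical object, and `findSettingDrift` and its finding strings are invented for the example.

```typescript
// Illustrative shape only: it merges the fields named in the baseline table
// into one object for the sake of the example.
interface ActionsSettings {
  enabled: boolean
  allowed_actions: string
  github_owned_allowed: boolean
  verified_allowed: boolean
}

// Return one human-readable finding per setting that departs from baseline.
function findSettingDrift(settings: ActionsSettings): string[] {
  const findings: string[] = []
  if (!settings.enabled) {
    findings.push('enabled: per-repo override is unset, org policy applies')
  }
  if (settings.allowed_actions !== 'selected') {
    findings.push(`allowed_actions: expected selected, got ${settings.allowed_actions}`)
  }
  if (settings.github_owned_allowed) {
    findings.push('github_owned_allowed: expected false')
  }
  if (settings.verified_allowed) {
    findings.push('verified_allowed: expected false')
  }
  return findings
}
```

An empty result means the scalar settings match the baseline; the patterns row needs a separate set comparison.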
+
+The **canonical patterns** (every fleet repo must have all of these):
+
+- `actions/cache/restore@*`
+- `actions/cache/save@*`
+- `actions/cache@*`
+- `actions/checkout@*`
+- `actions/deploy-pages@*`
+- `actions/download-artifact@*`
+- `actions/github-script@*`
+- `actions/setup-go@*`
+- `actions/setup-node@*`
+- `actions/setup-python@*`
+- `actions/upload-artifact@*`
+- `actions/upload-pages-artifact@*`
+- `depot/build-push-action@*`
+- `depot/setup-action@*`
+- `github/codeql-action/upload-sarif@*`
+
+Extras beyond the canonical set are tolerated (reported as info, not failure). A repo may pin a one-off action, but each extra should map to a real consumer; orphans should be pruned.
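The "missing is a failure, extra is info" rule above amounts to a two-way set difference. A minimal sketch, assuming the repo's `patterns_allowed` list is already available as an array of strings; `diffPatterns` is a hypothetical helper, not part of the audit runner.

```typescript
// The canonical patterns, copied from the list above.
const CANONICAL_PATTERNS: readonly string[] = [
  'actions/cache/restore@*',
  'actions/cache/save@*',
  'actions/cache@*',
  'actions/checkout@*',
  'actions/deploy-pages@*',
  'actions/download-artifact@*',
  'actions/github-script@*',
  'actions/setup-go@*',
  'actions/setup-node@*',
  'actions/setup-python@*',
  'actions/upload-artifact@*',
  'actions/upload-pages-artifact@*',
  'depot/build-push-action@*',
  'depot/setup-action@*',
  'github/codeql-action/upload-sarif@*',
]

// missing: canonical entries the repo lacks (a failure).
// extras: repo entries outside the canonical set (info only).
function diffPatterns(allowed: readonly string[]): { missing: string[]; extras: string[] } {
  return {
    missing: CANONICAL_PATTERNS.filter(p => allowed.indexOf(p) === -1),
    extras: allowed.filter(p => CANONICAL_PATTERNS.indexOf(p) === -1),
  }
}
```

Each entry in `extras` is a prompt to look for a real consumer before deciding whether to prune it.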
+
+**Third-party actions are NOT on the allowlist.** Anything outside `actions/`, `github/`, and `depot/` should be ported to a hand-rolled composite under `SocketDev/socket-registry/.github/actions/` rather than added here. The current set of socket-registry composite replacements:
+
+| Third-party                       | socket-registry composite  |
+| --------------------------------- | -------------------------- |
+| `dtolnay/rust-toolchain`          | `setup-rust-toolchain`     |
+| `hendrikmuhs/ccache-action`       | `setup-ccache`             |
+| `HaaLeo/publish-vscode-extension` | `publish-vscode-extension` |
+| `mlugg/setup-zig`                 | `setup-zig`                |
+| `pnpm/action-setup`               | `setup-pnpm`               |
+| `softprops/action-gh-release`     | `create-gh-release`        |
+| `Swatinem/rust-cache`             | `setup-rust-cache`         |
+
+Note: `enabled: false` from the per-repo API does NOT mean Actions are disabled. It means the per-repo override is unset and org-level policy is in effect. The skill explains this in its output.
+
+## How to invoke
+
+    node .claude/skills/fleet/auditing-gha/run.mts SocketDev/socket-btm SocketDev/socket-cli
+
+Or all-at-once with the canonical fleet list (manual today; the orchestrator skill prompt expands the list at call time):
+
+    node .claude/skills/fleet/auditing-gha/run.mts \
+      SocketDev/socket-btm \
+      SocketDev/socket-cli \
+      SocketDev/socket-lib \
+      SocketDev/socket-mcp \
+      SocketDev/socket-packageurl-js \
+      SocketDev/socket-registry \
+      SocketDev/socket-sdk-js \
+      SocketDev/socket-sdxgen \
+      SocketDev/socket-stuie \
+      SocketDev/socket-vscode \
+      SocketDev/socket-webext \
+      SocketDev/socket-wheelhouse \
+      SocketDev/ultrathink
+
+For machine-readable output (one finding per repo):
+
+    node .claude/skills/fleet/auditing-gha/run.mts --json SocketDev/socket-btm | jq
+
+## How to fix the findings
+
+Each finding line names the exact toggle to flip. The fix is **manual**: the runner does not write. Flipping these silently is a credible attack vector and should always be a human action.
+
+Two paths:
+
+1.  **Web UI (preferred)**: Repo → Settings → Actions → General. The settings map 1:1 with the audit findings:
+    - "Allow enterprise, and select non-enterprise, actions and reusable workflows" → flips `allowed_actions` to `selected`.
+    - Uncheck "Allow actions created by GitHub" → `github_owned_allowed: false`.
+    - Uncheck "Allow Marketplace actions by verified creators" → `verified_allowed: false`.
+    - "Allow specified actions and reusable workflows" textarea: paste the canonical patterns list (one per line). Existing extras can stay; remove only ones with no consumer.
+
+2.  **`gh api` PUT (admin-scoped tokens only)**: surfaced for completeness; prefer the UI:
+
+        gh api -X PUT repos/<owner>/<repo>/actions/permissions \
+          -F enabled=true -F allowed_actions=selected
+        gh api -X PUT repos/<owner>/<repo>/actions/permissions/selected-actions \
+          -F github_owned_allowed=false -F verified_allowed=false \
+          -f 'patterns_allowed[]=actions/cache/restore@*' \
+          -f 'patterns_allowed[]=actions/cache/save@*' \
+          # ...one -f per canonical pattern...
+
+    The whole-list replace semantics on the selected-actions endpoint mean **omitting a repo's existing extras drops them**. Preserve them when relevant.
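Because the endpoint replaces the whole list, the list a human submits should be the canonical set plus whichever existing extras are worth keeping. A small sketch of building that list for review or pasting; `mergePatterns` is a hypothetical helper, and the sketch only prepares text for a manual change rather than calling the API.

```typescript
// Start from the canonical patterns, add any existing entries not already
// present, and sort so the result is stable to review or paste.
function mergePatterns(canonical: readonly string[], existing: readonly string[]): string[] {
  const merged = canonical.slice()
  for (const pattern of existing) {
    if (merged.indexOf(pattern) === -1) {
      merged.push(pattern)
    }
  }
  return merged.sort()
}
```

For example, `mergePatterns(['b@*', 'a@*'], ['b@*', 'x/y@*'])` returns `['a@*', 'b@*', 'x/y@*']`.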
+
+## Anti-patterns
+
+- **Auto-PUT-ing the baseline from a script.** Don't. The settings affect every workflow on the repo and a wrong setting silently weakens supply-chain posture. The user runs the audit, the user fixes.
+- **Adding an action to the allowlist to make a one-off workflow happy.** First ask: should the workflow use a shared socket-registry workflow that already references an approved action? Adding entries to the canonical set means cascading them to every consumer org. A real commitment.
+- **Treating the audit as a security review.** It checks policy state, not workflow content. A workflow that uses an allowed action insecurely (e.g. `pull_request_target` + `actions/checkout` of an untrusted ref) is invisible to this audit; that's `pull-request-target-guard`'s job.
+
+## Companion: `greening-ci`
+
+If a CI failure shows `action <X> is not allowed by enterprise admin` or `not allowed to be used in this repository`, that's an allowlist gap. Run this audit, fix the gap manually, then re-run `/green-ci` to confirm the build goes green.