work in progress by Xenonesis · Pull Request #156 · KeygraphHQ/shannon

Xenonesis · 2026-02-21T19:24:05Z

No description provided.

fixes

Simplified

typo

italics

assets

gif

Updated Discord invite links in README.md to use a permanent invite link that will not expire. 🤖 Generated with [Claude Code](https://claude.com/claude-code) Co-Authored-By: Claude <noreply@anthropic.com>

chore: added logging

…he actual content

Simplified deliverable management by removing automatic copying to ~/Documents/pentest-deliverables/. All deliverables now remain only in <target-repo>/deliverables/, eliminating file duplication and improving UX. Changes: - Removed savePermanentDeliverables() function from src/setup/deliverables.js - Removed function call and related console output from shannon.mjs - Removed unused 'os' import from deliverables.js 🤖 Generated with [Claude Code](https://claude.com/claude-code) Co-Authored-By: Claude <noreply@anthropic.com>

Remove unnecessary screenshot storage to reduce file I/O and disk usage: - Removed screenshot directory creation - Removed --output-dir flag from Playwright MCP setup - Agents can still take screenshots, but they won't persist to disk Screenshots were not being used by any part of Shannon for analysis or reporting, making their storage unnecessary overhead. 🤖 Generated with [Claude Code](https://claude.com/claude-code) Co-Authored-By: Claude <noreply@anthropic.com>

…healing ## Unified Audit System (v3.0) - Implemented crash-safe, append-only logging to audit-logs/{hostname}_{sessionId}/ - Added session.json with comprehensive metrics (timing, cost, attempts) - Agent execution logs with turn-by-turn detail - Prompt snapshots saved to audit-logs/.../prompts/{agent}.md - SessionMutex prevents race conditions during parallel execution - Self-healing reconciliation before every CLI command ## Session Metadata Standardization - Fixed critical bug: standardized on 'id' field (not 'sessionId') throughout codebase - Updated: shannon.mjs (recon, report), src/phases/pre-recon.js - Added validation in AuditSession to fail fast on incorrect field usage - JavaScript shorthand syntax was causing wrong field names ## Schema Improvements - session.json: Added cost_usd per phase, removed redundant final_cost_usd - Renamed 'percentage' -> 'duration_percentage' for clarity - Simplified agent metrics to single total_cost_usd field - Removed unused validation object from schema ## Legacy System Removal - Removed savePromptSnapshot() - prompts now only saved by audit system - Removed target repo pollution (prompt-snapshots/ no longer created) - Single source of truth: audit-logs/{hostname}_{sessionId}/prompts/ ## Export Script Simplification - Removed JSON export mode (session.json already exists) - CSV-only export with clean columns: agent, phase, status, attempts, duration_ms, cost_usd - Tested on real session data ## Documentation - Updated CLAUDE.md with audit system architecture - Added .gitignore entry for audit-logs/ 🤖 Generated with [Claude Code](https://claude.com/claude-code) Co-Authored-By: Claude <noreply@anthropic.com>

Reasoning: - Shannon is a local CLI tool with direct filesystem access - Manual file editing (JSON, rm -rf) is simpler than reconciliation script - Automatic reconciliation runs before every command (built-in) - If auto-reconciliation has bugs, fix the code, don't create workarounds - Over-engineered for a local development tool For recovery: Just delete .shannon-store.json or edit JSON files directly 🤖 Generated with [Claude Code](https://claude.com/claude-code) Co-Authored-By: Claude <noreply@anthropic.com>

- Added comprehensive header comment explaining use case - Documents data source (session.json from audit-logs) - CSV output format and use cases clearly described - Includes usage examples and note about raw data access - Removes need for separate docs/ folder in repo Docs were design artifacts, not needed in open source repo. All relevant documentation now lives in code comments. 🤖 Generated with [Claude Code](https://claude.com/claude-code) Co-Authored-By: Claude <noreply@anthropic.com>

Reasoning: - Pollutes target repo with run-metadata.json - Redundant with audit system (session.json has all metadata) - Less useful than comprehensive audit logs - Target repos should stay clean - only deliverables belong there All debugging info now lives in audit-logs/{hostname}_{sessionId}/session.json 🤖 Generated with [Claude Code](https://claude.com/claude-code) Co-Authored-By: Claude <noreply@anthropic.com>

ROOT CAUSE: - Exploitation phase checked session.validationResults to determine eligibility - validationResults field was removed during audit system refactor - Field never existed in session schema, so all exploits were skipped THE FIX: - Exploitation phase now validates queue files directly when checking eligibility - Reads exploitation_queue.json and checks if vulnerabilities array is non-empty - No need to store validation results - just re-validate on demand CHANGES: 1. runParallelExploit() now calls safeValidateQueueAndDeliverable() directly 2. Removed validationResults parameter from markAgentCompleted() 3. Simplified calculateVulnerabilityAnalysisSummary() - no longer needs validation data 4. Simplified calculateExploitationSummary() - no longer needs validation data IMPACT: - Exploitation agents will now run when vulnerabilities are found - Queue files are the single source of truth for eligibility - Simpler architecture - no duplicate state storage 🤖 Generated with [Claude Code](https://claude.com/claude-code) Co-Authored-By: Claude <noreply@anthropic.com>

…c-ai/claude-agent-sdk Anthropic rebranded the SDK in 2025 from "Claude Code SDK" to "Claude Agent SDK". Updated all references across package.json, Dockerfile, and documentation to use the current @anthropic-ai/claude-agent-sdk package. 🤖 Generated with [Claude Code](https://claude.com/claude-code) Co-Authored-By: Claude <noreply@anthropic.com>

Remove unused files and exports to improve codebase maintainability: Phase 1 - Deleted files (5): - login_resources/generate-totp-standalone.mjs (replaced by MCP tool) - mcp-server/src/tools/index.js (unused barrel export) - mcp-server/src/utils/index.js (unused barrel export) - mcp-server/src/validation/index.js (unused barrel export) - src/agent-status.js (deprecated 309-line status manager) Phase 2 - Removed unused exports (3): - mcp-server/src/index.js: shannonHelperServer constant - mcp-server/src/utils/error-formatter.js: createFileSystemError function - src/utils/git-manager.js: cleanWorkspace (now internal-only) Phase 3 - Unexported internal functions (4): - src/checkpoint-manager.js: runSingleAgent, runAgentRange, runParallelVuln, runParallelExploit (internal use only) All Shannon CLI commands tested and verified working. 🤖 Generated with [Claude Code](https://claude.com/claude-code) Co-Authored-By: Claude <noreply@anthropic.com>

feat: add named workspaces with resume support

- Delete 4 dead files: pre-recon.ts, tool-checker.ts, input-validator.ts, environment.ts - Remove runClaudePromptWithRetry() and its now-unused imports from claude-executor.ts - De-export unused symbols: AGENT_ORDER, getParallelGroups, logError, isRouterMode, showHelp, displayTimingSummary - De-export unused types: ProcessingState, ProcessingResult, SdkMessage, MessageDispatchResult, MessageDispatchContext - Remove dead import (path from zx) in session-manager.ts and deprecated comment in config.ts

- Delete unused src/cli/ui.ts, remove zod dependency, drop 4 dead functions (logError, handleToolError, getRetryDelay, displayTimingSummary) - Remove 8 unused types/interfaces and 3 duplicate formatting utils from audit/utils.ts - Narrow export surface: make 7 message-handler functions private, remove unused audit re-exports, unexport AgentDefinition and path constants - Remove unused runClaudePrompt params (sessionMetadata, attemptNumber) and update caller - Enable tsconfig noUnusedLocals, noUnusedParameters, noImplicitReturns, noImplicitOverride, noFallthroughCasesInSwitch

- Remove 4 duplicate file I/O functions from audit/utils.ts, re-export from utils/file-io.ts - Consolidate AgentEndResult interface into new types/audit.ts - Use exported AgentDefinition from types/agents.ts in session-manager.ts - Rename AgentMetrics to AgentAuditMetrics to disambiguate from temporal/shared.ts

…cation - Add DI container (src/services/) with AgentExecutionService, ConfigLoaderService, and ExploitationCheckerService — pure domain logic with no Temporal dependencies - Introduce Result<T, E> type and ErrorCode enum for code-based error classification in classifyErrorForTemporal, replacing scattered string matching - Consolidate billing/spending cap detection into utils/billing-detection.ts with shared pattern lists across message-handlers, claude-executor, and error-handling - Extract LogStream abstraction for append-only logging with backpressure, used by both AgentLogger and WorkflowLogger - Simplify activities.ts from inline lifecycle logic to thin wrappers delegating to services, with heartbeat and error classification - Expand config-parser with human-readable AJV errors, security validation, and rule type-specific checks

- Add ActivityLogger interface wrapping Temporal's Context.current().log - Thread logger parameter through claude-executor, message-handlers, git-manager, prompt-manager, reporting, and agent validators - Remove chalk dependency from all service/activity files; CLI files keep console.log for terminal output - Replace colorFn: ChalkInstance parameter with structured logger.info/warn/error calls - Use replay-safe `log` import from @temporalio/workflow in workflows.ts

…and checkpoint

- Move error-handling, git-manager, prompt-manager, queue-validation, and reporting into src/services/ - Delete src/constants.ts — relocate AGENT_VALIDATORS and MCP_AGENT_MAPPING into session-manager.ts alongside agent definitions - Delete src/utils/output-formatter.ts — absorb filterJsonToolCalls and getAgentPrefix into ai/output-formatters.ts - Extract ActivityLogger interface into src/types/activity-logger.ts to break temporal/ → services circular dependency - Consolidate VulnType, ExploitationDecision into types/agents.ts and SessionMetadata into types/audit.ts - Remove dead timingResults/costResults globals from utils/metrics.ts and all consumers

- Remove empty section markers (// === ... ===, // --- ... ---) that duplicate JSDoc or function names - Remove "what" comments that restate the next line of code (e.g. // Save to disk, // Check for retryable patterns) - Remove file-level descriptions that restate the filename (e.g. // Pure functions for formatting console output) - Fix "Added by client" comment referencing implementation history → "Used for audit correlation" - Preserve all WHY comments: error classification groups, billing/session limit explanations, ESM interop, exactOptionalPropertyTypes, mutex reasoning

…nd agent-execution - client.ts: extract parseCliArgs, resolveWorkspace, buildPipelineInput, display helpers, waitForWorkflowResult from startPipeline - workflows.ts: extract runSequentialPhase, buildPipelineConfigs, aggregatePipelineResults to reduce workflow body - agent-execution.ts: add failAgent private method to deduplicate rollback+audit+error pattern in steps 6-8

- Add // N. Description steps to temporal layer (client, activities, workflows) - Add steps to AI layer (claude-executor: runClaudePrompt, buildMcpServers) - Add steps to services layer (prompt-manager, config-parser, git-manager) - Add steps to audit layer (metrics-tracker, audit-session) - Update CLAUDE.md comment guidelines with clearer numbered-step vs section-divider guidance

docs: add WSL2 setup guide for Windows users

refactor: decompose activities into services layer with structured error handling

- Add preflight activity that validates repo path, config, and credentials before agent execution - Add formatWorkflowError() with pipe-delimited segments for multi-line log rendering - Add remediation hints for common failures (auth, billing, config errors) - Add REPO_NOT_FOUND, AUTH_FAILED, BILLING_ERROR codes with error classification - Add formatErrorBlock() in WorkflowLogger for indented error display

…orkflow-errors.ts

Replaces validateApiKey and validateOAuthToken (direct fetch calls) with a single SDK-based query using claude-haiku-4-5-20251001. Uses SDKAssistantMessageError types for structured error classification and returns human-readable error messages for each failure case.

…type definition

…ation feat: add preflight validation phase with structured error reporting

ANTHROPIC_BASE_URL and ANTHROPIC_AUTH_TOKEN were not forwarded to the SDK subprocess environment, causing router mode to fail with "Authentication failed: Invalid API key" as the subprocess hit Anthropic directly with the placeholder key.

…through fix: pass router env vars to SDK subprocess

Copilot

Pull request overview

This PR appears to be a work-in-progress cleanup of selected XBEN benchmark deliverables (removing several generated reports/queues) plus a new repo-level VS Code setting related to the Snyk extension.

Changes:

Removed reconnaissance and injection-related deliverables for XBEN-079-24.
Removed reconnaissance + injection analysis deliverables for XBEN-041-24, and removed the comprehensive security assessment report for XBEN-022-24.
Added .vscode/settings.json enabling Snyk auto org selection in VS Code.

Reviewed changes

Copilot reviewed 6 out of 6 changed files in this pull request and generated 1 comment.

Show a summary per file

File	Description
xben-benchmark-results/XBEN-079-24/deliverables/recon_deliverable.md	Deletes the recon deliverable for XBEN-079-24.
xben-benchmark-results/XBEN-079-24/deliverables/injection_exploitation_queue.json	Deletes the injection exploitation queue for XBEN-079-24.
xben-benchmark-results/XBEN-041-24/deliverables/recon_deliverable.md	Deletes the recon deliverable for XBEN-041-24.
xben-benchmark-results/XBEN-041-24/deliverables/injection_analysis_deliverable.md	Deletes the injection analysis deliverable for XBEN-041-24.
xben-benchmark-results/XBEN-022-24/deliverables/comprehensive_security_assessment_report.md	Deletes the comprehensive security assessment report for XBEN-022-24.
.vscode/settings.json	Adds a repo-level VS Code setting for Snyk auto org selection.

💡 Add Copilot custom instructions for smarter, more guided reviews. Learn how to get started.

Copilot · 2026-02-21T19:26:46Z

.vscode/settings.json

@@ -0,0 +1,3 @@
+{
+    "snyk.advanced.autoSelectOrganization": true


This repo-level VS Code setting enables Snyk to auto-select an organization, which is typically a per-developer preference and can produce surprising behavior for other contributors (e.g., scanning against the wrong org or prompting unexpected auth). Consider removing this file from the repo, moving it to documentation, or scoping it to a project-specific setting that all contributors actually need (and/or adding .vscode/ to .gitignore if editor settings aren’t meant to be versioned).

Suggested change

"snyk.advanced.autoSelectOrganization": true

ajmallesh and others added 30 commits October 3, 2025 19:35

Initial commit

9327630

Update README.md

9e21331

fixes

Update LICENSE

9a69c70

Simplified

Update README.md

605b1d9

typo

Update README.md

e9ee158

italics

Add files via upload

3b3151c

assets

Add files via upload

ed7cbb0

gif

Create SHANNON-PRO.md

2239cee

Update README.md

64126c8

Update README.md

2ad8278

Update README.md

ad36ab6

docs: update Discord invite link to infinite expiry

c470983

Updated Discord invite links in README.md to use a permanent invite link that will not expire. 🤖 Generated with [Claude Code](https://claude.com/claude-code) Co-Authored-By: Claude <noreply@anthropic.com>

fix: renamed agent filename

76d4b3a

Update README.md

6cdd067

chore: added logging

e193d04

chore: optimized logging

ac15ac2

Merge pull request KeygraphHQ#1 from Khaushik-keygraph/main

dc290a8

chore: added logging

chore: upgrade model from Sonnet 4 -> Sonnet 4.5

de8c5ee

chore: save deliverable script decoupling deliverable creation from t…

be776c4

…he actual content

feat: migrate to use MCP tools instead of helper scripts

5571696

chore: remove deprecated scripts

4704982

ezl-keygraph and others added 23 commits February 17, 2026 00:23

Merge pull request KeygraphHQ#140 from KeygraphHQ/feat/resume-workspace

65fa830

feat: add named workspaces with resume support

feat: add resume header to workflow.log showing previous workflow ID …

81faa87

…and checkpoint

docs: update CLAUDE.md and commands for services-layer architecture

1c940ae

docs: add WSL2 setup guide for Windows users

41866bf

Merge pull request KeygraphHQ#142 from KeygraphHQ/docs/wsl-setup-guide

3412066

docs: add WSL2 setup guide for Windows users

Merge pull request KeygraphHQ#141 from KeygraphHQ/refactor/architecture

1dd9ac0

refactor: decompose activities into services layer with structured error handling

refactor: extract error formatting utilities from workflows.ts into w…

d7c3ed8

…orkflow-errors.ts

refactor: use SDK-exported SDKAssistantMessageError instead of local …

b0887e9

…type definition

Merge pull request KeygraphHQ#149 from KeygraphHQ/fix/preflight-valid…

1f726cf

…ation feat: add preflight validation phase with structured error reporting

fix: pass router env vars to SDK subprocess

2f62429

ANTHROPIC_BASE_URL and ANTHROPIC_AUTH_TOKEN were not forwarded to the SDK subprocess environment, causing router mode to fail with "Authentication failed: Invalid API key" as the subprocess hit Anthropic directly with the placeholder key.

Merge pull request KeygraphHQ#152 from KeygraphHQ/fix/router-env-pass…

f1995b7

…through fix: pass router env vars to SDK subprocess

work in progress

cd5f4b6

Copilot AI review requested due to automatic review settings February 21, 2026 19:24

Copilot started reviewing on behalf of Xenonesis February 21, 2026 19:24 View session

Copilot AI reviewed Feb 21, 2026

View reviewed changes

Add .env.example and update Copilot model configuration

46b50ff

guineenbfgservicedigital-source approved these changes Feb 25, 2026

View reviewed changes

ajmallesh closed this Mar 4, 2026

ajmallesh force-pushed the main branch from f3a833e to ab2c400 Compare March 4, 2026 23:53

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

work in progress#156

work in progress#156
Xenonesis wants to merge 167 commits intoKeygraphHQ:mainfrom
Xenonesis:main

Xenonesis commented Feb 21, 2026

Uh oh!

Copilot AI left a comment

Uh oh!

Copilot AI Feb 21, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

9 participants

		@@ -0,0 +1,3 @@
		{
		"snyk.advanced.autoSelectOrganization": true

Conversation

Xenonesis commented Feb 21, 2026

Uh oh!

Copilot AI left a comment

Choose a reason for hiding this comment

Pull request overview

Reviewed changes

Uh oh!

Copilot AI Feb 21, 2026

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

9 participants