SkillGuard

Security Audit Platform for AI Agent Skills

10 Dimensions · 109 Rules · A–F Risk Grading · Token Cost Estimation

Quick Start · Features · Dimensions · Architecture · Deep Scan · Skill Integration

What is SkillGuard?

SkillGuard performs comprehensive security audits on Claude Code Skills and other AI Agent plugins. Paste a GitHub or ClawHub URL — get a full security report in seconds.

Audit methodology is based on OWASP LLM Top 10, SLSA, and Google SAIF.

Quick Start

# 1. Install dependencies
pip install -r requirements.txt

# 2. Start Redis
redis-server --daemonize yes

# 3. Start Celery worker
celery -A app.workers worker --loglevel=info --concurrency=2 &

# 4. Start API server
python -m uvicorn app.main:app --host 0.0.0.0 --port 8011

Open http://localhost:8011 → paste a Skill URL → scan.

Features

🔍 Static Analysis

Pattern-based scanning across 10 security dimensions with 109 audit rules (67 built-in + 42 configurable). Covers OWASP LLM Top 10, SLSA supply chain, and Google SAIF framework — from prompt injection to license compliance, every risk surface is checked.

🧠 Deep Scan

Goes beyond static patterns. LLM-driven dynamic analysis executes the skill in a sandboxed multi-turn conversation, traces every tool call, and collects concrete risk evidence. Powered by your own API key — BYOK (Bring Your Own Key), nothing stored on the server.

📊 A–F Risk Grading

One glance, one grade. Every skill gets a clear A–F letter rating with per-dimension score breakdown, severity distribution, and actionable remediation in both Chinese and English.

💰 Token Cost Estimation

4-level token analysis (L1 SKILL.md → L2 eager/lazy references → L3 scripts) with multi-model cost projections. Know exactly how much a skill costs before you install it.

📦 Multi-Skill Detection

Drop a monorepo URL — SkillGuard automatically discovers every skill inside, scans them in parallel, and generates an aggregated summary report with per-skill deep links.

🌐 Bilingual Reports

Full CN/EN language toggle across all pages — report, summary, trace, and progress. One click to switch, preference persisted across sessions.

⚙️ Configurable Rules

YAML-based rule engine — enable, disable, adjust severity, or add whitelist entries per project. Ship your own security policy alongside the default ruleset.

Security Dimensions

#	Dimension	Coverage
1	Prompt Injection	Direct/indirect injection patterns, zero-width chars, hidden instructions
2	Permission Escalation	Missing `allowed-tools`, shell access, dangerous tool combinations
3	Data Exfiltration	Credential theft, env leaks, outbound HTTP, webhook tunneling
4	Resource Abuse	Infinite loops, excessive fetching, unbounded retries
5	Supply Chain	Pipe-to-shell, unpinned dependencies, unverified Docker images
6	License Compliance	Proprietary restrictions, non-commercial clauses, commercial platform locks
7	Code Execution	Arbitrary code eval, dynamic imports, subprocess spawning
8	Filesystem Access	Path traversal, sensitive file reads, recursive deletion
9	Network Access	Unrestricted outbound, DNS exfiltration, proxy abuse
10	Obfuscation	Base64 payloads, encoded strings, anti-analysis techniques

Architecture

flowchart TD
    Browser((Browser)) -->|"HTTP / WebSocket"| FastAPI[FastAPI + Uvicorn :8011]
    FastAPI --> Static[Static Scan Engine]
    FastAPI -->|"Task Dispatch"| Redis[(Redis Broker + Pub/Sub)]
    FastAPI --> SQLite[(SQLite — Scans · Findings · Traces)]
    Redis -->|"Task Consume"| Celery[Celery Worker — Deep Scan]
    Celery --> SQLite
    Static --> SQLite
    Celery -->|"Anthropic Messages API"| LLM{LLM API — BYOK}
    Celery -.->|"Optional Routing"| PIPELLM[PIPELLM Gateway]
    PIPELLM -.-> LLM

classDef startEndStyle fill:#e8f5e8,stroke:#4caf50,stroke-width:3px,color:#000
classDef processStyle fill:#e3f2fd,stroke:#2196f3,stroke-width:2px,color:#000
classDef decisionStyle fill:#fff3e0,stroke:#ff9800,stroke-width:2px,color:#000
classDef dataStyle fill:#f3e5f5,stroke:#9c27b0,stroke-width:2px,color:#000
classDef outputStyle fill:#fce4ec,stroke:#e91e63,stroke-width:2px,color:#000
classDef errorStyle fill:#ffebee,stroke:#f44336,stroke-width:2px,color:#000

class Browser startEndStyle
class FastAPI,Static,Celery processStyle
class LLM decisionStyle
class Redis,SQLite dataStyle
class PIPELLM errorStyle

Deep Scan

Deep Scan uses LLM-driven dynamic analysis to go beyond static pattern matching. It executes the skill in a sandboxed environment, traces tool calls, and collects risk evidence.

Configuration — provide your own API credentials at runtime:

Field	Description
Base URL	API endpoint (e.g. `https://api.anthropic.com`)
API Key	Your API key
Model	Sonnet 4.6 Sonnet / Sonnet 4.6 Opus

No API keys are stored on the server.

Claude Code Skill

SkillGuard is also available as a Claude Code Skill for in-terminal auditing.

# Audit a skill directly from Claude Code
python3 skill/audit.py https://github.com/user/my-skill
python3 skill/audit.py https://clawhub.ai/author/skill-name

Output includes risk grade (A–F), dimension breakdown, severity findings, and remediation guidance.

Tech Stack

Backend

Component	Technology	Role
Web Framework	FastAPI 0.115	Async REST API + SSR page rendering
ASGI Server	Uvicorn 0.32	High-performance HTTP server with WebSocket support
Task Queue	Celery 5.4	Distributed background task execution for scan jobs
Message Broker	Redis 5.2	Task broker + real-time progress pub/sub channel
Database	SQLite + aiosqlite	Lightweight persistent storage for scans, findings, traces
Data Validation	Pydantic 2.9	Request/response schema validation and serialization
HTTP Client	httpx	Direct calls to Anthropic Messages API (no SDK dependency)

Frontend

Component	Technology	Role
Templating	Jinja2 3.1	Server-side HTML rendering
Reactivity	Alpine.js	Lightweight client-side interactivity (modals, toggles, i18n)
Styling	Tailwind CSS	Utility-first CSS with dark mode support
Real-time	WebSocket	Live progress updates during scan execution

Audit Engine

Component	Technology	Role
Rule Engine	PyYAML	109 configurable rules in declarative YAML format
Static Scanner	Python regex + AST	Pattern matching across 10 security dimensions
Deep Scanner	Anthropic Messages API	LLM-driven multi-turn dynamic analysis with tool execution
Token Estimator	Built-in	4-level (L1/L2-eager/L2-lazy/L3) token & cost projection
LLM Gateway	PIPELLM	Optional API gateway for model routing and key management

License

This project is licensed under the MIT License.

You are free to use, modify, and distribute SkillGuard in both personal and commercial projects. See the LICENSE file for full terms.

_{Built for the AI Agent ecosystem · ClawHub · PIPELLM}

Name		Name	Last commit message	Last commit date
Latest commit History 17 Commits
app		app
scripts		scripts
skill		skill
.dockerignore		.dockerignore
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

SkillGuard

What is SkillGuard?

Quick Start

Features

🔍 Static Analysis

🧠 Deep Scan

📊 A–F Risk Grading

💰 Token Cost Estimation

📦 Multi-Skill Detection

🌐 Bilingual Reports

⚙️ Configurable Rules

Security Dimensions

Architecture

Deep Scan

Claude Code Skill

Tech Stack

Backend

Frontend

Audit Engine

License

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

SkillGuard

What is SkillGuard?

Quick Start

Features

🔍 Static Analysis

🧠 Deep Scan

📊 A–F Risk Grading

💰 Token Cost Estimation

📦 Multi-Skill Detection

🌐 Bilingual Reports

⚙️ Configurable Rules

Security Dimensions

Architecture

Deep Scan

Claude Code Skill

Tech Stack

Backend

Frontend

Audit Engine

License

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages