Security Incident Runbook

SEC-7.3 (bd-2kle2) -- Step-by-step procedures for common security incident categories.

Table of Contents

- General Incident Workflow
- INC-1: Unauthorized Capability Use
- INC-2: Secret Exposure
- INC-3: Ledger Integrity Failure
- INC-4: Scanner Bypass
- INC-5: Quarantine Escape
- INC-6: Resource Quota Abuse
- Evidence Collection
- Rollback Procedures

General Incident Workflow

Every incident follows this sequence regardless of classification:

DETECT  →  CONTAIN  →  COLLECT  →  VERIFY  →  ANALYZE  →  REMEDIATE  →  POSTMORTEM

| Phase | SLA | Actions |
|---|---|---|
| Detect | Automatic | Alert raised by risk controller or CI gate |
| Contain | < 5 min | Kill extension, verify isolation |
| Collect | < 15 min | Export evidence bundle, snapshot config |
| Verify | < 30 min | Verify bundle hash, ledger chain, alert counts |
| Analyze | < 2 hours | Root cause analysis with forensic replay |
| Remediate | Varies | Policy update, scanner rule, bead filed |
| Postmortem | < 24 hours | Written, reviewed, artifacts archived |
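
The SLA column can be turned into concrete deadlines for an incident timeline. The following is a minimal sketch, not project code: the `Phase` enum, `deadline_ms`, and `is_breached` are hypothetical names, and it assumes every SLA clock starts at detection time, which the table does not state.

```rust
/// Incident phases from the workflow table, in order.
#[derive(Debug, Clone, Copy, PartialEq, Eq)]
pub enum Phase {
    Detect,
    Contain,
    Collect,
    Verify,
    Analyze,
    Remediate,
    Postmortem,
}

impl Phase {
    /// SLA budget in milliseconds, or None where the table gives no fixed
    /// bound ("Automatic" for Detect, "Varies" for Remediate).
    pub fn sla_ms(self) -> Option<u64> {
        const MIN: u64 = 60_000;
        match self {
            Phase::Detect | Phase::Remediate => None,
            Phase::Contain => Some(5 * MIN),
            Phase::Collect => Some(15 * MIN),
            Phase::Verify => Some(30 * MIN),
            Phase::Analyze => Some(120 * MIN),
            Phase::Postmortem => Some(24 * 60 * MIN),
        }
    }
}

/// Assumption: every SLA clock starts when the incident is detected.
pub fn deadline_ms(phase: Phase, detected_at_ms: u64) -> Option<u64> {
    phase.sla_ms().map(|budget| detected_at_ms + budget)
}

/// True when a phase with a fixed SLA finished at or after its deadline.
/// The table uses strict bounds ("< 5 min"), so hitting the deadline
/// exactly already counts as a breach.
pub fn is_breached(phase: Phase, detected_at_ms: u64, finished_at_ms: u64) -> bool {
    match deadline_ms(phase, detected_at_ms) {
        Some(deadline) => finished_at_ms >= deadline,
        None => false,
    }
}
```

Phases without a fixed bound return `None` rather than a made-up number, so a postmortem can report them as "no SLA" instead of "met".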

INC-1: Unauthorized Capability Use

Symptoms: Policy violation alert; a capability that policy denies was exercised anyway.

Investigation

Identify the call.
- Check security alerts filtered by category policy_violation
- Note the extension_id, capability, method, and call_id
Check policy evaluation path.
```
pi --explain-extension-policy
```
- Verify the capability is in deny_caps
- Check if per-extension overrides exist for this extension
Examine the dispatch path.
- If the call succeeded despite a Deny decision, this indicates a bypass in the dispatch pipeline
- Check dispatch_host_call_shared() -- the capability must be derived server-side via required_capability_for_host_call_static(), NOT from the caller's capability field

Verify invariants.

cargo test --test policy_profile_hardening -- --nocapture
cargo test --test capability_denial_matrix -- --nocapture
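
The dispatch-path check above hinges on one rule: the policy decision must use a capability derived from the method being called, never the capability the caller claims. The sketch below illustrates that rule only. `HostCall`, `required_capability`, `evaluate`, and the method and capability names are hypothetical stand-ins; the real mapping lives in required_capability_for_host_call_static().

```rust
/// What the extension sends. `claimed_capability` is caller-controlled
/// and must never drive the policy decision.
pub struct HostCall<'a> {
    pub method: &'a str,
    pub claimed_capability: &'a str,
}

#[derive(Debug, PartialEq, Eq)]
pub enum Decision {
    Allow,
    Deny,
}

/// Hypothetical method-to-capability table, derived on the host side.
fn required_capability(method: &str) -> Option<&'static str> {
    match method {
        "env.read" => Some("env"),
        "exec.spawn" => Some("exec"),
        "http.fetch" => Some("http"),
        _ => None,
    }
}

/// Evaluate a call against `deny_caps` using only the derived capability.
/// Unknown methods are denied (fail closed); the claimed capability is
/// deliberately ignored.
pub fn evaluate(call: &HostCall, deny_caps: &[&str]) -> Decision {
    match required_capability(call.method) {
        Some(cap) if !deny_caps.contains(&cap) => Decision::Allow,
        _ => Decision::Deny,
    }
}
```

With this shape, an extension that calls `exec.spawn` while claiming the `env` capability is still denied when `exec` is in deny_caps. If a Deny decision did not stop the call in production, look for a code path that reads the caller's field instead.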

Expected Artifacts

Security alert with category policy_violation
Risk ledger entry showing the denied call
Policy explanation snapshot at time of incident

Remediation

If policy misconfiguration: fix config, verify with --explain-extension-policy
If bypass bug: file Critical bead, add regression test in capability_denial_matrix.rs
Reference: Invariant INV-001 (policy precedence chain)

INC-2: Secret Exposure

Symptoms: Secret broker alert; environment variable value exposed to an extension.

Investigation

Identify the exposure.
- Check alerts with category secret_exposure
- Determine which env var was exposed and to which extension
Check the secret broker configuration.
- Default blocklist: 26 exact names, 11 suffixes (*_API_KEY, *_SECRET, etc.), 2 prefixes
- Verify the exposed variable matches a blocklist pattern
- Check if the variable was in the disclosure_allowlist (intentional pass-through)
Verify the broker was enabled.
- SecretBrokerPolicy.enabled must be true
- Check secret_broker_enabled in the policy explanation
Examine the exposure path.
- Was it via env.read hostcall?
- Was it via exec (child process inheriting env)?
- Was it via http (sent as header/body)?
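
To check by hand whether an exposed variable should have been caught, it helps to model the three pattern kinds and the allowlist together. This is a rough sketch under stated assumptions: `SecretBlocklist`, `Verdict`, and `classify` are hypothetical names, matching is assumed to be case-insensitive, the allowlist is assumed to take precedence, and the entries in the usage note are illustrative rather than the real default blocklist.

```rust
/// Hypothetical model of the broker's patterns.
pub struct SecretBlocklist<'a> {
    pub exact: &'a [&'a str],
    pub suffixes: &'a [&'a str],
    pub prefixes: &'a [&'a str],
    pub disclosure_allowlist: &'a [&'a str],
}

#[derive(Debug, PartialEq, Eq)]
pub enum Verdict {
    /// Matches a blocklist pattern: exposure means the broker was bypassed.
    Blocked,
    /// Intentional pass-through via the disclosure allowlist.
    AllowlistedPassThrough,
    /// No pattern covers this name: a blocklist gap.
    NotCovered,
}

impl<'a> SecretBlocklist<'a> {
    /// Classify an environment variable name.
    /// Assumptions: case-insensitive matching, allowlist wins over blocklist.
    pub fn classify(&self, name: &str) -> Verdict {
        let upper = name.to_ascii_uppercase();
        if self
            .disclosure_allowlist
            .iter()
            .any(|a| a.eq_ignore_ascii_case(name))
        {
            return Verdict::AllowlistedPassThrough;
        }
        let hit = self.exact.iter().any(|e| *e == upper.as_str())
            || self.suffixes.iter().any(|s| upper.ends_with(*s))
            || self.prefixes.iter().any(|p| upper.starts_with(*p));
        if hit {
            Verdict::Blocked
        } else {
            Verdict::NotCovered
        }
    }
}
```

For example, with suffixes `_API_KEY` and `_SECRET`, a name such as `OPENAI_API_KEY` classifies as `Blocked`. Each verdict maps to a remediation path: `Blocked` points at a broker bypass, `AllowlistedPassThrough` at a configuration decision to revisit, and `NotCovered` at a pattern to add.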

Expected Artifacts

Secret broker ledger entry with name_hash of the exposed variable
Alert with category secret_exposure
Incident evidence bundle with secret_broker section populated

Remediation

Immediate: Rotate the exposed credential
Short-term: Add the variable pattern to the blocklist if not already covered
Long-term: If exposure was via exec, verify exec mediation blocks env passthrough
Reference: Invariant INV-009 (secret broker coverage)

INC-3: Ledger Integrity Failure

Symptoms: Hash chain verification fails; verify_runtime_risk_ledger_artifact() reports errors.

Investigation

Run ledger verification. The verification function checks:
- Schema version matches expected
- Each entry's ledger_hash matches recomputed hash
- Chain linkage: each prev_ledger_hash matches the prior entry's hash
- No missing or duplicate entries
Identify the break point.
- Check verification_errors in the report
- The first broken link indicates where tampering or corruption occurred
Distinguish corruption from tampering.
- Corruption: Typically a single entry with bad hash (memory issue, serialization bug)
- Tampering: Modified/removed entries break the chain at the point of alteration
Forensic replay.
- Replay the ledger up to the break point to reconstruct valid history
- Entries after the break are untrusted and must be re-evaluated
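
The break-point search can be pictured as a single pass over the chain. The sketch below is illustrative only: `LedgerEntry`, `append`, and `first_break` are hypothetical, and the hash is a toy 64-bit FNV-1a used so the example needs no external crates. It is not the ledger's real hash function and offers no tamper resistance.

```rust
#[derive(Debug, Clone)]
pub struct LedgerEntry {
    pub payload: String,
    pub prev_ledger_hash: u64,
    pub ledger_hash: u64,
}

/// Toy stand-in hash (FNV-1a, 64-bit) over the previous hash and payload.
/// ASSUMPTION: the real ledger uses a cryptographic hash; this one only
/// keeps the example dependency-free.
fn entry_hash(prev: u64, payload: &str) -> u64 {
    let prev_bytes = prev.to_le_bytes();
    let mut h: u64 = 0xcbf2_9ce4_8422_2325;
    for b in prev_bytes.iter().chain(payload.as_bytes()) {
        h ^= u64::from(*b);
        h = h.wrapping_mul(0x0000_0100_0000_01b3);
    }
    h
}

/// Append an entry linked to the current tail (genesis links to 0).
pub fn append(ledger: &mut Vec<LedgerEntry>, payload: &str) {
    let prev = ledger.last().map_or(0, |e| e.ledger_hash);
    ledger.push(LedgerEntry {
        payload: payload.to_string(),
        prev_ledger_hash: prev,
        ledger_hash: entry_hash(prev, payload),
    });
}

/// Index of the first entry whose linkage or recomputed hash fails,
/// or None when the whole chain verifies.
pub fn first_break(ledger: &[LedgerEntry]) -> Option<usize> {
    let mut expected_prev = 0u64;
    for (i, e) in ledger.iter().enumerate() {
        let linked = e.prev_ledger_hash == expected_prev;
        let intact = e.ledger_hash == entry_hash(e.prev_ledger_hash, &e.payload);
        if !linked || !intact {
            return Some(i);
        }
        expected_prev = e.ledger_hash;
    }
    None
}
```

Entries before the returned index form the trusted prefix for replay. In this model, editing a payload trips the recomputed-hash check at that entry, while deleting an entry trips the linkage check at its successor.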

Expected Artifacts

RuntimeRiskLedgerVerificationReport with chain_valid: false
Specific verification_errors listing broken entries
Evidence bundle with risk_ledger section

Remediation

If corruption: investigate OOM or serialization bugs, file bead
If tampering: this is Critical severity -- full security audit required
Reference: Invariant INV-010 (ledger integrity)

INC-4: Scanner Bypass

Symptoms: Malicious or dangerous code was loaded despite compatibility scanning.

Investigation

Examine the extension source.
- Check for forbidden imports and constructs that the scanner should have caught: process.binding, eval, Function(), require('child_process'), etc.
Review scan results.
- The CompatibilityScanner classifies entries as forbidden, flagged, or safe
- Check if the dangerous pattern was in a form the scanner doesn't recognize

Test scanner detection.

cargo test --test install_time_security_scanner -- --nocapture

Check for evasion techniques.
- Dynamic requires: require(variable) vs require('child_process')
- String concatenation: require('child_' + 'process')
- Encoded payloads: base64 or unicode escape sequences
- Prototype pollution paths
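
The first two evasion techniques work because a literal pattern match only sees exact text. The toy classifier below shows the gap and one possible mitigation: flag any `require(` whose argument is not a single plain string literal. It is a sketch, not the CompatibilityScanner; `ScanClass`, `classify_line`, and the pattern list are hypothetical, and it does not address encoded payloads or prototype pollution.

```rust
#[derive(Debug, PartialEq, Eq)]
pub enum ScanClass {
    Forbidden,
    Flagged,
    Safe,
}

/// Illustrative literal patterns only, not the scanner's real rule set.
const FORBIDDEN_LITERALS: [&str; 3] =
    ["require('child_process')", "process.binding", "eval("];

/// True when `arg` starts with a quoted string that is immediately
/// followed by the closing parenthesis, e.g. `'fs')`.
fn is_plain_string_literal_arg(arg: &str) -> bool {
    let mut chars = arg.chars();
    let quote = match chars.next() {
        Some(q @ ('\'' | '"')) => q,
        _ => return false,
    };
    let rest = chars.as_str();
    match rest.find(quote) {
        // The quote is ASCII, so `end + 1` is a valid char boundary.
        Some(end) => rest[end + 1..].trim_start().starts_with(')'),
        None => false,
    }
}

/// True when any `require(` call has a non-literal argument
/// (a variable, a concatenation, a template, and so on).
fn has_dynamic_require(line: &str) -> bool {
    line.match_indices("require(").any(|(i, m)| {
        let arg = line[i + m.len()..].trim_start();
        !is_plain_string_literal_arg(arg)
    })
}

/// Classify one source line: literal hits are Forbidden, dynamic
/// requires are Flagged for review, everything else is Safe.
pub fn classify_line(line: &str) -> ScanClass {
    if FORBIDDEN_LITERALS.iter().any(|p| line.contains(*p)) {
        ScanClass::Forbidden
    } else if has_dynamic_require(line) {
        ScanClass::Flagged
    } else {
        ScanClass::Safe
    }
}
```

A line-based text heuristic like this is easy to defeat (aliasing `require`, splitting a call across lines), so treat it as a way to reason about a reported bypass, not as a design for the fix.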

Expected Artifacts

Scanner results for the extension (compat ledger)
The extension source code
Evidence of what the extension actually executed

Remediation

Add the evasion pattern to the scanner's forbidden/flagged patterns
Add a regression test in install_time_security_scanner.rs
Reference: SLO-02 (scanner detection >= 95%)

INC-5: Quarantine Escape

Symptoms: An extension marked as quarantined resumed execution without explicit trust promotion.

Investigation

Check trust state transitions.
- ExtensionTrustTracker manages the state machine: Unknown -> Probation -> Trusted / Quarantined
- Check whether the state transition log shows Quarantined -> Active without a valid promotion
Check the kill-switch state.
- If a kill-switch was active, verify it was not prematurely lifted
- Kill-switch lifted events should include audit metadata
Verify is_hostcall_allowed_for_trust() enforcement.
- This function gates hostcalls based on trust state
- Quarantined extensions should be blocked from all hostcalls
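
When auditing a transition log, it helps to write the allowed transitions down explicitly and scan for the first one that violates them. The sketch below is a simplified model, not the ExtensionTrustTracker: `transition_allowed`, `hostcall_allowed`, `Transition`, and `first_invalid` are hypothetical names, and the rule set (any state may be quarantined; leaving quarantine needs an explicit promotion) is an assumption drawn from the description above.

```rust
#[derive(Debug, Clone, Copy, PartialEq, Eq)]
pub enum TrustState {
    Unknown,
    Probation,
    Trusted,
    Quarantined,
}

/// Assumed transition rules for this sketch.
pub fn transition_allowed(from: TrustState, to: TrustState, explicit_promotion: bool) -> bool {
    match (from, to) {
        (TrustState::Unknown, TrustState::Probation) => true,
        (TrustState::Probation, TrustState::Trusted) => true,
        // Assumption: any non-quarantined state may be quarantined.
        (TrustState::Unknown, TrustState::Quarantined)
        | (TrustState::Probation, TrustState::Quarantined)
        | (TrustState::Trusted, TrustState::Quarantined) => true,
        // Leaving quarantine requires an explicit, audited promotion.
        (TrustState::Quarantined, TrustState::Probation)
        | (TrustState::Quarantined, TrustState::Trusted) => explicit_promotion,
        _ => false,
    }
}

/// Quarantined extensions are blocked from all hostcalls. How the real
/// gate treats the other states is not modeled here.
pub fn hostcall_allowed(state: TrustState) -> bool {
    state != TrustState::Quarantined
}

/// One row of a (hypothetical) transition log.
pub struct Transition {
    pub from: TrustState,
    pub to: TrustState,
    pub explicit_promotion: bool,
}

/// Index of the first log row that violates the rules, if any.
pub fn first_invalid(log: &[Transition]) -> Option<usize> {
    log.iter()
        .position(|t| !transition_allowed(t.from, t.to, t.explicit_promotion))
}
```

If `first_invalid` returns an index for a real log, that row is the candidate escape: either the state machine accepted a transition it should have rejected, or a promotion was recorded without authorization.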

Expected Artifacts

Trust state transition history
Kill-switch activation/deactivation events
Security alerts with the quarantined extension's ID

Remediation

If state machine bug: fix the transition logic, add regression test
If promotion was unauthorized: investigate who/what triggered it
Reference: Threat T4 (runtime abuse escalation)

INC-6: Resource Quota Abuse

Symptoms: Extension exceeds resource quotas (memory, API calls, time).

Investigation

Check quota breach events.
- drain_quota_breach_events() returns all recorded breaches
- Each breach includes: extension_id, resource type, limit, actual, timestamp
Verify quota configuration.
- Default: max_memory_mb: 256
- Per-extension overrides via ExtensionOverride.quota
Assess impact.
- Did the breach cause OOM? Check system logs
- Did it degrade other extensions? Check isolation
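
Once breach events are drained, a quick per-extension summary shows which extension overshot the most. The sketch below assumes a record shaped after the fields listed above; `QuotaBreach` and `worst_overage_pct` are hypothetical names, and the real event type returned by drain_quota_breach_events() may differ.

```rust
use std::collections::BTreeMap;

/// Hypothetical breach record mirroring the listed fields.
#[derive(Debug, Clone)]
pub struct QuotaBreach {
    pub extension_id: String,
    pub resource: String,
    pub limit: u64,
    pub actual: u64,
    pub timestamp_ms: u64,
}

/// For each extension, the worst overshoot as a whole percentage of
/// the limit (integer division, rounded down). Records with a zero
/// limit are skipped to avoid dividing by zero.
pub fn worst_overage_pct(breaches: &[QuotaBreach]) -> BTreeMap<String, u64> {
    let mut worst = BTreeMap::new();
    for b in breaches {
        if b.limit == 0 {
            continue;
        }
        let over = b.actual.saturating_sub(b.limit);
        let pct = over.saturating_mul(100) / b.limit;
        let slot = worst.entry(b.extension_id.clone()).or_insert(0u64);
        *slot = (*slot).max(pct);
    }
    worst
}
```

For example, an extension limited to 256 MB that peaked at 384 MB reports 50. A small, one-off overshoot suggests legitimate load near the limit, whereas repeated large overshoots point toward abuse.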

Expected Artifacts

Quota breach events in incident evidence bundle
Memory/resource usage telemetry
Alert with category quota_breach

Remediation

Tighten quota for the offending extension
If legitimate high usage: increase quota with documentation
Reference: SLO-14 (runtime overhead <= 3%)

Evidence Collection

Export an Incident Evidence Bundle

The incident evidence bundle aggregates all security artifacts into a single, hash-verified package.

Components included:

Runtime risk ledger (hash-chained entries)
Security alerts (filtered by time/extension/category/severity)
Hostcall telemetry (call timing and context)
Exec mediation log (command allow/deny decisions)
Secret broker log (redaction events)
Quota breach events
Risk replay (reconstructed decision timeline)
Summary statistics

Bundle integrity:

bundle_hash: SHA-256 over all content sections
schema: pi.security.incident_evidence_bundle.v1
Verify with verify_incident_evidence_bundle()

Filtering

Bundles support scoping via IncidentBundleFilter:

start_ms / end_ms: Time range
extension_id: Specific extension
alert_categories: Specific alert types
min_severity: Minimum severity threshold
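
The filter fields combine as a conjunction: an unset field matches everything, and a set field narrows the scope. The sketch below models that behavior for alerts only. `BundleFilter`, `Alert`, and the `Severity` levels are hypothetical, and the inclusive time bounds are an assumption; check IncidentBundleFilter for the actual types and edge semantics.

```rust
/// Hypothetical severity scale, ordered from least to most severe.
#[derive(Debug, Clone, Copy, PartialEq, Eq, PartialOrd, Ord)]
pub enum Severity {
    Info,
    Warning,
    Error,
    Critical,
}

pub struct Alert {
    pub ts_ms: u64,
    pub extension_id: String,
    pub category: String,
    pub severity: Severity,
}

/// Sketch of a filter with the four documented fields.
/// `None` means "do not filter on this field".
#[derive(Default)]
pub struct BundleFilter {
    pub start_ms: Option<u64>,
    pub end_ms: Option<u64>,
    pub extension_id: Option<String>,
    pub alert_categories: Option<Vec<String>>,
    pub min_severity: Option<Severity>,
}

impl BundleFilter {
    /// True when the alert passes every set field.
    /// Assumption: both time bounds are inclusive.
    pub fn matches(&self, a: &Alert) -> bool {
        self.start_ms.map_or(true, |s| a.ts_ms >= s)
            && self.end_ms.map_or(true, |e| a.ts_ms <= e)
            && self
                .extension_id
                .as_ref()
                .map_or(true, |id| *id == a.extension_id)
            && self
                .alert_categories
                .as_ref()
                .map_or(true, |cats| cats.contains(&a.category))
            && self.min_severity.map_or(true, |min| a.severity >= min)
    }
}
```

During an incident, start with a narrow filter (one extension, a short window around the alert) so the bundle is quick to verify, then widen it if the analysis points elsewhere.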

Redaction

Bundles support redaction via IncidentBundleRedactionPolicy:

redact_params_hash: Redact parameter fingerprints
redact_context_hash: Redact context hashes
redact_args_shape_hash: Redact argument shape hashes
redact_command_hash: Redact command hashes
redact_name_hash: Redact name hashes
redact_remediation: Redact remediation text

Use full redaction for external sharing; use no redaction for internal forensics.
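
The "full" and "no redaction" presets can be thought of as two constructors over the six flags. The sketch below mirrors the flag names from the list above, but everything else is hypothetical: the `RedactionPolicy` struct, the `full()` and `none()` constructors, the `ExecLogEntry` record, and the `[redacted]` placeholder are illustrative and may not match IncidentBundleRedactionPolicy.

```rust
#[derive(Debug, Clone, Copy, Default, PartialEq, Eq)]
pub struct RedactionPolicy {
    pub redact_params_hash: bool,
    pub redact_context_hash: bool,
    pub redact_args_shape_hash: bool,
    pub redact_command_hash: bool,
    pub redact_name_hash: bool,
    pub redact_remediation: bool,
}

impl RedactionPolicy {
    /// No redaction: for internal forensics.
    pub fn none() -> Self {
        Self::default()
    }

    /// Every flag set: for external sharing.
    pub fn full() -> Self {
        Self {
            redact_params_hash: true,
            redact_context_hash: true,
            redact_args_shape_hash: true,
            redact_command_hash: true,
            redact_name_hash: true,
            redact_remediation: true,
        }
    }
}

/// Hypothetical placeholder written in place of a redacted value.
pub const REDACTED: &str = "[redacted]";

/// Hypothetical exec mediation log row carrying two redactable fields.
#[derive(Debug, Clone, PartialEq, Eq)]
pub struct ExecLogEntry {
    pub command_hash: String,
    pub args_shape_hash: String,
}

fn mask(value: &str, redact: bool) -> String {
    if redact {
        REDACTED.to_string()
    } else {
        value.to_string()
    }
}

/// Return a copy of the entry with the flagged fields masked.
pub fn redact_exec_entry(entry: &ExecLogEntry, policy: RedactionPolicy) -> ExecLogEntry {
    ExecLogEntry {
        command_hash: mask(&entry.command_hash, policy.redact_command_hash),
        args_shape_hash: mask(&entry.args_shape_hash, policy.redact_args_shape_hash),
    }
}
```

Redacting into a copy keeps the unredacted original available for internal forensics while the masked copy goes into the bundle that is shared.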

Verification Steps

After collecting evidence:

Verify bundle hash matches recomputed hash
Verify ledger chain integrity (no broken links)
Verify summary counts match actual sub-artifact counts
Verify schema version matches expected
If redaction was applied, verify no raw values leak
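
Two of these checks, schema version and summary counts, are simple enough to sketch without the real bundle types. The example below is illustrative: `EvidenceBundle`, `BundleSummary`, and `check_bundle` are hypothetical, only three sections are modeled, and the hash, ledger chain, and redaction-leak checks are left to verify_incident_evidence_bundle().

```rust
pub const EXPECTED_SCHEMA: &str = "pi.security.incident_evidence_bundle.v1";

/// Counts the bundle claims about itself.
pub struct BundleSummary {
    pub ledger_entries: usize,
    pub alerts: usize,
    pub quota_breaches: usize,
}

/// Hypothetical, heavily simplified bundle: each section is a list of
/// opaque records.
pub struct EvidenceBundle {
    pub schema: String,
    pub ledger: Vec<String>,
    pub alerts: Vec<String>,
    pub quota_breaches: Vec<String>,
    pub summary: BundleSummary,
}

/// Collect every mismatch instead of stopping at the first, so the
/// incident record shows the full extent of the inconsistency.
pub fn check_bundle(b: &EvidenceBundle) -> Vec<String> {
    let mut errors = Vec::new();
    if b.schema != EXPECTED_SCHEMA {
        errors.push(format!("schema mismatch: {}", b.schema));
    }
    let counts = [
        ("ledger_entries", b.summary.ledger_entries, b.ledger.len()),
        ("alerts", b.summary.alerts, b.alerts.len()),
        ("quota_breaches", b.summary.quota_breaches, b.quota_breaches.len()),
    ];
    for (name, claimed, actual) in counts {
        if claimed != actual {
            errors.push(format!(
                "{name}: summary says {claimed}, bundle has {actual}"
            ));
        }
    }
    errors
}
```

An empty result means these two checks passed, not that the bundle is trustworthy; a count mismatch alongside a valid bundle hash would itself be worth investigating.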

Rollback Procedures

Rolling Back a Policy Change

// Before: settings.json (problematic)
{
  "extensionPolicy": {
    "profile": "permissive",
    "allowDangerous": true
  }
}

// After: rollback to safe defaults
{
  "extensionPolicy": {
    "profile": "safe",
    "allowDangerous": false
  }
}

Verification:

pi --explain-extension-policy  # Confirm safe profile active
cargo test --test security_conformance_benign -- --nocapture  # Confirm compatibility

Rolling Back Risk Controller Settings

// Revert to defaults
{
  "extensionRisk": {
    "enabled": false,
    "enforce": true,
    "alpha": 0.01,
    "windowSize": 128,
    "failClosed": true
  }
}

Setting enabled: false disables the controller entirely: it stops scoring, but existing ledger entries are preserved.

Emergency: Kill All Extensions

pi --no-extensions  # Disable all extension discovery and loading

This is the nuclear option. Use only when containment requires complete extension isolation.
