CivicProof

Agentic investigative control plane for federal spending transparency

Turn a vendor name, UEI, CAGE code, award ID, or insider tip into an evidence-grounded, citation-rich, complaint-ready case pack — assembled entirely from public federal data.

Quick Start • Architecture • API Reference • Contributing

What is CivicProof?

CivicProof is a governance-first, multi-agent investigative system that automates the most time-consuming part of federal fraud investigation: evidence assembly.

Investigators, journalists, and attorneys currently spend weeks manually pulling documents from USAspending, SEC EDGAR, DOJ press releases, and IG reports — then stitching them together by hand. CivicProof compresses that into minutes, while enforcing a hard rule that makes it defensible in legal and editorial contexts:

Every factual claim in a case pack must cite a stored artifact with a verifiable content hash — or it is blocked.

No hallucinations reach the output. The system is built to withstand scrutiny.

Key Properties

Property	Description
Evidence-grounded by construction	Every factual claim requires a citation pointing to a stored, hashed artifact. The Auditor Gate blocks anything unsupported.
Governance-first architecture	Policy controls, tool permissioning, PII redaction, and strict separation between "risk signals" and "accusations".
Multi-model routing	Vertex AI Gemini as primary (GCP credits), OpenRouter BYOK as fallback, local vLLM for dev. Provider-aware budgets and rate limits.
Reproducible audit trails	Same seed + same artifact snapshot always produces the same case pack hash. Every decision is logged.
Near-zero idle cost	Serverless GCP (Cloud Run, min instances = 0). Costs ~$0 when not processing cases.
Production-grade observability	OpenTelemetry traces, structured JSON logs queryable by `case_id`, Cloud Monitoring dashboards.

Architecture

CivicProof is designed as four independent planes. Each plane scales and fails independently.

flowchart LR
    subgraph SRC["Data Sources"]
        U1["USAspending.gov"]
        U2["SAM.gov"]
        U3["SEC EDGAR"]
        U4["DOJ API"]
        U5["Oversight.gov"]
        U6["OpenFEC"]
    end
    subgraph ING["Ingestion Plane"]
        SCH["Scheduler + Source Quotas"]
        WRK["Ingest Workers"]
        Q["Event Stream"]
    end
    subgraph EV["Evidence Plane"]
        OBJ["Artifact Store\nimmutable + versioned"]
        DB["Case Ledger\nPostgres"]
        IDX["Search Index"]
        GR["Evidence Graph"]
        KV["Cache + Rate Counters"]
    end
    subgraph AG["Agentic Control Plane"]
        ORCH["Multi-Agent Orchestrator"]
        GW["LLM Gateway\nVertex AI + OpenRouter"]
        AUD["Auditor Gate\ndeterministic"]
        EVAL["Eval + Red-Team Harness"]
    end
    subgraph SRV["Serving Plane"]
        API["REST API"]
        UI["Case Pack Viewer"]
        DASH["KPI Dashboards"]
    end
    SRC -->|"rate-limited pulls"| ING
    ING --> EV
    EV --> AG
    AG --> SRV

View interactive diagram in FigJam →

Multi-Agent Pipeline

A case is built by six agents in sequence. Each step is idempotent, logged, and budget-controlled. The final gate is deterministic — no LLM involved.

flowchart TB
    IN["Input: vendor / UEI / award ID / tip text"]

    subgraph PIPELINE["Agent Pipeline"]
        A["Entity Resolver\nCanonicalize vendor identities\nDetect aliases and subsidiaries"]
        B["Evidence Retrieval\nFetch artifacts from all sources\nRespect upstream rate limits"]
        C["Graph Builder\nBuild evidence graph\nEntities, awards, officers, addresses"]
        D["Anomaly Detector\nSole-source patterns\nModification inflation\nShared address rings"]
        E["Case Composer\nDraft structured dossier\nEvery factual claim requires citations"]
    end

    F{"Auditor Gate\nDeterministic rule engine\nNo LLM calls"}
    G["APPROVED\nCase Pack with citations,\ntimeline, risk signals"]
    H["BLOCKED\nViolation list\nMissing evidence gaps"]

    IN --> A --> B --> C --> D --> E --> F
    F -->|"all claims cited"| G
    F -->|"uncited claim detected"| H

View interactive diagram in FigJam →

Auditor Gate

The Auditor Gate is the central governance mechanism. It is a pure deterministic function — no model calls, no network calls, no exceptions. Every case pack passes through it before reaching output.

flowchart TB
    DRAFT["Case Draft from Composer"]

    R1{"Citation Required\nevery factual claim has a citation"}
    R2{"Citation Valid\ncitation points to existing artifact"}
    R3{"Hash Match\nartifact hash matches stored value"}
    R4{"No Accusations\nno banned phrases in claim text"}
    R5{"Source Diversity\nat least 2 independent sources cited"}
    R6{"PII Clean\nno SSN or personal phone in output"}

    PASS["APPROVED"]
    BLOCK["BLOCKED"]

    DRAFT --> R1
    R1 -->|"pass"| R2
    R1 -->|"fail"| BLOCK
    R2 -->|"pass"| R3
    R2 -->|"fail"| BLOCK
    R3 -->|"pass"| R4
    R3 -->|"fail"| BLOCK
    R4 -->|"pass"| R5
    R4 -->|"fail"| BLOCK
    R5 -->|"pass"| R6
    R5 -->|"fail"| BLOCK
    R6 -->|"pass"| PASS
    R6 -->|"fail"| BLOCK

View interactive diagram in FigJam →

Data Sources

All six sources are public and free. Rate limits are encoded in the system — not documented separately.

Source	Data	Rate Limit	Key Required
USAspending.gov	Federal awards, contracts, grants, subawards	5 RPS (courtesy)	No
SAM.gov	Contract opportunities, vendor registrations	4 RPS	Yes
SEC EDGAR	Corporate filings, 10-K, 10-Q, 8-K, DEF 14A	10 RPS strict	No
DOJ Press Releases	Enforcement actions, FCA settlements	4 RPS	No
Oversight.gov	Inspector General reports, recommendations	2 RPS (courtesy)	No
OpenFEC	Political contributions, committee filings	1,000 calls/hr	Yes

Request Flow

sequenceDiagram
    participant U as User
    participant API as CivicProof API
    participant OR as Orchestrator
    participant SRC as Source Connectors
    participant OBJ as Object Store
    participant DB as Case Ledger
    participant LLM as LLM Gateway
    participant AU as Auditor Gate

    U->>API: POST /v1/cases (seed: vendor / award / tip)
    API->>OR: enqueue case_build job
    OR->>SRC: fetch relevant artifacts (rate-limited)
    SRC->>OBJ: store raw artifacts with content hashes
    SRC->>DB: store artifact metadata
    OR->>LLM: propose hypotheses and extraction plans
    LLM->>DB: persist extracted entities and relations
    OR->>AU: validate all claims require citations
    AU->>DB: write policy decisions and audit log
    AU-->>API: case pack approved or blocked
    API-->>U: GET /v1/cases/{id}/pack

View interactive diagram in FigJam →

Data Model

flowchart TB
    DS["DATA_SOURCE"]
    IR["INGEST_RUN"]
    RA["RAW_ARTIFACT\ncontent_hash · storage_path · fetched_at"]
    PD["PARSED_DOC\nextracted_fields · extraction_version"]
    EM["ENTITY_MENTION\nentity_type · raw_text · confidence"]
    EN["ENTITY\ncanonical_name · uei · cage_code · cik"]
    RL["RELATIONSHIP\ntype · provenance_artifact_id"]
    CS["CASE\nseed_input · status"]
    CL["CLAIM\nclaim_type · claim_text"]
    CI["CITATION\nartifact_id · content_hash"]
    AE["AUDIT_EVENT\nstage · outcome"]

    DS -->|"triggers"| IR
    IR -->|"fetches"| RA
    RA -->|"produces"| PD
    PD -->|"contains"| EM
    EN -->|"referenced by"| EM
    EN -->|"node_a"| RL
    EN -->|"node_b"| RL
    CS -->|"contains"| CL
    CL -->|"supported by"| CI
    RA -->|"cited by"| CI
    CS -->|"logs"| AE

View interactive diagram in FigJam →

Infrastructure

flowchart TB
    subgraph CTL["Control"]
        SCH["Cloud Scheduler"]
        Q1["Pub/Sub Topics"]
        Q2["Cloud Tasks"]
    end
    subgraph COMP["Compute — Cloud Run"]
        ING["Ingestion Service"]
        WRK["Worker Service"]
        API["API Service"]
        GW["Gateway Service"]
    end
    subgraph STG["Storage"]
        GCS["Cloud Storage\nArtifact Lake · versioned · locked"]
        SQL["Cloud SQL Postgres\nCase Ledger · Audit Trail"]
        RED["Redis\nCache · Rate Limiters"]
        IDX["Search Index\nPostgres FTS → OpenSearch"]
    end
    subgraph AI["AI Layer"]
        VX["Vertex AI Gemini\nPrimary · GCP Credits"]
        OR["OpenRouter BYOK\nClaude · GPT-4o · Fallback"]
    end
    subgraph OBS["Observability"]
        LOG["Cloud Logging\nStructured JSON"]
        TRC["Cloud Trace\nOpenTelemetry"]
        MON["Cloud Monitoring\nKPI Dashboards"]
    end
    SCH -->|"cron triggers"| ING
    ING -->|"store artifacts"| GCS
    ING -->|"emit events"| Q1
    Q1 --> WRK
    WRK --> SQL
    WRK --> IDX
    WRK --> GW
    WRK -->|"enqueue retries"| Q2
    GW --> VX
    GW --> OR
    API --> SQL
    API --> IDX
    API --> RED
    WRK --> LOG
    API --> TRC
    WRK --> MON

View interactive diagram in FigJam →

LLM Gateway

The gateway treats model providers as unreliable dependencies with quotas and outages — not as magical oracles.

Task	Primary	Fallback	Max Cost/Call
Extraction	`gemini-2.0-flash`	`claude-3.5-haiku`	$0.005
Analysis	`gemini-2.0-pro`	`claude-sonnet-4`	$0.020
Composition	`gemini-2.0-pro`	`gpt-4o`	$0.050
Embeddings	`text-embedding-005`	`sentence-transformers`	$0.001

Budget controls: $0.50 per case, $5.00 per day — enforced via Redis token counters. Caching: Response cache keyed by SHA-256(model + prompt + schema). TTL: 1-24h by task type. Structured outputs: All factual outputs use JSON schema enforcement. Refusals are detectable programmatically.

Quick Start

Prerequisites

Python 3.11+
Docker + Docker Compose
make

1. Clone and configure

git clone https://github.com/d3v07/civicproof.git
cd civicproof
cp .env.example .env
# Edit .env — add your SAM.gov, OpenFEC, and OpenRouter keys

2. Start local dev stack

make dev-up
# Starts: Postgres, MinIO, Redis, Redpanda, Jaeger

3. Run migrations and seed data sources

make migrate
make seed-sources

4. Run your first case

# Trigger a case from a vendor name
curl -X POST http://localhost:8000/v1/cases \
  -H "Content-Type: application/json" \
  -d '{"seed": "Booz Allen Hamilton", "seed_type": "vendor_name"}'

# Poll for status
curl http://localhost:8000/v1/cases/{case_id}

# Download approved case pack
curl http://localhost:8000/v1/cases/{case_id}/pack

5. Run tests

make test           # unit + contract tests
make test-coverage  # with coverage report (gate: 80%)
make eval           # full eval harness (grounding + hallucination + retrieval)

API Reference

Method	Endpoint	Description
`POST`	`/v1/cases`	Create a case from a seed (vendor name, UEI, CAGE, award ID, or tip text)
`GET`	`/v1/cases/{id}`	Get case status and summary
`GET`	`/v1/cases/{id}/pack`	Download audited case pack as JSON. Returns 404 if blocked by Auditor.
`GET`	`/v1/search/entities`	Full-text entity search with autocomplete (`?q=acme&type=vendor`)
`GET`	`/v1/search/artifacts`	Full-text evidence search (`?q=false+claims+act&source=doj`)
`POST`	`/v1/ingest/runs`	Trigger a controlled backfill run for a specific source
`GET`	`/v1/metrics/public`	Live system KPIs — dossier pass rate, cost per case, hallucination caught rate

Full OpenAPI spec available at /docs when the API service is running.

Tech Stack

Layer	Local Dev	GCP Production
Compute	`uvicorn` direct	Cloud Run (min instances = 0)
Events	Redpanda	Cloud Pub/Sub + Cloud Tasks
Database	Postgres 16	Cloud SQL Postgres
Object store	MinIO	Cloud Storage + Object Lock
Search	Postgres FTS	OpenSearch (scale path)
Cache	Redis 7	Memorystore / Upstash
LLM	vLLM local	Vertex AI Gemini + OpenRouter
Tracing	Jaeger	Cloud Trace (OpenTelemetry)
Metrics	Prometheus	Cloud Monitoring

Testing

The test suite is layered to catch failures at the right level:

tests/
  unit/         parsers, normalizers, hashing, policy rules, rate limiter
  contract/     upstream API response shapes, Pub/Sub event schemas
  integration/  full tip-to-dossier pipeline on local emulators
  e2e/          end-to-end on deployed GCP environment
  red_team/     adversarial prompt injection, budget cap enforcement

Coverage gate: 80% minimum across all packages.

Eval harness release gates (must pass before any deploy):

Metric	Threshold
Audited dossier pass rate	≥ 95%
Hallucination block rate	≥ 95%
Retrieval recall@10	≥ 80%
Replay determinism	100%
Cost per case	≤ $1.00

Repository Structure

civicproof/
├── services/
│   ├── api/          REST API — public + internal endpoints (FastAPI)
│   ├── worker/       Async pipeline worker — Pub/Sub consumer + 6 agents
│   └── gateway/      LLM gateway — routing, caching, budget, content filter
├── packages/
│   ├── common/       Shared schemas, event contracts, hashing, rate limiter
│   └── eval/         Eval harness, synthetic fraud generators, red-team suite
├── infra/
│   └── terraform/    GCP infrastructure as code
├── tests/            Unit, contract, integration, e2e, red_team
├── .github/
│   └── workflows/    CI (lint + test + eval gate + deploy)
└── docker-compose.dev.yml

Security

No secrets in code — .env locally, GCP Secret Manager in production
Prompt injection defenses — content filter pre-screens all inputs before LLM calls
Rate limit compliance — upstream limits encoded per-source in the system
PII redaction — aligned with USAspending exclusion policies
Audit trails — NIST SP 800-92 aligned log management
Least-privilege IAM — dedicated service account per Cloud Run service
Output governance — system outputs "risk signals" and "hypotheses" only, never accusations

Contributing

# Branch naming
feat/S2-usaspending-connector
fix/S3-doj-parser-pagination
test/S6-hallucination-eval-suite

# Commit format
feat(worker): add USAspending V2 award connector with idempotent pagination
fix(gateway): enforce 10 RPS rate limit for SEC EDGAR
test(eval): add shell vendor ring synthetic fraud dataset

All PRs require: lint pass, test pass (80% coverage), contract tests pass, 1 review approval.

License

MIT © d3v07

Name		Name	Last commit message	Last commit date
Latest commit History 66 Commits
.github/workflows		.github/workflows
frontend		frontend
infra/terraform		infra/terraform
packages		packages
scripts		scripts
services		services
tests		tests
.gcloudignore		.gcloudignore
.gitignore		.gitignore
CLAUDE.md		CLAUDE.md
Makefile		Makefile
README.md		README.md
cloudbuild-api.yaml		cloudbuild-api.yaml
cloudbuild-frontend.yaml		cloudbuild-frontend.yaml
cloudbuild-gateway.yaml		cloudbuild-gateway.yaml
cloudbuild-migrator.yaml		cloudbuild-migrator.yaml
cloudbuild-worker.yaml		cloudbuild-worker.yaml
conftest.py		conftest.py
docker-compose.dev.yml		docker-compose.dev.yml
problem_handoff.md		problem_handoff.md
pyproject.toml		pyproject.toml

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

CivicProof

What is CivicProof?

Key Properties

Architecture

Multi-Agent Pipeline

Auditor Gate

Data Sources

Request Flow

Data Model

Infrastructure

LLM Gateway

Quick Start

Prerequisites

1. Clone and configure

2. Start local dev stack

3. Run migrations and seed data sources

4. Run your first case

5. Run tests

API Reference

Tech Stack

Testing

Repository Structure

Security

Contributing

License

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

CivicProof

What is CivicProof?

Key Properties

Architecture

Multi-Agent Pipeline

Auditor Gate

Data Sources

Request Flow

Data Model

Infrastructure

LLM Gateway

Quick Start

Prerequisites

1. Clone and configure

2. Start local dev stack

3. Run migrations and seed data sources

4. Run your first case

5. Run tests

API Reference

Tech Stack

Testing

Repository Structure

Security

Contributing

License

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages