Status: Draft v0.1 · specs in docs/specs/ · architecture in docs/architecture.md
Cortex is the cognitive substrate of the HiveLLM ecosystem — a single, queryable, governed memory of every meaningful interaction an AI agent has with our codebases.
Today, when an LLM (Claude, Cursor, Gemini, Codex, Copilot) works on one of our projects, three things are lost the moment the session ends:
- What happened — the conversation, tool calls, agent invocations, rationale.
- What was decided — and why one path was chosen over another.
- What was learned — patterns that worked, dead ends, recurring bugs, conventions that emerged.
Each new session starts blind. The model falls back to its weights and a few CLAUDE.md files. We've been losing institutional knowledge at every /clear.
Cortex changes the contract. Every interaction is captured, classified, embedded, related, and made retrievable. Before any model proposes a change, it consults Cortex. Decisions are formalized and audited. Laws govern behavior, violations are tracked. A dashboard makes the whole substrate observable.
The end state: an AI workflow that is analytical rather than purely generative — grounded in our own history, not just the model's training data.
Is: an orchestrator that composes existing HiveLLM services (Vectorizer, Nexus, Synap, Rulebook) plus external pieces (Meilisearch, Claude Haiku via Claude Code CLI) into a capture → classify → retrieve → govern loop.
Isn't: a new vector DB, graph DB, search engine, or coding agent. Cortex informs agents; it does not generate code.
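The capture → classify → retrieve → govern loop is easiest to see from the shape of a single captured event. Below is a minimal sketch in Rust; the field and type names are illustrative assumptions, not the schema the Phase 0 specs define:

```rust
/// Hypothetical event record — the authoritative schema lives in the
/// Phase 0 specs. Names here are illustrative only.
#[allow(dead_code)]
#[derive(Debug)]
enum EventKind {
    Prompt,
    ToolCall,
    Decision,
}

#[allow(dead_code)]
#[derive(Debug)]
struct Event {
    session_id: String,  // groups the turns of one agent session
    kind: EventKind,
    repo: String,        // slug of the repository the agent is working in
    payload: String,     // raw content: prompt text, tool args, rationale
    topics: Vec<String>, // filled in later by the Haiku classifier
}

fn main() {
    // A freshly captured tool call, before classification has run.
    let e = Event {
        session_id: "sess-001".into(),
        kind: EventKind::ToolCall,
        repo: "vectorizer".into(),
        payload: "grep -n 'slug_for_repo' src/".into(),
        topics: vec![], // empty until the classifier enriches it
    };
    println!("{:?}", e.kind);
}
```

Whatever the final schema looks like, the key property is that classification is a separate, later enrichment step: capture stays cheap and lossless, and enrichment can be re-run.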
```
┌──────────────────────────────────────────────────────────────────┐
│                         AI Tools / IDEs                          │
│    Claude Code · Cursor · Gemini · Codex · Copilot · Windsurf    │
└──────────────────────────────────────────────────────────────────┘
            │                                ▲
            │ capture (hooks)                │ retrieve (MCP / HTTP)
            ▼                                │
┌──────────────────────────────────────────────────────────────────┐
│                          C O R T E X                             │
│   Ingestion → Processing → Retrieval + Governance + Dashboard    │
└──────────────────────────────────────────────────────────────────┘
            │
            ▼
┌──────────────────────────────────────────────────────────────────┐
│                      HiveLLM Data Services                       │
│     Vectorizer · Nexus · Synap · Meilisearch · Haiku (CLI)       │
└──────────────────────────────────────────────────────────────────┘
```
Full architecture doc: docs/architecture.md.
| Layer | What it does | Spec |
|---|---|---|
| Capture | Hooks in Claude Code / Cursor / Codex / Gemini publish events (prompts, tool calls, decisions). | 10, 17 |
| Classify | Claude Haiku (via CLI or SDK) enriches events with topics, severity, PII risk. | 05 |
| Embed | Symbol-level code chunks (Tree-sitter) + doc sections go to Vectorizer (BM25 + dense). | 06 |
| Link | Nexus stores the graph of sessions, turns, tool calls, artifacts, decisions, laws, violations. | 07 |
| Full-text | Meilisearch indexes every enriched event with typo-tolerance and faceted filters. | 08 |
| Retrieve | Hybrid query API (vector + keyword + graph), fused with Reciprocal Rank Fusion. P50 < 50 ms. | 11 |
| Pre-thinking | Adapter injects a compact context bundle (decisions, laws, similar turns, snippets) into every prompt. | 12 |
| Govern | Laws with sandboxed detectors (Deno); blocking + observational; punishment ladder; trust scores. | 13, 14 |
| Analyze | Deep Analysis: structured multi-agent debate → audited Decision. | 15 |
| Observe | React + TS dashboard: live timeline, memory, decisions, laws, analyses, tool analytics, graph. | 16 |
| Bootstrap | One-shot + incremental CLI that backfills the ~17 existing HiveLLM repos before live capture. | 09 |
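The Retrieve layer fuses vector, keyword, and graph result lists with Reciprocal Rank Fusion. A self-contained sketch of standard RRF follows; the real fusion lives in the spec-11 query API, and `k = 60` is the constant from the original RRF paper, not a value taken from the spec:

```rust
use std::collections::HashMap;

/// Reciprocal Rank Fusion: merge several ranked lists of document ids.
/// Each document scores sum(1 / (k + rank)) across the lists it appears in,
/// so items ranked highly by multiple retrievers rise to the top.
fn rrf_fuse(rankings: &[Vec<&str>], k: f64) -> Vec<(String, f64)> {
    let mut scores: HashMap<String, f64> = HashMap::new();
    for ranking in rankings {
        for (rank, id) in ranking.iter().enumerate() {
            // rank is 0-based here; +1.0 makes it the usual 1-based rank.
            *scores.entry(id.to_string()).or_insert(0.0) += 1.0 / (k + rank as f64 + 1.0);
        }
    }
    let mut fused: Vec<(String, f64)> = scores.into_iter().collect();
    fused.sort_by(|a, b| b.1.partial_cmp(&a.1).unwrap());
    fused
}

fn main() {
    // Hypothetical result lists from the three retrievers.
    let vector = vec!["doc_a", "doc_b", "doc_c"];
    let keyword = vec!["doc_b", "doc_a", "doc_d"];
    let graph = vec!["doc_b", "doc_c"];
    let fused = rrf_fuse(&[vector, keyword, graph], 60.0);
    // doc_b ranks high in all three lists, so it wins the fusion.
    println!("{:?}", fused.first().map(|(id, _)| id.clone())); // Some("doc_b")
}
```

RRF needs no score normalization across retrievers — only ranks — which is what makes it a good fit for fusing heterogeneous sources like BM25, dense vectors, and graph neighbors.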
```
Cortex/
├─ docs/
│  ├─ architecture.md      # the big picture (read first)
│  └─ specs/               # 17 discrete, implementable unit specs
│     ├─ 00-index.md
│     └─ NN-*.md
├─ AGENTS.md               # team-shared rules (generated by @hivellm/rulebook)
├─ AGENTS.override.md      # project-specific overrides (stable)
├─ CLAUDE.md               # imports AGENTS.md + critical rules
├─ .claude/                # agents, skills, hooks, rules, settings
├─ .rulebook/              # specs, decisions, knowledge, tasks, memory
└─ README.md               # you are here
```
Code (cortex-core, cortex-workers, cortex-api, cortex-classifier, cortex-laws, cortex-adapters, cortex-dashboard, cortex-cli) arrives as each spec is implemented — see Roadmap.
- Rust (stable toolchain) — for the core services.
- Node.js 20+ — for the dashboard and detector sandbox tooling.
- Docker + Compose — for the local data-services stack (Vectorizer, Nexus, Synap, Meilisearch). See docs/specs/03-local-stack.md.
- Claude Code CLI — default classifier backend (the SDK path is an optional alternative).
- docs/architecture.md — vision, goals, ecosystem context.
- docs/specs/00-index.md — spec index + dependency order.
- AGENTS.md — rules for contributors and AI agents working in this repo.
Because no spec has flipped to 🟢 implemented yet, there is no runnable binary in master at the time of writing. The work unfolds in spec order (see below).
The daemon serves /v1/query, cortex_pre_thinking, and the dashboard against repos that have been bootstrapped at least once. A brand-new repo returns empty hits with notice = { code: "repo_not_indexed", … }, and cortex_status lists the indexed slugs under daemon.indexed_repos, so callers can detect the gap before issuing the next query.
To bootstrap a repo:

```shell
# canonical entry point: spec-09 bootstrap CLI
cargo run -p cortex-bootstrap -- --repo /path/to/repo
```

Subsequent calls scoped to that repo (scope.repo = "<slug>") drop the notice and start returning real results. The slug is the value cortex-storage's slug_for_repo produces — lowercase ASCII, the same form that daemon.indexed_repos reports.
Five phases, mapped to the spec list:

- Phase 0 — Foundations (2 weeks): event schema, storage layout, docker-compose local stack, cortex-core skeleton, single-repo bootstrap proof of concept (Vectorizer). Specs: 01, 02, 03, 04.
- Phase 1 — Bootstrap + Claude Code adapter + basic retrieval (4 weeks): Haiku classifier, embedder, graph writer, full-text indexer, bootstrap CLI for all 17 repos, Claude Code hooks, hybrid query API, pre-thinking injection. Specs: 05, 06, 07, 08, 09, 10, 11, 12.
- Phase 2 — Governance (4 weeks): Laws DSL + detector sandbox, enforcement engine, trust score, dashboard v1. Specs: 13, 14, 16.
- Phase 3 — Deep Analysis + multi-adapter (open): analysis workflow, Cursor/Codex/Gemini adapters, retrieval-quality evaluation. Specs: 15, 17.
- Phase 4+ — Hardening + HivehubCloud integration (open): multi-tenant, distributed deployment, trust-score feed into cloud routing.

Exact timelines are aspirational; each spec has its own acceptance criteria and dependency list.
This project is governed by @hivellm/rulebook. Before making changes:

- Read AGENTS.md and AGENTS.override.md. These contain project-specific conventions that override generic guidance.
- Use the Rulebook MCP tools (mcp__rulebook__*) for task management — never mkdir tasks manually.
- Pick the first unchecked item from the lowest-numbered phase/spec. No cherry-picking.
- Run diagnostics (type-check, lint) before tests. Zero warnings; tests must pass 100%; coverage ≥95%.
- Commits use conventional commits (feat, fix, docs, refactor, perf, test, build, ci, chore) and stay in English.
- Capture patterns, anti-patterns, and decisions through rulebook_knowledge_add, rulebook_learn_capture, and rulebook_decision_create at the end of each task.
- Vectorizer — https://github.com/hivellm/vectorizer — vector DB, MCP, embeddings.
- Nexus — https://github.com/hivellm/nexus — graph DB, Cypher, KNN.
- Synap — https://github.com/hivellm/synap — KV, streams, pub/sub.
- Rulebook — https://github.com/hivellm/rulebook — rules, memory, MCP for task management.
- Meilisearch — https://www.meilisearch.com — full-text search.
- Claude Haiku — https://www.anthropic.com/claude/haiku — classification backend via Claude Code CLI.
Licensed under the Apache License, Version 2.0.