Rehydration Kernel

Node-centric context rehydration for agentic systems.

New here? Start with the Usage Guide — 3 steps to give your AI agent graph-aware context with sequence diagrams and examples.

What This Repo Is

rehydration-kernel is a generic context engine that turns knowledge graphs into LLM-ready text. It is built around four concepts:

Nodes — entities in the graph (incidents, decisions, tasks, artifacts). Each carries kind, status, summary, and optional provenance (who said it, when)
Relationships — the core signal. Each edge carries a semantic class (causal, motivational, evidential, constraint, procedural, structural) plus rationale (why it exists), method (how), decision_id, caused_by_node_id, and sequence (step order). This explanatory metadata is what lets the LLM reason about why things happened, not just what is connected to what
Extended node detail — rich per-node content (logs, configs, error traces) persisted in Valkey, loaded in batch (MGET). Separated from the graph to keep Neo4j lean and detail updates fast without graph writes
Salience ordering — relationships are ranked by explanatory weight (causal > motivational > evidential > constraint > procedural > structural). Under token pressure, the kernel preserves causal chains and drops structural noise

What the kernel is NOT:

Not an LLM — it does not generate text, only structures and renders context
Not a RAG system — it does not do similarity search; it traverses a typed graph
Not a vector database — relationships have semantic classes and rationale, not embeddings
Not tied to any model — works with GPT, Claude, Llama, Qwen, or any LLM

Why This Matters

Structured context for small models. A small model with causal chains and rationale metadata can perform bounded graph tasks that it fails without structure. The kernel provides the why, not just the what.

Auditable reasoning. The kernel knows what rationale exists in the graph (causal_density > 0). Consumers can cross-reference the LLM's response against this ground truth to detect when reasoning is fabricated rather than preserved. Without chain-of-thought, 97% of structural responses are fabricated with high confidence. With CoT enabled, 0% — the kernel's ground truth makes fabrication deterministically detectable without a second model (see Core Thesis).

graph LR
    A[Agent / LLM app] -- gRPC --> K[Rehydration Kernel]
    K -. rendered context .-> A

    K --> N4[(Neo4j)]
    K --> VK[(Valkey)]
    K --> NT[(NATS)]

    K -. metrics .-> G[Grafana]
    K -. logs .-> G

The kernel does not own product-specific nouns. Integrating products are expected to map their own domain language to this graph model at the edge. The kernel also assumes its own infrastructure dependencies are present: Neo4j, Valkey, and NATS are required runtime components, not optional features.

Current Status

v1beta1 — production-ready RPCs, known limitations documented in docs/beta-status.md.

What is in place:

Hexagonal domain/application/adapter/transport layers
gRPC + async (NATS) contracts with CI protection (buf breaking, AsyncAPI checks)
TLS/mTLS on all infrastructure boundaries
270 unit tests + 9 container-backed integration tests + LLM-as-judge E2E benchmark (primary empirical validation harness — methodology refinement ongoing)
Multi-resolution rendering (L0/L1/L2) with auto mode selection
Quality metrics with OTel + Loki observability
Helm chart with optional infrastructure sidecars

What is out of scope:

Product-specific domain nouns (the kernel is generic)
Product-side integration adapters, shadow mode, or rollout logic
Authorization backend (scope validation is set-comparison only)

Quickstart

# Toolchain: Rust 1.90.0 (pinned in rust-toolchain.toml)
cargo test --workspace               # 270 unit tests, no infra needed
bash scripts/ci/quality-gate.sh      # format + clippy + contract + tests

docker pull ghcr.io/underpass-ai/rehydration-kernel:latest

Full guides: usage | testing | container image | Helm deploy

Architecture

The kernel uses CQRS with Event Sourcing:

Command side: UpdateContext validates, appends events to an append-only store (NATS JetStream or Valkey), with optimistic concurrency (revision check) and idempotency key outcome recording
Projection: NATS JetStream durable consumers materialize events into the read model (Neo4j for graph, Valkey for detail). Explicit ack, at-least-once delivery
Query side: GetContext, GetContextPath, RehydrateSession read from the materialized projections and render token-budgeted text

graph LR
    subgraph Command
        UC[UpdateContext] --> ES[(Event Store<br/>NATS JetStream)]
    end

    subgraph Projection
        ES -. durable consumers .-> PR[Projection Runtime]
        PR --> N4[(Neo4j<br/>graph)]
        PR --> VK[(Valkey<br/>detail)]
    end

    subgraph Query
        GC[GetContext] --> N4
        GC --> VK
        GC -. rendered context .-> A[Agent]
    end

All connections TLS. gRPC and Valkey support mTLS. OTLP is plaintext (mTLS in progress).

DDD, hexagonal boundaries, one concept per file, one use case per file.

Infrastructure:

Neo4j — graph read model (nodes, relationships, traversal)
Valkey — node detail, snapshots, projection state (dedup + checkpoints)
NATS JetStream — event store (append-only, file-backed) + projection event bus
gRPC + TLS/mTLS — supports plaintext, server TLS, mutual TLS (default: plaintext)
cl100k_base — BPE tokenization (tiktoken-rs) for accurate token budgets
OpenTelemetry + Loki — 17 active instruments + structured JSON logs. See observability
Helm chart — optional Neo4j/NATS/Valkey/Loki/Grafana/OTel Collector sidecars

Multi-Resolution Rendering

Every render produces three tiers simultaneously. Consumers pick the level they need — no separate API calls, no re-rendering.

  L0 Summary          ~100 tokens    objective, status, blocker, next action
  L1 Causal Spine     ~500 tokens    root → focus → causal/motivational/evidential chain
  L2 Evidence Pack    remaining      structural relations, neighbors, extended details

Use case	Tier	Why
Status check / quick triage	L0	Fits in a system prompt alongside other tools
Failure diagnosis / handoff resume	L0 + L1	Causal chain is the dominant signal
Deep analysis / full audit	L0 + L1 + L2	Everything the graph knows, salience-ordered

RehydrationMode auto-selects strategy based on token pressure, endpoint type, focus path, and causal density:

ReasonPreserving (default) — all tiers populated, full signal
ResumeFocused — prunes distractor branches, keeps only the causal spine. Under 8x budget reduction (4096 → 512): -3pp task accuracy, +17pp recovery

Control via max_tier on the request or let the kernel decide with rehydration_mode = AUTO.

Security

All infrastructure boundaries support TLS. The gRPC transport supports mTLS.

Boundary	Transport	Authentication
Callers → Kernel	gRPC with server TLS or mTLS	Client certificate validation against trusted CA
Kernel → Neo4j	`bolt+s://` / `neo4j+s://` with CA pinning	URI-embedded credentials via K8s secrets
Kernel → Valkey	`rediss://` with mTLS	Client certificate + key from secrets
Kernel → NATS	TLS with CA pinning, `tls_first`	Client certificate or NATS credentials
Kernel → OTel Collector	gRPC with optional mTLS via env vars	`OTEL_EXPORTER_OTLP_CA_PATH`, `_CERT_PATH`, `_KEY_PATH`

Commands are protected by idempotency key outcome recording and optimistic concurrency (revision + content hash). Credentials are never inlined — always mounted from Kubernetes secrets.

Full threat model and Helm TLS configuration: security-model.md

Contracts

gRPC proto | AsyncAPI | examples
Integration contract — what consumers can depend on
Beta status — maturity, limitations, path to v1

Repo Layout

api/proto/          gRPC contracts (v1beta1)
api/asyncapi/       async contracts (NATS JetStream)
api/examples/       request, response, and event fixtures
crates/
  rehydration-domain/       domain model, value objects, invariants
  rehydration-application/  use cases, rendering pipeline
  rehydration-adapter-*/    Neo4j, Valkey, NATS adapters
  rehydration-transport-*/  gRPC server, proto mapping
  rehydration-observability/ OTel + Loki quality observers
  rehydration-server/       composition root
  rehydration-testkit/      dataset generator, evaluation harness
  rehydration-tests-*/      integration + benchmark tests
charts/             Helm chart (kernel + optional sidecars)
docs/               guides, operations, security, observability, testing
scripts/ci/         quality gates, integration runners, coverage

Benchmark

432 LLM-as-judge evaluations across two independent judges (GPT-5.4 and Claude Sonnet 4.6), three graph scales, four noise conditions, and three random seeds. Null hypothesis rejected at 95% confidence.

Context type	Task	Recovery	Reason	Gap vs structural
Explanatory (kernel)	72% [56%, 84%]	75% [59%, 86%]	72% [56%, 84%]	+69pp
Structural (edges only)	3% [0%, 14%]	0% [0%, 10%]	0% [0%, 10%]	baseline
Mixed (both)	92% [78%, 97%]	81% [65%, 90%]	89% [75%, 96%]	+89pp

Agent: Qwen3-8B with chain-of-thought (local). Judge: GPT-5.4. Wilson 95% CI in brackets. Cross-judge validated: Sonnet 4.6 produces the same gap (+67pp). Synthetic graphs, not production workloads. Full results, methodology, and statistical analysis: docs/research/

Research

The repository includes a paper draft on explanatory graph context rehydration: docs/research/

License

Apache-2.0. See LICENSE.

Name		Name	Last commit message	Last commit date
Latest commit History 373 Commits
.github		.github
api		api
artifacts		artifacts
charts/rehydration-kernel		charts/rehydration-kernel
crates		crates
docs		docs
k8s		k8s
scripts/ci		scripts/ci
.dockerignore		.dockerignore
.gitignore		.gitignore
CODE_OF_CONDUCT.md		CODE_OF_CONDUCT.md
CONTRIBUTING.md		CONTRIBUTING.md
Cargo.lock		Cargo.lock
Cargo.toml		Cargo.toml
Dockerfile		Dockerfile
LICENSE		LICENSE
README.md		README.md
SECURITY.md		SECURITY.md
THIRD_PARTY_NOTICES.md		THIRD_PARTY_NOTICES.md
baseline-planner-v1.yaml		baseline-planner-v1.yaml
baseline-v18-no-thinking.yaml		baseline-v18-no-thinking.yaml
paper-recalc-gpt54.yaml		paper-recalc-gpt54.yaml
pressure-test.yaml		pressure-test.yaml
reasoning-comparison.yaml		reasoning-comparison.yaml
rust-toolchain.toml		rust-toolchain.toml
smoke-run.yaml		smoke-run.yaml
sonar-project.properties		sonar-project.properties

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Rehydration Kernel

What This Repo Is

Why This Matters

Current Status

Quickstart

Architecture

Multi-Resolution Rendering

Security

Contracts

Repo Layout

Benchmark

Research

License

About

Uh oh!

Releases

Packages

Uh oh!

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

Rehydration Kernel

What This Repo Is

Why This Matters

Current Status

Quickstart

Architecture

Multi-Resolution Rendering

Security

Contracts

Repo Layout

Benchmark

Research

License

About

Topics

Resources

License

Code of conduct

Contributing

Security policy

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Uh oh!

Contributors

Uh oh!

Languages

Packages