
feat: add logfire-backed intent discovery loop #82

Merged
rita-aga merged 29 commits into main from
feat/intent-discovery-logfire-evidence
Mar 24, 2026

Conversation

@nerdsane
Owner

Summary

  • add the IntentDiscovery OS app, ADR, and proof report for the golden-soaring-cerf plan
  • route evolution analysis through a real TemperAgent with Anthropic + Logfire-backed evidence tooling
  • persist and materialize intent-shaped evolution records/issues, plus the supporting authz and observe fixes
  • fix CI/readability blockers discovered while preparing the branch for merge

Verification

  • cargo fmt --all --check
  • cargo check --workspace
  • cargo clippy --workspace --all-targets -- -D warnings
  • cargo bench --workspace --no-run
  • cargo test --workspace
  • cargo run -p temper-cli -- verify --specs-dir os-apps/intent-discovery/specs

rita-aga and others added 29 commits March 18, 2026 22:28
Implement the GEPA (Guided Evolution of Pareto-optimal Artifacts)
infrastructure for Temper's self-improvement loop per ADR-0034.

Phase 0: ADR-0034 documenting all architectural decisions
Phase 1: temper-ots crate — OTS type system with DST adaptations (65 tests)
Phase 2: MCP trace capture — TrajectoryBuilder in runtime.rs + protocol.rs
Phase 3a: GEPA algorithm primitives in temper-evolution (27 tests)
Phase 3b: host_evaluate_spec WASM host function (generic platform capability)
Phase 3c: 4 GEPA WASM modules (replay, score, pareto, reflective)
Phase 4: Evolution skill — EvolutionRun + SentinelMonitor IOA specs + Cedar
Phase 5: Sentinel OTS failure cluster rule (threshold: 5 failures/entity type)
Phase 6a: Apps → Skills rebrand across codebase with backward-compat aliases
Phase 6b: Skill guide format — skill_guide field, GET /api/skills/:name endpoint,
          temper.get_skill() MCP method, evolution skill registered in catalog

All specs pass L0-L3 verification cascade. 506+ tests pass.
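
As a rough illustration of the Phase 5 failure cluster rule, here is a minimal sketch that flags entity types with at least 5 recorded failures. The function name `failure_clusters`, the input shape (one entity type name per failed trajectory), and the inclusive `>=` comparison are assumptions made for this example; they are not taken from the Sentinel implementation.

```rust
use std::collections::BTreeMap;

/// Threshold from the commit message: 5 failures per entity type.
const FAILURE_THRESHOLD: usize = 5;

/// Hypothetical helper: given one entity type name per failed trajectory,
/// return the entity types that reached the threshold (sorted by name).
fn failure_clusters<'a>(failed_entity_types: &[&'a str]) -> Vec<&'a str> {
    let mut counts: BTreeMap<&'a str, usize> = BTreeMap::new();
    for ty in failed_entity_types {
        *counts.entry(*ty).or_insert(0) += 1;
    }
    counts
        .into_iter()
        // Assumption: the threshold is inclusive (5 failures is enough).
        .filter(|(_, n)| *n >= FAILURE_THRESHOLD)
        .map(|(ty, _)| ty)
        .collect()
}
```

Counting per entity type keeps one noisy entity from masking a cluster in another.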

Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
1. Pareto dominates() now considers all objectives from both sides,
   not just a's keys — fixes asymmetric key handling
2. ReplayResult tracks invalid_transitions counter separately,
   fixing coverage score inflation
3. host_evaluate_spec returns -1 on memory read/write errors
   instead of silently proceeding with zero-filled buffers
4. SimWasmHost::evaluate_spec returns plain error string, not
   pre-formatted JSON that would get double-wrapped
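
Item 1 can be sketched as follows. This is an illustrative version only: the `Scores` alias, the "higher is better" convention, and treating a missing objective as the worst possible score are assumptions, not the behavior of the actual `dominates()` in temper-evolution.

```rust
use std::collections::{BTreeSet, HashMap};

/// Hypothetical score map: objective name -> value (assumed: higher is better).
type Scores = HashMap<String, f64>;

/// Returns true if `a` Pareto-dominates `b`: `a` is no worse on every
/// objective and strictly better on at least one.
fn dominates(a: &Scores, b: &Scores) -> bool {
    // Compare over the union of objectives from both sides, not just a's keys.
    let keys: BTreeSet<&String> = a.keys().chain(b.keys()).collect();
    let mut strictly_better = false;
    for key in keys {
        // Assumption: an objective missing on one side counts as the worst score.
        let av = a.get(key).copied().unwrap_or(f64::NEG_INFINITY);
        let bv = b.get(key).copied().unwrap_or(f64::NEG_INFINITY);
        if av < bv {
            return false;
        }
        if av > bv {
            strictly_better = true;
        }
    }
    strictly_better
}
```

If only `a`'s keys were iterated, a candidate that lacks an objective where `b` scores well could still appear to dominate `b`; taking the union closes that gap.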

Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
…yntax, live E2E proof

Fixes three production bugs blocking the autonomous GEPA self-improvement loop:
- spec_evaluator_fn(): correct TransitionTable::evaluate API (state, count, action)
- WASM CTX_BUF_LEN: increase from 256KB to 512KB for multi-turn entity state
- IOA effect syntax: fix SentinelMonitor to use supported formats (set_bool, increment)

Adds entity state bloat prevention (32KB per-field cap in sync_fields), OTS trajectory
storage endpoints, and EvolutionRun Cedar policies with autonomy slider.
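
One way a 32KB per-field cap could work is sketched below. The commit message does not say whether oversized fields are truncated or rejected; this example assumes truncation at a UTF-8 character boundary, and `cap_field` and `MAX_FIELD_BYTES` are hypothetical names, not identifiers from `sync_fields`.

```rust
/// Per-field cap from the commit message: 32KB.
const MAX_FIELD_BYTES: usize = 32 * 1024;

/// Hypothetical helper: return `value` unchanged if it fits, otherwise the
/// longest prefix that fits in the cap without splitting a UTF-8 character.
fn cap_field(value: &str) -> &str {
    if value.len() <= MAX_FIELD_BYTES {
        return value;
    }
    let mut end = MAX_FIELD_BYTES;
    // Walk back to a character boundary; index 0 is always a boundary.
    while !value.is_char_boundary(end) {
        end -= 1;
    }
    &value[..end]
}
```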

Full 11-step lifecycle verified on live server: Created → Selecting → Evaluating →
Reflecting → Proposing → Verifying → Scoring → Updating → AwaitingApproval →
Deploying → Completed.
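
The happy path above can be written down as a small state machine. The state names come from the lifecycle listed in this commit message; the `RunState` enum, `next`, and `happy_path` are illustrative only, and the real EvolutionRun IOA spec may define additional transitions (for example failure or rejection paths) that are not modeled here.

```rust
/// The 11 lifecycle states named in the commit message.
#[derive(Debug, Clone, Copy, PartialEq, Eq)]
enum RunState {
    Created,
    Selecting,
    Evaluating,
    Reflecting,
    Proposing,
    Verifying,
    Scoring,
    Updating,
    AwaitingApproval,
    Deploying,
    Completed,
}

impl RunState {
    /// Next state on the happy path; `None` once the run is completed.
    fn next(self) -> Option<RunState> {
        use RunState::*;
        match self {
            Created => Some(Selecting),
            Selecting => Some(Evaluating),
            Evaluating => Some(Reflecting),
            Reflecting => Some(Proposing),
            Proposing => Some(Verifying),
            Verifying => Some(Scoring),
            Scoring => Some(Updating),
            Updating => Some(AwaitingApproval),
            AwaitingApproval => Some(Deploying),
            Deploying => Some(Completed),
            Completed => None,
        }
    }
}

/// Walks the happy path from `Created` to `Completed`.
fn happy_path() -> Vec<RunState> {
    let mut state = RunState::Created;
    let mut path = vec![state];
    while let Some(next) = state.next() {
        path.push(next);
        state = next;
    }
    path
}
```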

Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
Same fix as specs/policies/issue.cedar — the catch-all permit overrode
role-based Cedar policies, causing test_pm_assign_denies_openclaw_agent_type
to fail.

Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
@rita-aga merged commit de2dc4d into main on Mar 24, 2026
5 checks passed
@rita-aga deleted the feat/intent-discovery-logfire-evidence branch on March 24, 2026 03:07
2 participants