feat: add logfire-backed intent discovery loop by nerdsane · Pull Request #82 · nerdsane/temper

nerdsane · 2026-03-24T01:16:40Z

Summary

add the IntentDiscovery OS app, ADR, and proof report for the golden-soaring-cerf plan
route evolution analysis through a real TemperAgent with Anthropic + Logfire-backed evidence tooling
persist and materialize intent-shaped evolution records/issues, plus the supporting authz and observe fixes
fix CI/readability blockers discovered while preparing the branch for merge

Verification

cargo fmt --all --check
cargo check --workspace
cargo clippy --workspace --all-targets -- -D warnings
cargo bench --workspace --no-run
cargo test --workspace
cargo run -p temper-cli -- verify --specs-dir os-apps/intent-discovery/specs

Implement the GEPA (Guided Evolution of Pareto-optimal Artifacts) infrastructure for Temper's self-improvement loop per ADR-0034. Phase 0: ADR-0034 documenting all architectural decisions Phase 1: temper-ots crate — OTS type system with DST adaptations (65 tests) Phase 2: MCP trace capture — TrajectoryBuilder in runtime.rs + protocol.rs Phase 3a: GEPA algorithm primitives in temper-evolution (27 tests) Phase 3b: host_evaluate_spec WASM host function (generic platform capability) Phase 3c: 4 GEPA WASM modules (replay, score, pareto, reflective) Phase 4: Evolution skill — EvolutionRun + SentinelMonitor IOA specs + Cedar Phase 5: Sentinel OTS failure cluster rule (threshold: 5 failures/entity type) Phase 6a: Apps → Skills rebrand across codebase with backward-compat aliases Phase 6b: Skill guide format — skill_guide field, GET /api/skills/:name endpoint, temper.get_skill() MCP method, evolution skill registered in catalog All specs pass L0-L3 verification cascade. 506+ tests pass. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>

Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>

1. Pareto dominates() now considers all objectives from both sides, not just a's keys — fixes asymmetric key handling 2. ReplayResult tracks invalid_transitions counter separately, fixing coverage score inflation 3. host_evaluate_spec returns -1 on memory read/write errors instead of silently proceeding with zero-filled buffers 4. SimWasmHost::evaluate_spec returns plain error string, not pre-formatted JSON that would get double-wrapped Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>

Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>

…yntax, live E2E proof Fixes three production bugs blocking the autonomous GEPA self-improvement loop: - spec_evaluator_fn(): correct TransitionTable::evaluate API (state, count, action) - WASM CTX_BUF_LEN: increase from 256KB to 512KB for multi-turn entity state - IOA effect syntax: fix SentinelMonitor to use supported formats (set_bool, increment) Adds entity state bloat prevention (32KB per-field cap in sync_fields), OTS trajectory storage endpoints, and EvolutionRun Cedar policies with autonomy slider. Full 11-step lifecycle verified on live server: Created → Selecting → Evaluating → Reflecting → Proposing → Verifying → Scoring → Updating → AwaitingApproval → Deploying → Completed. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>

Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>

Same fix as specs/policies/issue.cedar — the catch-all permit overrode role-based Cedar policies, causing test_pm_assign_denies_openclaw_agent_type to fail. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>

…ifacts

…-logfire-evidence

rita-aga and others added 29 commits March 18, 2026 22:28

style: cargo fmt --all

a8fcb52

Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>

fix: clippy needless_borrows_for_generic_args in MCP runtime

693126c

Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>

chore: update readability baseline for GEPA crate additions

80134c6

Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>

style: cargo fmt

0fc8f9b

Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>

style: cargo fmt

b36e1c3

Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>

fix: clippy too_many_arguments in persist_ots_trajectory

c770391

Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>

fix: clippy collapsible_if and manual_strip in skills

8e9520e

Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>

chore: update readability baseline for GEPA additions

4c43763

Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>

fix: remove blanket permit from policies/issue.cedar

d2ae6ae

Same fix as specs/policies/issue.cedar — the catch-all permit overrode role-based Cedar policies, causing test_pm_assign_denies_openclaw_agent_type to fail. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>

fix: restore MCP OTS deps after cherry-pick merge

148acaa

feat: complete GEPA wasm pipeline and frontier updates

7dc43ee

docs: record real claude GEPA live proof and trajectory

2ffd0aa

docs: expand GEPA live-proof trajectory and proof diagram

6df6316

feat: run GEPA proposer through TemperAgent with OTS-backed replay

b961da5

chore: refresh readability ratchet baseline for GEPA changes

adb326e

Fix single-run GEPA proposer reliability and document live OTS proof

7cbd965

docs: add explicit failures and limitations to GEPA live proof

6074eb8

feat: upgrade GEPA to workflow-level OTS replay and reflective patterns

7a605a7

chore: refresh readability baseline for GEPA workflow changes

521b726

docs: add comprehensive GEPA E2E proof with taxonomy and live run art…

33e1691

…ifacts

feat(temper-agent): add provider auth modes and openrouter support

1e00ea8

Restore apps as primary catalog naming

c40cc56

feat(gepa): enforce optimizer-only mutations and unmet-intent handoff

a08f66e

chore(gepa): remove remaining JEPA naming

64fe5b5

feat(intent-discovery): add logfire-backed intent evidence loop

83641e9

Merge remote-tracking branch 'origin/main' into feat/intent-discovery…

62565f9

…-logfire-evidence

rita-aga merged commit de2dc4d into main Mar 24, 2026
5 checks passed

rita-aga deleted the feat/intent-discovery-logfire-evidence branch March 24, 2026 03:07

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

feat: add logfire-backed intent discovery loop#82

feat: add logfire-backed intent discovery loop#82
rita-aga merged 29 commits intomainfrom
feat/intent-discovery-logfire-evidence

nerdsane commented Mar 24, 2026

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

nerdsane commented Mar 24, 2026

Summary

Verification

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants