RLM v1 (draft — base API decision needed)#86

Draft
darinkishore wants to merge 44 commits into main from feature/rlm-v1

Conversation

@darinkishore
Collaborator

Draft PR for the RLM v1 stack. Not ready to land yet — there's an unresolved base-API decision documented below.

Summary

  • 44 commits implementing the RLM (Reasoning Language Module) architecture: PyO3 runtime, perception loop, REPL/passthrough adapters, Phase 1–5 work, and downstream tightening (schema dedup, sub-LM batching, prompt redesign, etc.).
  • Branch tip: nqzvplvs 94685374 "RLM schema rendering overhaul: type dedup, nested methods, clean unions".

⚠️ Base-API ambiguity (blocker)

The branch is built on top of an orphaned multi-turn-predict refactor that never made it to main, and main has since taken a different shape. Specifically:

| Surface | This branch's base (ynqyxtnl) | Main (3c850e60 + 5bb65ca5) |
| --- | --- | --- |
| Forward signature | `forward(input, history: Option<Chat>) -> Predicted` | `forward(input) -> Predicted` |
| Continue conversation | Same `forward` with `Some(history)` | Separate `forward_continue(chat) -> (Predicted, Chat)` |
| Internal split | `compose_chat` + `execute_chat` (private) | `build_chat` + `call_and_parse` (public) |
| LM call signature | `lm.call(chat, tools, ToolLoopMode::Auto)` | `lm.call(chat, tools)` (no `ToolLoopMode`) |
| Per-instance LM override | not present | `Predict.lm: Option<Arc<LM>>` + `.lm()` builder |
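To make the divergence concrete, here is a minimal sketch contrasting the two forward shapes. All types and bodies are hypothetical stubs that mirror the signatures in the table above, not the real dspy-rs implementations:

```rust
// Hypothetical stub types -- names follow the table, not the actual crate.
#[derive(Clone, Debug, PartialEq)]
struct Chat(Vec<String>);

#[derive(Debug, PartialEq)]
struct Predicted(String);

// Main's split API: a fresh call and a continuation are separate entry points.
fn forward_split(input: &str) -> Predicted {
    Predicted(format!("answer: {input}"))
}

fn forward_continue(mut chat: Chat) -> (Predicted, Chat) {
    chat.0.push("assistant turn".to_string()); // continuation mutates and returns the chat
    (Predicted("continued".to_string()), chat)
}

// This branch's unified API: history is threaded through one signature.
fn forward_unified(input: &str, history: Option<Chat>) -> Predicted {
    match history {
        Some(_chat) => Predicted("continued".to_string()),
        None => Predicted(format!("answer: {input}")),
    }
}
```

The trade-off is the usual one: the split API makes the continuation contract explicit in the return type (`(Predicted, Chat)`), while the unified API keeps a single call site at the cost of an `Option` parameter on every fresh call.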

How it ended up like this

  1. 2026-02-19 — initial multi-turn-predict commits (zqytoppy, kqyktwmv) with the split API (forward(input) + forward_continue(chat)) + ToolLoopMode::CallerManaged.
  2. 2026-02-21 23:08 — local refactor 1e9d503a "collapse call API" (= ynqyxtnl on this branch): collapses split API back to unified forward(input, history), switches to ToolLoopMode::Auto, uses Role enum.
  3. 2026-02-22 00:21:20 — PR squash-merged to main as 3c850e60 containing only the first two commits. The "collapse call API" refactor was not included.
  4. 2026-02-22 00:21:27 — 7 seconds after the squash-merge, the local multi-turn-predict branch got the orphaned refactor as 659c3938 ynqyxtnl. RLM v1 was then built on top.
  5. 2026-04-08 — main got 5bb65ca5 feat(predict): add per-instance LM override on top of the older split API.

So the local stack uses the newer/collapsed API; main uses the older/split API plus an LM-override addition.

Two paths forward

(A) Keep main's split API. Port the RLM stack to use forward(input) + forward_continue(chat), and drop ToolLoopMode::Auto in favor of the equivalent main-side construct. The ynqyxtnl "collapse call API" refactor is effectively discarded.

(B) Restore the unified API. Land ynqyxtnl (the orphaned refactor) into main first as its own PR, re-apply 5bb65ca5 (per-instance LM override) on top, then rebase RLM. Preserves the local API design.

Both involve a non-trivial API migration in the rebase; the "update with main" delta itself is only one small commit (5bb65ca5, 143 lines).
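Either path must preserve main's per-instance LM override (5bb65ca5). A sketch of that builder pattern, assuming the shape the table describes (`Predict.lm: Option<Arc<LM>>` plus a `.lm()` builder); field and method names are taken from the PR description, not verified against source:

```rust
use std::sync::Arc;

// Hypothetical stubs illustrating the per-instance override.
#[derive(Debug)]
struct LM {
    model: String,
}

#[derive(Default)]
struct Predict {
    // None means "fall back to the global/default LM".
    lm: Option<Arc<LM>>,
}

impl Predict {
    fn new() -> Self {
        Self::default()
    }

    // Builder-style override: this instance uses its own LM.
    fn lm(mut self, lm: Arc<LM>) -> Self {
        self.lm = Some(lm);
        self
    }

    fn model_name(&self) -> &str {
        self.lm
            .as_deref()
            .map(|l| l.model.as_str())
            .unwrap_or("default")
    }
}
```

Because the override lives on the `Predict` instance rather than in the forward signature, it should re-apply cleanly on top of either API shape, which is consistent with the claim that the "update with main" delta is small.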

Test plan

  • Decide A vs B
  • Resolve the API mismatch (port RLM call sites or re-land the orphaned refactor)
  • Rebase onto current main
  • Full cargo test across dspy-rs + RLM-derive + integration suites
  • Re-verify RLM live demos (OpenAI Responses, Anthropic prompt caching) still pass

darin and others added 30 commits February 19, 2026 18:46
- Refactor Message from flat enum to Role + Vec<ContentBlock>

- Reasoning continuity preserved through rig round-trips

- From<RigMessage> trivial (no data loss), RigChatMessage removed

- Predict API split: forward(input) + forward_continue(chat)

- ToolLoopMode::CallerManaged for caller-controlled tool loops

- Full conversation history in LMResponse.chat

- temp_env replaces unsafe set_var in all tests

- 14 new tests: round-trip, CallerManaged conversation, reasoning preservation
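The Message refactor described in this commit (flat enum to Role + Vec&lt;ContentBlock&gt;) can be sketched as follows. These are illustrative shapes only; the real crate's variants and field names may differ:

```rust
// Hypothetical shapes: a message pairs a role with a list of content
// blocks, so reasoning blocks can survive provider round-trips
// alongside text instead of being flattened away.
#[derive(Debug, Clone, PartialEq)]
enum Role {
    System,
    User,
    Assistant,
}

#[derive(Debug, Clone, PartialEq)]
enum ContentBlock {
    Text(String),
    Reasoning(String), // preserved through round-trips, not shown to users
}

#[derive(Debug, Clone, PartialEq)]
struct Message {
    role: Role,
    content: Vec<ContentBlock>,
}

impl Message {
    // Concatenate only the user-visible text blocks.
    fn text(&self) -> String {
        self.content
            .iter()
            .filter_map(|b| match b {
                ContentBlock::Text(t) => Some(t.as_str()),
                _ => None,
            })
            .collect::<Vec<_>>()
            .join(" ")
    }
}
```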
Two trybuild tests (render_invalid_jinja, render_non_literal) failed on CI
because syn::Error::new(span, msg) with .span() produces different underline
widths on stable (CI) vs nightly (local). Switch to
syn::Error::new_spanned(tokens, msg) which reliably spans from first to last
token regardless of compiler version.
…, collapse call API

* refactor: unify Predict history API and chat contracts

- collapse Predict to forward(input, history) with call() wrapper
- preserve full provider content in CallerManaged LMResponse.output
- remove inaccurate lossless conversion claim
- remove legacy Chat JSON parsing; enforce canonical grouped format
- update conversation and chat roundtrip tests

* refactor: harden LM transcript fidelity, unify Predicted return shape, collapse call API
… perception-based user messages, custom repr support
- Type dedup: persistent visited set, types expand once then reference by name
- Nested method visibility: collect methods by type (schema-driven resolution),
  render on all class blocks, not just top-level vars
- Data enum de-wrapper: single-payload variants render as direct payload types
  instead of Entry_X { type, data } wrappers
- Doc comment normalization: multi-line docs collapse to single line
- Union indentation fix: preserve nested indentation in continuation lines
- Docstring gating removed: methods without docs still appear in schema
- Defensive synthetic variant guard: prevent method contamination on
  BAML-generated variant classes

Schema output: 409 → 222 lines (46% reduction), 0 → 11 nested methods visible.
All changes are general infrastructure — any program using the RLM benefits.
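The "types expand once, then reference by name" dedup above can be sketched with a persistent visited set. This is a toy illustration of the strategy, not the actual schema renderer:

```rust
use std::collections::HashSet;

// First visit: emit the full type body. Later visits: emit only the
// name, so repeated nested types stop re-expanding the schema.
fn render_type(name: &str, body: &str, visited: &mut HashSet<String>) -> String {
    if visited.insert(name.to_string()) {
        format!("class {name} {{ {body} }}") // first expansion
    } else {
        name.to_string() // already expanded: reference by name
    }
}
```

Keeping the visited set persistent across the whole render pass (rather than per subtree) is what turns repeated expansions into single-line references, which is where a line reduction like 409 to 222 plausibly comes from.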