Symptom (live test 2026-05-28)
Same live test as #(reform-recipes issue): the agent burned 15+ run_python iterations trying to guess how to express a 1pp basic-rate reform via free-form Python. It never converged.
Diagnosis
PR #55 (feat/register-typed-tools) re-exposes calculate_household, run_economy_simulation, and analyse_microdata: typed tools that pin reform semantics inside the engine objects. With those tools registered, the agent calls:
…once, instead of free-form Python on Parameters(...) 15 times. The engine handles the partial-override semantics correctly.
Per the commit message of 249f9c8:
"These functions have existed since the early agent_tools.py but were quietly removed from the registry in PR #11 (compute → run_python rewrite) — no documented rationale."
This issue tracks landing PR #55 specifically to unblock the reform-API failure class.
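To illustrate why a typed tool converges in one call where free-form Python does not, here is a minimal sketch. It is not the code in PR #55: ReformSpec, TOOL_REGISTRY, register_tool, income_tax, and the BASELINE values are all hypothetical names and toy numbers made up for this example. The point is only that a typed reform object can validate parameter names up front and apply a partial override, so untouched parameters keep their baseline values.

```python
from dataclasses import dataclass, field
from typing import Callable, Dict, Optional

# Hypothetical baseline parameters; toy values, not real policy figures.
BASELINE: Dict[str, float] = {
    "basic_rate": 0.20,
    "personal_allowance": 10_000.0,
}


@dataclass(frozen=True)
class ReformSpec:
    """Hypothetical typed reform: a partial set of parameter overrides."""

    overrides: Dict[str, float] = field(default_factory=dict)

    def __post_init__(self) -> None:
        # Reject misspelled or unknown parameters before anything runs.
        unknown = set(self.overrides) - set(BASELINE)
        if unknown:
            raise ValueError(f"unknown parameters: {sorted(unknown)}")

    def apply(self) -> Dict[str, float]:
        # Partial override: parameters not named keep their baseline value.
        return {**BASELINE, **self.overrides}


# Hypothetical registry mapping tool names to typed callables.
TOOL_REGISTRY: Dict[str, Callable[..., float]] = {}


def register_tool(fn: Callable[..., float]) -> Callable[..., float]:
    """Expose a function to the agent under its own name."""
    TOOL_REGISTRY[fn.__name__] = fn
    return fn


@register_tool
def income_tax(income: float, reform: Optional[ReformSpec] = None) -> float:
    """Toy single-band income tax, used only to show the call shape."""
    params = (reform or ReformSpec()).apply()
    taxable = max(0.0, income - params["personal_allowance"])
    return taxable * params["basic_rate"]


# A 1pp basic-rate reform, expressed once as typed data.
one_pp = ReformSpec(overrides={"basic_rate": 0.21})
baseline_tax = TOOL_REGISTRY["income_tax"](30_000.0)
reformed_tax = TOOL_REGISTRY["income_tax"](30_000.0, one_pp)
```

In this sketch a misspelled parameter such as "basic_rat" fails immediately with a ValueError rather than silently producing an unchanged result, which is the kind of feedback free-form Python on a parameters object may not give.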
Action
Either:
… analyse_microdata typed tool into a smaller PR: that's the one that fixes today's symptom.
Constraints
… run_python-only.
… run_python: it's still the right fallback for ad-hoc analysis.