v2: Executive harness built on autoresearch by Bbasche · Pull Request #1 · Bbasche/ooda-engine

Bbasche · 2026-03-10T19:27:08Z

Summary

Adds a higher-order OODA loop that sits above Karpathy's autoresearch instances and manages them as a portfolio toward business goals.

Mission decomposition: LLM translates business goals into parallel research tracks, each becoming an autoresearch instance with its own program.md and git branch
Autoresearch adapter: Clones the repo, sets up workspaces, generates program.md (the control surface), parses results.tsv, detects plateaus, connects to agenthub API
Executive OODA cycle: OBSERVE (poll all instances' results.tsv) → ORIENT (LLM analyzes portfolio) → DECIDE (spawn/kill/redirect/combine/escalate) → ACT
Cross-pollination: Insights from one instance get injected into other instances' program.md files
Escalation: When all instances plateau, re-decomposes goals with entirely fresh approaches
Generalized autoresearch: The protocol works beyond ML — any "modify file → run → measure → keep/discard" loop. Examples: API latency, Lighthouse scores, conversion rates

Complex use cases this now enables

Multi-objective ML research — Parallel autoresearch instances targeting different metrics (val_bpb, VRAM, throughput), executive finds Pareto frontier
Full-stack product optimization — Backend perf + frontend Lighthouse + conversion copy, all coordinated
Adversarial research — Red-team instance finds vulnerabilities, blue-team hardens, executive feeds findings back and forth
Scientific hypothesis portfolio — Multiple hypotheses explored in parallel, dead ends killed, promising ones doubled-down
Continuous production optimization — Perpetual experimentation in staging, executive promotes proven improvements
Multi-codebase coordination — Model + serving + SDK instances kept compatible via program.md contract updates
Market-responsive R&D — External signals trigger goal re-decomposition
Cost-optimized research — Exploratory tracks on cheap GPUs, promising directions promoted to expensive hardware
Multi-agent knowledge synthesis — Cross-domain insights combined into emergent research directions
Automated ablation studies — One instance per component, executive integrates per-component optima

New files

executive.js — CLI entry point
lib/autoresearch.js — Autoresearch adapter (workspace, program.md, results.tsv, hub API)
lib/executive-loop.js — Meta-OODA cycle
lib/instance-manager.js — Instance lifecycle management
lib/hub.js — Coordination hub
lib/mission.js — Goal decomposition
examples/executive.config.js — ML optimization portfolio
examples/executive.generalized.config.js — Full-stack product optimization

v1 ooda.js unchanged — fully backwards compatible.

Test plan

node executive.js --help shows usage
node executive.js --dry-run decomposes goals into tracks without spawning
node executive.js --agent=claude-code spawns autoresearch instances with Claude
node executive.js --status reads hub state and shows portfolio
v1 still works: node ooda.js --help
All files pass node -e "require('./file')" syntax check

🤖 Generated with Claude Code

Add a higher-order OODA loop that orchestrates multiple autoresearch instances toward business goals. The executive generates program.md files (autoresearch's control surface), monitors results.tsv, detects plateaus, cross-pollinates insights between instances, and makes portfolio-level decisions (spawn/kill/redirect/escalate). New files: - executive.js: CLI entry point for v2 - lib/autoresearch.js: Adapter for Karpathy's autoresearch (workspace setup, program.md generation, results.tsv parsing, hub API client, plateau detection) - lib/executive-loop.js: Meta-OODA cycle over instances - lib/instance-manager.js: Spawn/monitor/steer/kill autoresearch agents - lib/hub.js: Central coordination hub (file-based + agenthub API) - lib/mission.js: LLM-powered goal decomposition into research tracks - examples/executive.config.js: ML optimization portfolio example - examples/executive.generalized.config.js: Full-stack product optimization v1 ooda.js unchanged — fully backwards compatible. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

v2: Executive harness built on autoresearch#1

v2: Executive harness built on autoresearch#1
Bbasche wants to merge 1 commit intomainfrom
feat/executive-harness

Bbasche commented Mar 10, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Conversation

Bbasche commented Mar 10, 2026

Summary

Complex use cases this now enables

New files

Test plan

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant