Architecture

Flow

Load ragopt.yaml.
Load benchmark dataset.
For each candidate:
- run generation over each query
- compute per-case metrics
- aggregate metrics
- apply hard constraints
Rank candidates by weighted score.
Persist JSON artifact + markdown report.
Optionally post markdown to GitHub PR.
UI loads run artifact JSON for interactive inspection.
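
As a rough illustration of the per-candidate loop and ranking step, here is a minimal sketch. It is not the code in engine.py: the names `evaluate`, `CandidateResult`, `exact_match`, and `minimums` are hypothetical, aggregation by mean is an assumption, and hard constraints are modelled as per-metric minimums. Failing candidates are kept but sorted last, which may differ from how the real engine treats them.

```python
from dataclasses import dataclass
from statistics import mean
from typing import Callable

# All names here are hypothetical and are not taken from ragopt itself.
Generator = Callable[[str], str]        # query -> generated answer
Metric = Callable[[str, str], float]    # (answer, expected) -> score in [0, 1]


@dataclass
class CandidateResult:
    name: str
    metrics: dict[str, float]
    passed: bool
    score: float


def evaluate(
    candidates: dict[str, Generator],
    dataset: list[tuple[str, str]],
    metrics: dict[str, Metric],
    weights: dict[str, float],
    minimums: dict[str, float],
) -> list[CandidateResult]:
    results = []
    for name, generate in candidates.items():
        # Run generation over each query and compute per-case metrics.
        per_case = [
            {m: fn(generate(query), expected) for m, fn in metrics.items()}
            for query, expected in dataset
        ]
        # Aggregate per-case metrics (mean is an assumption).
        agg = {m: mean(case[m] for case in per_case) for m in metrics}
        # Hard constraints: every listed metric must reach its minimum.
        passed = all(agg[m] >= floor for m, floor in minimums.items())
        # Weighted score over the aggregated metrics.
        score = sum(weights.get(m, 0.0) * value for m, value in agg.items())
        results.append(CandidateResult(name, agg, passed, score))
    # Passing candidates first, then by descending weighted score.
    return sorted(results, key=lambda r: (not r.passed, -r.score))


def exact_match(answer: str, expected: str) -> float:
    return 1.0 if answer.strip() == expected.strip() else 0.0


dataset = [("capital of France?", "Paris"), ("2 + 2?", "4")]
candidates = {
    "always_paris": lambda q: "Paris",
    "lookup": lambda q: {"capital of France?": "Paris", "2 + 2?": "4"}[q],
}
ranked = evaluate(
    candidates,
    dataset,
    metrics={"exact_match": exact_match},
    weights={"exact_match": 1.0},
    minimums={"exact_match": 0.75},
)
for r in ranked:
    print(r.name, r.metrics, "pass" if r.passed else "fail", r.score)
```

In this toy run the `lookup` candidate answers both queries correctly and ranks first, while `always_paris` averages 0.5 on exact match and fails the 0.75 minimum.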

Modules

ragopt/models.py: schema and result types
ragopt/config.py: config and dataset loading
ragopt/adapters.py: generation provider abstraction
ragopt/metrics.py: metric and scoring functions
ragopt/engine.py: orchestration and comparison
ragopt/reporting.py: markdown outputs
ragopt/github.py: PR comment helper
ragopt/cli.py: public command interface
ui/: React + TypeScript dashboard for viewing run artifacts

Extension points

Add providers in adapters.py.
Add new metrics in metrics.py and wire them into engine.py.
Add policy checks to the compare and run paths in engine.py.
Connect ui/ to a live backend API instead of relying on file upload only.
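
To make the first two extension points concrete, the sketch below shows one way a provider and a metric could be added. The interfaces are assumptions, not the actual contents of adapters.py or metrics.py: `Provider`, `EchoProvider`, the `METRICS` registry, the `metric` decorator, and `token_overlap` are all hypothetical names, and the real modules may wire things differently.

```python
from typing import Callable, Protocol


class Provider(Protocol):
    """Hypothetical provider interface; the real one in adapters.py may differ."""

    def generate(self, query: str, context: list[str]) -> str: ...


class EchoProvider:
    """Toy provider that returns the first context passage, useful offline."""

    def generate(self, query: str, context: list[str]) -> str:
        return context[0] if context else ""


# Hypothetical registry keyed by metric name, so an engine can look metrics up.
METRICS: dict[str, Callable[[str, str], float]] = {}


def metric(name: str):
    """Register a metric function under the given name."""

    def register(fn: Callable[[str, str], float]) -> Callable[[str, str], float]:
        METRICS[name] = fn
        return fn

    return register


@metric("token_overlap")
def token_overlap(answer: str, expected: str) -> float:
    """Fraction of expected tokens that also appear in the answer."""
    want = set(expected.lower().split())
    got = set(answer.lower().split())
    return len(want & got) / len(want) if want else 0.0


provider: Provider = EchoProvider()
answer = provider.generate("Where is the Louvre?", ["The Louvre is in Paris"])
overlap = METRICS["token_overlap"](answer, "Louvre in Paris")
print(answer, overlap)
```

A structural `Protocol` keeps new providers free of a shared base class, and a name-keyed registry means a new metric only needs a decorator plus a weight entry in the config. Both are design choices made for this sketch, not a description of the existing code.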