
feat(scorers): artifact scorer suite — bibliography, governance receipt, ARCANA essay#8

Open
hummbl-dev wants to merge 2 commits into main from feat/claude/artifact-scorer-suite


@hummbl-dev
Owner

Summary

Adds the full Arbiter artifact scorer suite — a penalty-based, multi-dimensional quality scoring system for knowledge artifacts before CLP ingest.

  • artifact_scorer.py — base framework: ArtifactScorer(ABC), ArtifactScorerRegistry, DEFAULT_REGISTRY, _score_from_findings() penalty engine (CRITICAL=-25, HIGH=-15, MEDIUM=-7, LOW=-3), A–F grade scale
  • BibliographyScorer — 5 dimensions: DOI coverage, tier distribution, tag density, completeness, source density
  • GovernanceReceiptScorer — 5 dimensions: completeness, chain_of_custody, timestamp_validity, evidence_ratio, schema_compliance; EU AI Act Article 12 + NIST AI RMF GOVERN 1.2 aligned
  • ArcanaEssayScorer — adversarial gate for ARCANA synthesis essays; 5 dimensions: empirical_grounding (0.30), citation_density (0.25), structural_integrity (0.20), source_diversity (0.15), on_topic_ratio (0.10); rules ARC101–502; blocks grade F from CLP ingest
  • scorers/__init__.py — registers all 3 scorers into DEFAULT_REGISTRY
  • 51 tests — all green
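A minimal sketch of how the penalty engine described above could work, using the severity penalties listed for `_score_from_findings()`. The starting score of 100, the clamp at 0, the `Severity` enum, and the function signature are assumptions for illustration; the actual code in artifact_scorer.py may differ.

```python
from enum import Enum


class Severity(Enum):
    """Finding severities named in the PR description."""
    CRITICAL = "CRITICAL"
    HIGH = "HIGH"
    MEDIUM = "MEDIUM"
    LOW = "LOW"


# Penalty points per finding, as listed in the summary.
PENALTIES = {
    Severity.CRITICAL: 25,
    Severity.HIGH: 15,
    Severity.MEDIUM: 7,
    Severity.LOW: 3,
}


def score_from_findings(severities, base=100):
    """Subtract one penalty per finding from `base`.

    Assumption: scores start at 100 and are clamped at 0; the PR does not
    state either detail.
    """
    penalty = sum(PENALTIES[s] for s in severities)
    return max(0, base - penalty)
```

Under these assumptions, one HIGH and one LOW finding give 100 - 15 - 3 = 82, and five CRITICAL findings bottom out at 0 rather than going negative.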

Test plan

  • test_artifact_scorer.py — 51 tests covering all 3 scorers + base framework
  • Grade scale verified: A≥90, B≥80, C≥70, D≥60, F<60
  • ARC304 rule: zero citations → HIGH severity on empirical_grounding
  • ArcanaEssayScorer blocks grade F from ledger ingest (wired in arcana_ingest.py)
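The checks in this test plan can be sketched as follows. The grade thresholds, the ARCANA dimension weights, and the ARC304 rule come from this PR; combining dimension scores as a weighted sum, the function names, and the finding dict shape are assumptions, not the actual Arbiter API.

```python
# Dimension weights for ArcanaEssayScorer, as listed in the PR summary.
ARCANA_WEIGHTS = {
    "empirical_grounding": 0.30,
    "citation_density": 0.25,
    "structural_integrity": 0.20,
    "source_diversity": 0.15,
    "on_topic_ratio": 0.10,
}


def letter_grade(score):
    """Map a 0-100 score to the A-F scale (A>=90, B>=80, C>=70, D>=60, else F)."""
    for floor, grade in ((90, "A"), (80, "B"), (70, "C"), (60, "D")):
        if score >= floor:
            return grade
    return "F"


def weighted_score(dimension_scores):
    """Assumption: the overall score is the weighted sum of per-dimension scores."""
    return sum(w * dimension_scores[name] for name, w in ARCANA_WEIGHTS.items())


def arc304_finding(citation_count):
    """ARC304: zero citations yields a HIGH finding on empirical_grounding."""
    if citation_count == 0:
        return {"rule": "ARC304", "severity": "HIGH",
                "dimension": "empirical_grounding"}
    return None


def may_ingest(dimension_scores):
    """Gate: essays graded F are blocked from ledger ingest."""
    return letter_grade(weighted_score(dimension_scores)) != "F"
```

In this sketch the gate only rejects grade F, so a D-grade essay would still pass; whether arcana_ingest.py treats D the same way is not stated in the PR.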

🤖 Generated with Claude Code

Claude (agent) and others added 2 commits April 5, 2026 22:00
The only registered runner (windows-desktop-1) is offline. Arbiter is pure
Python, so ubuntu-latest works fine.

Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
Adds the Arbiter knowledge artifact scoring infrastructure:

- ArtifactScorer (ABC) + ArtifactScorerRegistry + _score_from_findings() in
  artifact_scorer.py — penalty-based weighted scoring, CRITICAL/HIGH/MEDIUM/LOW
  severity grades, A–F letter grade scale
- BibliographyScorer (5 dims): DOI coverage, tier distribution, tag density,
  completeness, source density
- GovernanceReceiptScorer (5 dims): completeness, chain_of_custody,
  timestamp_validity, evidence_ratio, schema_compliance — EU AI Act Article 12
  + NIST AI RMF GOVERN 1.2 alignment
- ArcanaEssayScorer (5 dims): empirical_grounding, citation_density,
  structural_integrity, source_diversity, on_topic_ratio — adversarial gate
  blocks echo-chamber synthesis (grade F) from entering CLP ledger; rules
  ARC101–502 including ARC304 (zero-citation block)
- All three scorers registered in DEFAULT_REGISTRY via scorers/__init__.py
- 51 tests green (35 existing + 16 new for ArcanaEssayScorer)

Closes ARCANA → Arbiter gate → CLP ingest flywheel.

Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>
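The registration step from the commit message (all three scorers registered in DEFAULT_REGISTRY via scorers/__init__.py) might look roughly like the sketch below. The class names come from this PR; the `artifact_type` key, the `register`/`get`/`types` methods, and the placeholder `score` bodies are hypothetical stand-ins, since the real interfaces are not shown here.

```python
from abc import ABC, abstractmethod


class ArtifactScorer(ABC):
    """Base class; `artifact_type` is a hypothetical lookup key."""
    artifact_type: str

    @abstractmethod
    def score(self, artifact: dict) -> float:
        """Return a 0-100 score for the artifact."""


class ArtifactScorerRegistry:
    """Maps an artifact type to its scorer instance (assumed design)."""

    def __init__(self):
        self._scorers = {}

    def register(self, scorer: ArtifactScorer) -> None:
        if scorer.artifact_type in self._scorers:
            raise ValueError(f"duplicate scorer: {scorer.artifact_type}")
        self._scorers[scorer.artifact_type] = scorer

    def get(self, artifact_type: str) -> ArtifactScorer:
        return self._scorers[artifact_type]

    def types(self):
        return sorted(self._scorers)


# Placeholder scorers: real implementations evaluate five dimensions each.
class BibliographyScorer(ArtifactScorer):
    artifact_type = "bibliography"

    def score(self, artifact):
        return 100.0  # stand-in value, not real scoring logic


class GovernanceReceiptScorer(ArtifactScorer):
    artifact_type = "governance_receipt"

    def score(self, artifact):
        return 100.0  # stand-in value, not real scoring logic


class ArcanaEssayScorer(ArtifactScorer):
    artifact_type = "arcana_essay"

    def score(self, artifact):
        return 100.0  # stand-in value, not real scoring logic


# What scorers/__init__.py is described as doing: register all three.
DEFAULT_REGISTRY = ArtifactScorerRegistry()
for scorer_cls in (BibliographyScorer, GovernanceReceiptScorer, ArcanaEssayScorer):
    DEFAULT_REGISTRY.register(scorer_cls())
```

Rejecting duplicate registrations is a design choice in this sketch, intended to surface accidental double imports early; the actual registry may overwrite silently instead.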
