Add literature freshness review assistant by karollooool · Pull Request #394 · SCIBASE-AI/SCIBASE.AI

karollooool · 2026-05-23T00:57:34Z

/claim #16

Summary

Adds a distinct literature freshness review assistant for the AI-Powered Research Assistant Suite.

This slice compares manuscript claims against a synthetic current-evidence ledger before AI reviewer packets are released, checking:

stale support citations outside the freshness window
missing newer systematic reviews, reporting guidelines, dataset releases, and benchmark updates
newer contradictory evidence that should hold a broad claim
dataset version drift for claims using current-evidence language
benchmark leaderboard drift for state-of-the-art claims
reviewer actions, research-gap prompts, stable audit digests, and deterministic reviewer artifacts

Non-overlap

This is not another broad assistant suite, evidence/protocol trace, statistics review, research-gap planner, rebuttal pack, ethics/data availability check, citation-context reconciler, reporting-guideline compliance module, benchmark-leakage audit, figure/table consistency assistant, analysis-variable provenance assistant, domain-template selector, grant-fit module, limitations disclosure assistant, uncertainty calibration assistant, supplement-readiness assistant, prompt-safety guard, study-power checker, COI/funding disclosure checker, retraction sentinel, preregistration deviation assistant, external-validity assistant, image-integrity assistant, or assay control/calibration assistant. It focuses specifically on whether reviewer-facing claims are current against newer evidence and temporal-drift signals.

Safety

Synthetic fixtures only in literature-freshness-review-assistant/sample-data.js
No private manuscripts, patient data, credentials, external APIs, model calls, payment data, wallet material, or network calls
Dependency-free evaluator/tests; ffmpeg is only optional for regenerating the committed demo video

Demo artifacts

literature-freshness-review-assistant/reports/demo.json
literature-freshness-review-assistant/reports/demo.md
literature-freshness-review-assistant/reports/demo.svg
literature-freshness-review-assistant/reports/demo.mp4

Validation

npm run check
npm test
npm run demo
npm run demo:video with FFMPEG_PATH pointing to a temporary ffmpeg-static binary outside the repo
ffmpeg -v error -i literature-freshness-review-assistant/reports/demo.mp4 -f null -
git diff --check
git diff --cached --check
rg -n "token|secret|password|private key|BEGIN|sk-|ghp_|github_pat|wallet|seed phrase" README.md literature-freshness-review-assistant -> no matches

AI-assisted with Codex; reviewed and locally verified before submission.

Copilot

Pull request overview

Adds a new “literature freshness review assistant” slice to the SCIBASE AI-Powered Research Assistant Suite, intended to screen reviewer-facing manuscript claims for staleness and evidence drift prior to releasing reviewer packets.

Changes:

Introduces a new literature-freshness-review-assistant/ module with a freshness evaluator, deterministic audit digests, and Markdown/SVG renderers.
Adds dependency-free tests plus deterministic demo artifact generation (JSON/MD/SVG) and an optional ffmpeg-based MP4 demo renderer.
Updates the repo README to link to the new slice.

Reviewed changes

Copilot reviewed 10 out of 12 changed files in this pull request and generated 4 comments.

Show a summary per file

File	Description
README.md	Adds a “Bounty Slices” section linking to the new assistant.
literature-freshness-review-assistant/README.md	Documents the slice scope, files, and validation commands.
literature-freshness-review-assistant/package.json	Adds per-slice scripts (`check`, `test`, `demo`, `demo:video`).
literature-freshness-review-assistant/index.js	Implements freshness evaluation logic, digests, and Markdown/SVG rendering.
literature-freshness-review-assistant/sample-data.js	Adds synthetic policy, evidence ledger, and manuscript packets fixtures.
literature-freshness-review-assistant/test.js	Adds assertion-based tests for evaluator + renderers + digest stability.
literature-freshness-review-assistant/demo.js	Generates deterministic demo JSON/MD/SVG artifacts.
literature-freshness-review-assistant/scripts/render-demo-video.js	Optional ffmpeg script to render an MP4 demo.
literature-freshness-review-assistant/reports/demo.json	Committed example JSON output.
literature-freshness-review-assistant/reports/demo.md	Committed example Markdown output.
literature-freshness-review-assistant/reports/demo.svg	Committed example SVG output.

💡 Add Copilot custom instructions for smarter, more guided reviews. Learn how to get started.

karollooool · 2026-05-23T13:51:49Z

+    if (signal.currentVersion && claim.datasetVersion && claim.datasetVersion !== signal.currentVersion) {
+      addFinding(findings, {
+        severity: "major",
+        code: "DATASET_VERSION_DRIFT",
+        claimId: claim.id,
+        topic: claim.topic,
+        detail: `Claim uses dataset ${claim.datasetVersion}; current ledger version is ${signal.currentVersion}.`,
+        requiredAction: signal.requirement,
+        evidenceSignalId: signal.id
+      });
+    }
+
+    if (signal.currentVersion && claim.benchmarkVersion && claim.benchmarkVersion !== signal.currentVersion) {
+      addFinding(findings, {
+        severity: "major",
+        code: "BENCHMARK_VERSION_DRIFT",
+        claimId: claim.id,
+        topic: claim.topic,
+        detail: `Claim uses benchmark ${claim.benchmarkVersion}; current ledger version is ${signal.currentVersion}.`,
+        requiredAction: signal.requirement,
+        evidenceSignalId: signal.id
+      });
+    }


Addressed in 1cddc7d. Dataset drift now only fires for dataset_release signals and benchmark drift only fires for benchmark_update signals, and both are gated on current/latest/state-of-the-art wording or an explicit currentEvidenceClaim flag. I added a regression test for a historical versioned claim that should not receive a drift finding.

karollooool · 2026-05-23T13:51:50Z

+function evaluateClaimFreshness(claim, ledger, policy) {
+  const findings = [];
+  const signals = evidenceForTopic(ledger, claim.topic);
+  const latestCitation = latestDate(claim.citationDates || []);
+  const reviewDate = policy.reviewDate;
+


Addressed in 1cddc7d. The evaluator now filters topic signals through policy.requiredEvidenceTypes, so unsupported signal types are ignored for policy-driven freshness holds. I added coverage for an unsupported blog_post signal.

karollooool · 2026-05-23T13:51:51Z

+    lines.push("| Severity | Code | Claim | Detail | Required action |");
+    lines.push("| --- | --- | --- | --- | --- |");
+    for (const finding of manuscript.findings) {
+      lines.push(
+        `| ${finding.severity} | ${finding.code} | ${finding.claimId} | ${finding.detail} | ${finding.requiredAction} |`
+      );


Addressed in 1cddc7d. Markdown table cells now escape pipes and normalize newlines to <br>, with a regression test covering both cases.

karollooool · 2026-05-23T13:51:52Z

+function yearsBetween(olderDate, newerDate) {
+  const older = new Date(`${olderDate}T00:00:00Z`);
+  const newer = new Date(`${newerDate}T00:00:00Z`);
+  return (newer.getTime() - older.getTime()) / (365.25 * 24 * 60 * 60 * 1000);
+}
+
+function latestDate(dates) {
+  const validDates = dates.filter(Boolean).sort();


Addressed in 1cddc7d. Date parsing now validates ISO YYYY-MM-DD strings and finite timestamps. Invalid support dates produce an INVALID_SUPPORT_DATE finding and, if no valid support date remains, the existing missing-date hold still fires. Added regression coverage.

karollooool · 2026-05-23T13:51:53Z

Follow-up commit 1cddc7d addresses the four Copilot review comments:

Dataset/benchmark drift is now gated by matching evidence type and current/latest/state-of-the-art claim language.
policy.requiredEvidenceTypes is now enforced when selecting current-evidence signals.
Markdown table cells escape pipes and normalize newlines.
Support dates are validated before freshness calculations; invalid dates produce explicit findings.

Re-ran:

npm run check
npm test
npm run demo
npm run demo:video
MP4 decode check with ffmpeg
git diff --check
git diff --cached --check
secret-pattern scan over README and module paths

Add literature freshness review assistant

49ac695

Copilot AI review requested due to automatic review settings May 23, 2026 00:57

algora-pbc Bot added the 🙋 Bounty claim label May 23, 2026

algora-pbc Bot mentioned this pull request May 23, 2026

AI-Powered Research Assistant Suite #16

Open

Copilot started reviewing on behalf of karollooool May 23, 2026 00:58 View session

Copilot AI reviewed May 23, 2026

View reviewed changes

Address literature freshness review feedback

1cddc7d

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add literature freshness review assistant#394

Add literature freshness review assistant#394
karollooool wants to merge 2 commits into
SCIBASE-AI:mainfrom
karollooool:codex/literature-freshness-review-assistant

karollooool commented May 23, 2026

Uh oh!

Copilot AI left a comment

Uh oh!

karollooool May 23, 2026

Uh oh!

karollooool May 23, 2026

Uh oh!

karollooool May 23, 2026

Uh oh!

karollooool May 23, 2026

Uh oh!

karollooool commented May 23, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

karollooool commented May 23, 2026

Summary

Non-overlap

Safety

Demo artifacts

Validation

Uh oh!

Copilot AI left a comment

Choose a reason for hiding this comment

Pull request overview

Reviewed changes

Uh oh!

karollooool May 23, 2026

Choose a reason for hiding this comment

Uh oh!

karollooool May 23, 2026

Choose a reason for hiding this comment

Uh oh!

karollooool May 23, 2026

Choose a reason for hiding this comment

Uh oh!

karollooool May 23, 2026

Choose a reason for hiding this comment

Uh oh!

karollooool commented May 23, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants