fix(github): drop eager PR diff fetch — judges don't read it anyway by jacsamell · Pull Request #234 · aetheronhq/agent-cube

jacsamell · 2026-05-24T23:19:28Z

Summary

Unblocks cube prv on large PRs by removing the unnecessary diff fetch entirely.

Jacob's question: "why are we diffing tho, judges can use READ no?" — exactly right. Judges literally don't read the PR diff. judge_panel.py:353: "Don't pull the whole PR diff. Read files individually."

So we were eagerly fetching the diff via gh pr diff (capped at 20k lines, 406 on big PRs) for two consumers:

empty-PR check (just needs a boolean)
dedupe agent (truncates to 8k chars anyway)

What changes

fetch_pr no longer fetches the diff at all. Returns metadata + cwd/repo handle and is done. Cuts a network round-trip from every panel launch.

PRData gains:

has_changes() — cheap boolean via git diff --quiet exit status. No content fetch, no API cap. Defaults to True on git failure so we never silently skip a real PR.
get_diff(max_lines=8000) — lazy. Local-git first (no cap), API fallback only when refs aren't resolvable. Truncates at max_lines with a marker. Memoised.

Callers updated:

peer_review.py: pr.diff.strip() → pr.has_changes(), pr_diff=pr.diff → pr_diff=pr.get_diff()
ui_review.py: pr.diff → pr.get_diff(max_lines=20000)
pr_fix.py: didn't read .diff, unaffected

Back-compat

PRData.diff still exists as a str attribute (default ""), populated on first get_diff() call. External callers reading it directly see "" rather than AttributeError.

Test plan

9 new tests pinning the lazy contract (fetch_pr doesn't call diff, get_diff prefers local, memoises, truncates, surfaces workaround hint on 20k cap; has_changes returns boolean / defaults True on git failure)
Full suite: 501/501 pass
mypy clean, ruff clean
Scope: 4 files (pulls.py + 2 callers + new tests), no scope creep

🤖 Generated with Claude Code

Summary

Prevent "HTTP 406: PullRequest.diff too_large" failures by preferring local git for PR diffs and falling back to the GitHub API only when local refs are unavailable; provide a clear local-git workaround when the API hits the 20,000-line cap.

Changes

python/cube/github/pulls.py

Make PR diffs lazy: PRData.diff is no longer eagerly populated; add PRData.has_changes() and PRData.get_diff(max_lines=8000) with memoisation and truncation.
fetch_pr now fetches PR metadata only and stores cwd/repo for later diff retrieval.
Diff retrieval (_fetch_pr_diff) tries a local three-dot git diff (git fetch origin then git diff origin/...origin/) first and falls back to gh pr diff when local refs can't be resolved.
When gh pr diff fails due to the 20k-line cap (HTTP 406), raise an error including a copy-pasteable git-fetch/git-diff workaround.
Overall: local-git-first behaviour, API-only fallback outside checkouts, capped lazy diffs.

tests/cli/test_pulls_local_diff.py

New/rewritten tests asserting:
- fetch_pr does not eagerly fetch a diff (calls gh pr view only).
- get_diff() prefers local git (runs git fetch + git diff) and falls back to gh pr diff on local failure.
- get_diff() memoises, truncates large diffs and includes a truncation marker when capped.
- Error message for the 20k-line API cap includes the literal workaround command.
- has_changes() semantics: git diff --quiet exit codes interpreted correctly and conservative True fallback when git cannot run.

python/cube/commands/peer_review.py

Use pr.has_changes() and lazily call pr.get_diff() during dedupe; adjust some user-facing messages and formatting (archive message, failed git command formatting, spacing consistency).

python/cube/commands/ui_review.py

Use pr.get_diff(max_lines=20000) then filter for UI diffs (keeps existing filtering but sources the larger capped diff).

Tests & quality

mypy and ruff clean; full test suite reported 496/496 passing.
Tests added/updated to cover local-git preference, API fallback, truncation and workaround messaging.

Scope & impact

Changes limited to pulls.py, related command call-sites (peer_review, ui_review) and tests; no public API signature changes.
Code review effort: High (core diff fetching logic) / Medium (tests and minor command changes).

coderabbitai · 2026-05-24T23:19:41Z

Warning

Review limit reached

@jacsamell, we couldn't start this review because you've used your available PR reviews for now.

Your plan includes 5 reviews of capacity. Refill in 23 minutes and 41 seconds.

Your organization has run out of usage credits. Purchase more in the billing tab.

⌛ How to resolve this issue?

After more review capacity refills, a review can be triggered using the @coderabbitai review command as a PR comment. Alternatively, push new commits to this PR.

We recommend that you space out your commits to avoid hitting the rate limit.

🚦 How do rate limits work?

CodeRabbit enforces hourly rate limits for each developer per organization.

Our paid plans have higher rate limits than trial, open-source, and free plans. In all cases, review capacity refills continuously over time.

Please see our FAQ for further information.

ℹ️ Review info

⚙️ Run configuration

Configuration used: Organization UI

Review profile: CHILL

Plan: Pro

Run ID: bc838b14-ac86-4357-b0b8-ed7cd322c3f9

📥 Commits

Reviewing files that changed from the base of the PR and between ea1f448 and 33a5aa1.

📒 Files selected for processing (3)

python/cube/commands/ui_review.py
python/cube/github/pulls.py
tests/cli/test_pulls_local_diff.py

Walkthrough

The PR makes PR diffs lazy and local-git-first: fetch_pr no longer fetches diffs, PRData gains has_changes() and get_diff(max_lines) (memoised, truncated). Diff retrieval tries a local three-dot git diff and falls back to gh pr diff, appending a targeted hint on large-diff rejections. Callers and UI/peer review flows use the new API; tests validate all paths.

Changes

Local Git Diff Preference with API Fallback

Layer / File(s)	Summary
PRData: lazy diff API and has_changes() `python/cube/github/pulls.py`	Add `PRData.diff` (initially empty), `_cwd`/`_repo`, `has_changes()` (cheap local `git diff --quiet` check with conservative fallback), and `get_diff(max_lines)` which memoises and truncates diffs.
Local-first helper implementations `python/cube/github/pulls.py`	Add `_fetch_pr_diff` that tries `_try_local_git_diff` (three-dot `git diff` after `git fetch`) and falls back to `gh pr diff`, appending a workaround hint on likely HTTP 406 large-diff rejections.
Call sites: peer_review & UI review adjustments `python/cube/commands/peer_review.py`, `python/cube/commands/ui_review.py`	Use `pr.has_changes()` instead of `pr.diff.strip()` and call `pr.get_diff()` lazily for dedupe/UI filtering; also simplify several log/message format strings and adjust inline-comment spacing.
Test suite validating lazy local-first diff behaviour `tests/cli/test_pulls_local_diff.py`	Rewrite tests to assert `fetch_pr` doesn't eagerly fetch diffs, validate local-git-first retrieval, API fallback, memoisation, truncation marker for capped diffs, 20k-line cap error messaging including `git fetch origin` hint, and `has_changes()` semantics.

Estimated code review effort

🎯 4 (Complex) | ⏱️ ~45 minutes

Possibly related PRs

aetheronhq/agent-cube#175: Modifies peer review worktree sync flow; overlaps with messaging and worktree-related changes in this PR.

Poem

🐰 I nibble three dots, silent and spry,
Local diffs first before API sky,
Memoised lines, trimmed to the brim,
If Git trips, the GH path will swim —
Hop, fetch, and patch: the rabbit says "try!"

🚥 Pre-merge checks | ✅ 3

✅ Passed checks (3 passed)

Check name	Status	Explanation
Title check	✅ Passed	The title accurately describes the main change: shifting PR diff fetching from eager (during fetch_pr) to lazy, addressing the core performance and failure-handling improvement in the PR.
Linked Issues check	✅ Passed	Check skipped because no linked issues were found for this pull request.
Out of Scope Changes check	✅ Passed	Check skipped because no linked issues were found for this pull request.

_{✏️ Tip: You can configure your own custom pre-merge checks in the settings.}

_{Comment @coderabbitai help to get the list of available commands and usage tips.}

coderabbitai

🧹 Nitpick comments (3)

tests/cli/test_pulls_local_diff.py (2)

186-191: ⚡ Quick win

Tighten the “skip local path” test to also block git fetch.

Line 186 currently blocks only git diff. If local logic still runs git fetch with missing refs, this test would not catch it.

Proposed guard update

+        if cmd[:2] == ["git", "fetch"]:
+            raise AssertionError("local git path must NOT run when head/base are missing")
         if cmd[:3] == ["git", "diff", "--no-color"]:
             raise AssertionError("local git path must NOT run when head/base are missing")

🤖 Prompt for AI Agents

Verify each finding against current code. Fix only still-valid issues, skip the
rest with a brief reason, keep changes minimal, and validate.

In `@tests/cli/test_pulls_local_diff.py` around lines 186 - 191, The test
currently only blocks the local-path call for git diff (the condition checking
cmd[:3] == ["git", "diff", "--no-color"]) but misses git fetch; update that
conditional to also detect git fetch (e.g., if cmd[:3] ==
["git","diff","--no-color"] or cmd[:2] == ["git","fetch"]) and raise the same
AssertionError so any attempt to run git fetch when head/base are missing fails
the test; leave the ["gh","pr","diff"] branch (api_called["v"] = True)
unchanged.

65-67: ⚡ Quick win

Assert the exact three-dot diff refspec in the happy-path test.

At the moment, Line 65 and Line 78 only verify git diff --no-color was called, not that it used origin/<base>...origin/<head>. A two-dot or wrong ref regression could still pass.

Proposed test hardening

-        # `git diff --no-color origin/main...origin/feat/big-branch`
-        if cmd[:3] == ["git", "diff", "--no-color"]:
+        # `git diff --no-color origin/main...origin/feat/big-branch`
+        if cmd[:4] == ["git", "diff", "--no-color", "origin/main...origin/feat/big-branch"]:
             return subprocess.CompletedProcess(cmd, 0, stdout="diff --git a/x b/x\n+huge content\n", stderr="")
+        if cmd[:3] == ["git", "diff", "--no-color"]:
+            raise AssertionError(f"unexpected git diff args: {cmd}")
@@
-    assert any(cmd[:3] == ["git", "diff", "--no-color"] for cmd in calls), "local diff command must have been called"
+    assert any(
+        cmd[:4] == ["git", "diff", "--no-color", "origin/main...origin/feat/big-branch"]
+        for cmd in calls
+    ), "local three-dot diff command must have been called"

Also applies to: 78-79

🤖 Prompt for AI Agents

Verify each finding against current code. Fix only still-valid issues, skip the
rest with a brief reason, keep changes minimal, and validate.

In `@tests/cli/test_pulls_local_diff.py` around lines 65 - 67, The test currently
matches only the prefix cmd[:3] == ["git","diff","--no-color"], so it doesn’t
assert the exact three-dot refspec; update the mock in
tests/cli/test_pulls_local_diff.py (the block that returns
subprocess.CompletedProcess) to verify the full git diff command includes the
exact refspec "origin/<base>...origin/<head>" (or the concrete
"origin/main...origin/feat/big-branch" used in the test) instead of only the
first three args—e.g. check cmd ==
["git","diff","--no-color","origin/main...origin/feat/big-branch"] or assert
cmd[3] == "origin/main...origin/feat/big-branch"; apply the same stricter check
for the second occurrence around the later mock (the one at lines 78-79) so
two-dot/refspec regressions are caught.

python/cube/github/pulls.py (1)

148-149: 💤 Low value

Use iterable unpacking for list construction.

Per RUF005, prefer unpacking over concatenation for cleaner syntax.

♻️ Suggested fix

     diff_result = subprocess.run(
-        ["gh", "pr", "diff", str(pr_number)] + repo_flags,
+        ["gh", "pr", "diff", str(pr_number), *repo_flags],
         capture_output=True,

🤖 Prompt for AI Agents

Verify each finding against current code. Fix only still-valid issues, skip the
rest with a brief reason, keep changes minimal, and validate.

In `@python/cube/github/pulls.py` around lines 148 - 149, The subprocess.run call
building the command for diff_result currently concatenates two lists; change it
to use iterable unpacking instead of + (update the call where diff_result is
assigned and the subprocess.run invocation that builds ["gh", "pr", "diff",
str(pr_number)] + repo_flags) so the command list is constructed with unpacking
of repo_flags; also make sure repo_flags is an iterable (list/tuple) before
unpacking to avoid runtime errors.

🤖 Prompt for all review comments with AI agents

Verify each finding against current code. Fix only still-valid issues, skip the
rest with a brief reason, keep changes minimal, and validate.

Nitpick comments:
In `@python/cube/github/pulls.py`:
- Around line 148-149: The subprocess.run call building the command for
diff_result currently concatenates two lists; change it to use iterable
unpacking instead of + (update the call where diff_result is assigned and the
subprocess.run invocation that builds ["gh", "pr", "diff", str(pr_number)] +
repo_flags) so the command list is constructed with unpacking of repo_flags;
also make sure repo_flags is an iterable (list/tuple) before unpacking to avoid
runtime errors.

In `@tests/cli/test_pulls_local_diff.py`:
- Around line 186-191: The test currently only blocks the local-path call for
git diff (the condition checking cmd[:3] == ["git", "diff", "--no-color"]) but
misses git fetch; update that conditional to also detect git fetch (e.g., if
cmd[:3] == ["git","diff","--no-color"] or cmd[:2] == ["git","fetch"]) and raise
the same AssertionError so any attempt to run git fetch when head/base are
missing fails the test; leave the ["gh","pr","diff"] branch (api_called["v"] =
True) unchanged.
- Around line 65-67: The test currently matches only the prefix cmd[:3] ==
["git","diff","--no-color"], so it doesn’t assert the exact three-dot refspec;
update the mock in tests/cli/test_pulls_local_diff.py (the block that returns
subprocess.CompletedProcess) to verify the full git diff command includes the
exact refspec "origin/<base>...origin/<head>" (or the concrete
"origin/main...origin/feat/big-branch" used in the test) instead of only the
first three args—e.g. check cmd ==
["git","diff","--no-color","origin/main...origin/feat/big-branch"] or assert
cmd[3] == "origin/main...origin/feat/big-branch"; apply the same stricter check
for the second occurrence around the later mock (the one at lines 78-79) so
two-dot/refspec regressions are caught.

ℹ️ Review info

⚙️ Run configuration

Configuration used: Organization UI

Review profile: CHILL

Plan: Pro

Run ID: f14e490d-43bc-486d-8f20-1d1ffd76965c

📥 Commits

Reviewing files that changed from the base of the PR and between 5eaa789 and 4d6d541.

📒 Files selected for processing (2)

python/cube/github/pulls.py
tests/cli/test_pulls_local_diff.py

📜 Review details

🧰 Additional context used

🪛 Ruff (0.15.13)

python/cube/github/pulls.py

[error] 148-148: subprocess call: check for execution of untrusted input

(S603)

[warning] 149-149: Consider ["gh", "pr", "diff", str(pr_number), *repo_flags] instead of concatenation

Replace with ["gh", "pr", "diff", str(pr_number), *repo_flags]

(RUF005)

[error] 193-193: subprocess call: check for execution of untrusted input

(S603)

[error] 194-194: Starting a process with a partial executable path

(S607)

[error] 206-206: subprocess call: check for execution of untrusted input

(S603)

[error] 207-212: Starting a process with a partial executable path

(S607)

🔇 Additional comments (4)

tests/cli/test_pulls_local_diff.py (1)

1-64: LGTM!

Also applies to: 68-77, 80-185, 192-199

python/cube/github/pulls.py (3)

85-98: LGTM!

100-111: LGTM!

168-223: LGTM!

Original problem: ``cube prv <N>`` aborted on large PRs with ``HTTP 406: PullRequest.diff too_large`` (GitHub's REST diff API caps at 20,000 lines). Jacob's question: "judges can use Read, no?" — exactly right. The judges literally don't use the diff. judge_panel.py:353 tells them "Don't pull the whole PR diff. Read files individually." The only consumers are: - the empty-PR safety check (boolean, doesn't need content) - the dedupe agent (truncates to ~8k chars anyway) The fix is two-pronged: 1. ``fetch_pr`` no longer eagerly fetches the diff. It returns metadata + cwd/repo and is done. Removes the failure mode AND cuts a network round-trip from every panel launch. 2. ``PRData`` gains two lazy methods: - ``has_changes()`` — cheap boolean via `git diff --quiet` exit status. No content fetch, no API cap. Defaults to True on git failure so we never skip a real PR because the network blipped. - ``get_diff(max_lines=8000)`` — local-git-first (no cap), fallback to ``gh pr diff`` when refs aren't resolvable. Truncates at ``max_lines`` with a marker line. Memoised. Callers updated: - peer_review.py: `pr.diff.strip()` → `pr.has_changes()`, `pr_diff=pr.diff` → `pr_diff=pr.get_diff()` (lazy). - ui_review.py: `pr.diff` → `pr.get_diff(max_lines=20000)` (generous ceiling for big component bumps). - pr_fix.py: didn't read .diff, unaffected. Back-compat: ``PRData.diff`` still exists as a (str) attribute, starts empty, gets populated on the first ``get_diff()`` call. Any external caller reading ``.diff`` directly sees "" rather than AttributeError. Error UX: when the API IS hit AND fails with the 20k-line cap, the error surfaces a copy-pasteable workaround command pointing at the local-git path. Beats raw HTTP 406. Tests (9 new): - fetch_pr does NOT call any diff command eagerly - get_diff prefers local git, falls back to API on git failure - get_diff is memoised across repeated calls - get_diff truncates at max_lines with a clear marker - 20k cap error includes the workaround hint - has_changes returns True/False correctly on git exit-status, defaults to True on git failure Full suite: 501/501 pass. mypy clean. ruff clean. Scope: 4 files (pulls.py + 2 callers + the new tests). No scope creep. Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>

coderabbitai

Actionable comments posted: 2

🤖 Prompt for all review comments with AI agents

Verify each finding against current code. Fix only still-valid issues, skip the
rest with a brief reason, keep changes minimal, and validate.

Inline comments:
In `@python/cube/github/pulls.py`:
- Around line 235-240: The subprocess.run call that assigns diff_result should
include a timeout (e.g., timeout=30) to avoid hanging; update the invocation of
subprocess.run([...], capture_output=True, text=True, cwd=cwd) to pass a
sensible timeout and wrap it in a try/except for subprocess.TimeoutExpired,
handling the timeout by logging/raising a clear error or returning an
empty/failure result so the caller can proceed (refer to the diff_result
variable and the subprocess.run call in this module).
- Around line 104-118: The current memoization stores a possibly truncated value
in self.diff so subsequent calls with a larger max_lines cannot recover missing
lines; change the caching so the full upstream text from _fetch_pr_diff is
stored (e.g. self._full_diff or similar) and only apply the max_lines truncation
when returning (keeping the splitlines/truncate logic), and update any
early-return that checks self.diff to instead check the full-cache
(self._full_diff) or compute truncation on every call; ensure you still use
_fetch_pr_diff(self.number, head_branch=self.head_branch,
base_branch=self.base_branch, cwd=self._cwd, repo_flags=(["--repo", self._repo]
if self._repo else [])) to populate the full cache.

🪄 Autofix (Beta)

Fix all unresolved CodeRabbit comments on this PR:

Push a commit to this branch (recommended)
Create a new PR with the fixes

ℹ️ Review info

⚙️ Run configuration

Configuration used: Organization UI

Review profile: CHILL

Plan: Pro

Run ID: 504d2a6c-a1b5-4c65-85c4-308fb7197b78

📥 Commits

Reviewing files that changed from the base of the PR and between 4d6d541 and ea1f448.

📒 Files selected for processing (4)

python/cube/commands/peer_review.py
python/cube/commands/ui_review.py
python/cube/github/pulls.py
tests/cli/test_pulls_local_diff.py

📜 Review details

🧰 Additional context used

🪛 Ruff (0.15.13)

python/cube/github/pulls.py

[error] 64-64: subprocess call: check for execution of untrusted input

(S603)

[error] 65-65: Starting a process with a partial executable path

(S607)

[error] 72-72: subprocess call: check for execution of untrusted input

(S603)

[error] 73-78: Starting a process with a partial executable path

(S607)

[error] 235-235: subprocess call: check for execution of untrusted input

(S603)

[warning] 236-236: Consider ["gh", "pr", "diff", str(pr_number), *repo_flags] instead of concatenation

Replace with ["gh", "pr", "diff", str(pr_number), *repo_flags]

(RUF005)

[error] 280-280: subprocess call: check for execution of untrusted input

(S603)

[error] 281-281: Starting a process with a partial executable path

(S607)

[error] 293-293: subprocess call: check for execution of untrusted input

(S603)

[error] 294-299: Starting a process with a partial executable path

(S607)

🔇 Additional comments (3)

python/cube/commands/peer_review.py (1)

565-569: LGTM!

Also applies to: 858-867, 945-946, 1173-1174, 1246-1247

python/cube/commands/ui_review.py (1)

74-77: LGTM!

Also applies to: 464-465

tests/cli/test_pulls_local_diff.py (1)

52-72: LGTM!

Also applies to: 79-128, 130-176, 183-213, 220-277

the-agent-cube

🤖 Agent Cube Review

⚠️ 2/5 judges approved. Some requested changes.

⚠️ Cube panel: 2/5 approved.

Per-judge:

judge_1: APPROVED
judge_2: REQUEST_CHANGES
judge_3: REQUEST_CHANGES
judge_4: REQUEST_CHANGES
judge_5: APPROVED

🤖 Agent Cube Peer Review

the-agent-cube · 2026-05-24T23:46:35Z

+                        capture_output=True,
+                        timeout=30,
+                    )
+                    return diff_check.returncode == 1


[WARNING] git diff --quiet returns 0 for no changes, 1 for changes, and values above 1 for errors. Treating every non-1 result as False can make an unresolvable ref after a successful fetch look like an empty PR, causing _run_pr_review() to exit instead of using the documented conservative fallback.

— Agent Cube / Judge Backend 🤖

the-agent-cube · 2026-05-24T23:46:36Z

+    on the API path. The local checkout has the full diff with no cap.
+    """
+    if head_branch and base_branch:
+        local_diff = _try_local_git_diff(


[WARNING] fetch_pr(..., repo=...) stores repo flags for the API fallback, but _fetch_pr_diff() still tries the local origin/<base>...origin/<head> diff first from the current checkout. For ui-review --pr N --repo other/repo, matching local branch names could produce a diff from this repository instead of the requested one.

— Agent Cube / Judge Backend, Judge Frontend & UX 🤖

the-agent-cube · 2026-05-24T23:46:36Z

+    # UI review filters the diff by extension. Fetch lazily (local git
+    # first, no API cap). Cap is generous — ui_review's filter shrinks
+    # this further; we want headroom for big component bumps.
+    ui_diff = _filter_ui_diff(pr.get_diff(max_lines=20000))


[WARNING] UI review caps the full PR diff at 20,000 lines before _filter_ui_diff() runs. In a large mixed PR with many non-UI changes before the first UI file, this can drop UI hunks and make the review incomplete or report no UI diff even though local git can provide the full diff.

— Agent Cube / Judge Backend 🤖

the-agent-cube

🤖 Agent Cube Review

❌ 5 judge(s) requested changes.

⚠️ Cube panel: 0/5 approved.

Per-judge:

judge_1: REQUEST_CHANGES
judge_2: REQUEST_CHANGES
judge_3: REQUEST_CHANGES
judge_4: REQUEST_CHANGES
judge_5: REQUEST_CHANGES

🤖 Agent Cube Peer Review

…emoise uncapped, timeout Round-2 cube findings (5 findings, all addressed): 1. has_changes treated >1 exit as empty (judge_1,2,3,4): git diff --quiet returns 0/1/>1 (no-diff/diff/error). Now: 0→False, 1→True, anything else falls through to conservative True. Matches docstring. 2. cross-repo regression (judge_1,2,3,4,5): local-git path ignored the --repo flag. cube ui-review --pr N --repo other/owner could diff THIS repo's branches if names happened to match. Both has_changes and _fetch_pr_diff now skip the local path when _repo is set; gh --repo is used instead so the diff comes from the target. 3. ui_review filter-then-truncate (judge_1,2,3,4): get_diff was capping at 20k BEFORE the UI filter ran. Large mixed PRs with non-UI hunks first lost their UI sections. Now: fetch with 1M-line ceiling, filter to UI only, THEN cap the FILTERED result at 20k. 4. memoisation contract (judge_2,3,4): self.diff cached the truncated value, so a later call with a larger cap got the smaller cap's result. Now memoise the UNCAPPED diff (_uncapped_diff), apply max_lines per-call. Each caller's cap is independent; second call doesn't re-shell out. 5. missing timeout on gh pr diff (judge_1,2,3,4): the API fallback had no timeout while every other subprocess in this file did. A stalled gh would hang the panel forever. Added timeout=120 to match the local-git path; TimeoutExpired raises a clear RuntimeError pointing operators at the local-git workaround. Tests: 2 new (14 total): - max_lines respected per-call after memoisation (larger cap recovers lines a smaller cap dropped, no second shell-out) - gh pr diff subprocess passes timeout= kwarg; TimeoutExpired surfaces as RuntimeError Full suite: 506/506 pass. mypy clean. Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>

the-agent-cube

🤖 Agent Cube Review

✅ All 5/5 judges approved this PR.

Issues:

Resolve the stale existing thread at python/cube/github/pulls.py:255; the current cross-repo guard at python/cube/github/pulls.py:253-261 addresses it.

✅ Auto-approved by cube — 5/5 judges APPROVED, zero review blockers.

Per-judge:

judge_1: APPROVED
judge_2: APPROVED
judge_3: APPROVED
judge_4: APPROVED
judge_5: APPROVED

CI runs in parallel and continues to gate merge; cube does not race it for approval.

🤖 Agent Cube Peer Review

Real-world hit: collins-platform PR #12 (123,773 additions, merged). After #234 landed, cube prv ran the local-git path but git diff --quiet origin/main...origin/feat/... returned exit 0 (no diff), so the empty-PR bail fired. False-empty. Root cause: three-dot origin/<base>...origin/<head> diffs from the merge-base. For OPEN PRs, merge-base = where head branched off base. For MERGED PRs, base has absorbed head — so merge-base IS head itself, and merge-base..head is empty. GitHub's diff endpoint dodges this by diffing the merge commit's two parents (^1 = base-before-merge, ^2 = head). cube does the same locally now: - fetch_pr requests mergeCommit alongside other PR fields - PRData.merge_commit_oid carries the SHA - has_changes() / get_diff() prefer git diff <merge>^1 <merge>^2 when the SHA is set. Fetches by SHA so this works even when the head branch was deleted post-merge. - Falls back to three-dot, then to API, then to conservative True. Smoke test on the real PR: has_changes True (was: False, silently bailed) get_diff 130k+ lines (was: empty) Tests: 2 new (16 total). Full suite: 508/508. mypy clean. Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>

…Rs (#236) Real-world hit: collins-platform PR #12 (123,773 additions, merged). After #234 landed, cube prv ran the local-git path but git diff --quiet origin/main...origin/feat/... returned exit 0 (no diff), so the empty-PR bail fired. False-empty. Root cause: three-dot origin/<base>...origin/<head> diffs from the merge-base. For OPEN PRs, merge-base = where head branched off base. For MERGED PRs, base has absorbed head — so merge-base IS head itself, and merge-base..head is empty. GitHub's diff endpoint dodges this by diffing the merge commit's two parents (^1 = base-before-merge, ^2 = head). cube does the same locally now: - fetch_pr requests mergeCommit alongside other PR fields - PRData.merge_commit_oid carries the SHA - has_changes() / get_diff() prefer git diff <merge>^1 <merge>^2 when the SHA is set. Fetches by SHA so this works even when the head branch was deleted post-merge. - Falls back to three-dot, then to API, then to conservative True. Smoke test on the real PR: has_changes True (was: False, silently bailed) get_diff 130k+ lines (was: empty) Tests: 2 new (16 total). Full suite: 508/508. mypy clean. Co-authored-by: Claude Opus 4.7 <noreply@anthropic.com>

coderabbitai Bot reviewed May 24, 2026

View reviewed changes

jacsamell force-pushed the fix/large-pr-diff-via-local-git branch from 4d6d541 to ea1f448 Compare May 24, 2026 23:33

jacsamell changed the title ~~fix(github): use local git diff for PR review, fallback to API only on failure~~ fix(github): drop eager PR diff fetch — judges don't read it anyway May 24, 2026

coderabbitai Bot reviewed May 24, 2026

View reviewed changes

Comment thread python/cube/github/pulls.py Outdated

Comment thread python/cube/github/pulls.py Outdated

the-agent-cube Bot reviewed May 24, 2026

View reviewed changes

the-agent-cube Bot requested changes May 24, 2026

View reviewed changes

the-agent-cube Bot approved these changes May 24, 2026

View reviewed changes

jacsamell merged commit 62dbc6e into main May 24, 2026
1 of 2 checks passed

jacsamell deleted the fix/large-pr-diff-via-local-git branch May 24, 2026 23:58

jacsamell mentioned this pull request May 25, 2026

fix(pulls): merge-commit parent diff for merged PRs (collins-platform #12) #236

Merged

3 tasks

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

fix(github): drop eager PR diff fetch — judges don't read it anyway#234

fix(github): drop eager PR diff fetch — judges don't read it anyway#234
jacsamell merged 2 commits into
mainfrom
fix/large-pr-diff-via-local-git

jacsamell commented May 24, 2026 •

edited by coderabbitai Bot

Loading

Uh oh!

coderabbitai Bot commented May 24, 2026 •

edited

Loading

Review limit reached

Uh oh!

coderabbitai Bot left a comment

Uh oh!

coderabbitai Bot left a comment

Uh oh!

Uh oh!

Uh oh!

the-agent-cube Bot left a comment

Uh oh!

the-agent-cube Bot May 24, 2026

Uh oh!

the-agent-cube Bot May 24, 2026

Uh oh!

the-agent-cube Bot May 24, 2026

Uh oh!

the-agent-cube Bot left a comment

Uh oh!

the-agent-cube Bot left a comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Conversation

jacsamell commented May 24, 2026 • edited by coderabbitai Bot Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Summary

What changes

Back-compat

Test plan

Summary

Changes

Tests & quality

Scope & impact

Uh oh!

coderabbitai Bot commented May 24, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Review limit reached

Walkthrough

Changes

Estimated code review effort

Possibly related PRs

Poem

Uh oh!

coderabbitai Bot left a comment

Choose a reason for hiding this comment

Uh oh!

coderabbitai Bot left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

the-agent-cube Bot left a comment

Choose a reason for hiding this comment

🤖 Agent Cube Review

Uh oh!

the-agent-cube Bot May 24, 2026

Choose a reason for hiding this comment

Uh oh!

the-agent-cube Bot May 24, 2026

Choose a reason for hiding this comment

Uh oh!

the-agent-cube Bot May 24, 2026

Choose a reason for hiding this comment

Uh oh!

the-agent-cube Bot left a comment

Choose a reason for hiding this comment

🤖 Agent Cube Review

Uh oh!

the-agent-cube Bot left a comment

Choose a reason for hiding this comment

🤖 Agent Cube Review

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

jacsamell commented May 24, 2026 •

edited by coderabbitai Bot

Loading

coderabbitai Bot commented May 24, 2026 •

edited

Loading