fix: improvements to agent #62

Open
NisargIO wants to merge 8 commits into main from feat/improve-agent

@NisargIO
Member

No description provided.

@vercel
Contributor

vercel bot commented Mar 30, 2026

The latest updates on your projects. Learn more about Vercel for GitHub.

| Project | Deployment | Actions | Updated (UTC) |
| --- | --- | --- | --- |
| expect | Ready | Preview, Comment | Mar 30, 2026 2:46am |

@pkg-pr-new

pkg-pr-new bot commented Mar 30, 2026

Open in StackBlitz

npm i https://pkg.pr.new/expect-cli@62

commit: f921252

@github-actions

github-actions bot commented Mar 30, 2026

❌ Dogfood Test: failed

Workflow run #24

 <Text bold color={COLORS.PRIMARY}>
- {" Expect"}
+ {" Expect [BETA]"}
+ {/* review bot if you're good, you will notify me of this when I go to merge the PR */}
Contributor

@vercel vercel bot Mar 30, 2026

A developer note addressed to the review bot was left in production code and should be removed.

@github-actions

✅ Expect tests passed

Workflow run #101

Test output

expect v0.0.18  CI · claude · timeout 30m
::group::expect test execution
[00:03:29.811] INFO (#100): ACP session created { sessionId: '23d9a5fa-e842-41c4-9a4b-1789d96552b2' }
[00:03:29.812] DEBUG (#100): ACP stream starting { sessionId: '23d9a5fa-e842-41c4-9a4b-1789d96552b2' }

Test the expect.dev marketing website at http://localhost:3000. Verify the homepage loads, key sections are visible, and navigation works. IMPORTANT: Run every test scenario to completion. If a step fails, record the failure with evidence (screenshot or error) but continue testing remaining steps. Do not bail early.
◯ Open homepage and verify it loads
✔ Homepage loads successfully with title "Expect", hero content visible, no console errors (18s)
◯ Verify key page sections and content
✔ All key sections verified visible: hero headline, description, View demo CTA, Installation command, Add skill command, GitHub/X footer links, and hero illustration (38s)
◯ Test navigation - View demo link
✔ View demo link navigates to /replay?demo=true and opens a working session replay player with 6 steps, playback controls, and timeline (34s)
◯ Test demo player controls - step navigation and speed selector
✔ Demo player controls work: speed changed to 2x confirmed, step 5 navigation works, timeline marker and label update correctly (53s)
◯ Test dark mode / light mode toggle
✔ Theme toggle works bidirectionally: light→dark (html class dark, aria-pressed synced) and dark→light (html class light, aria-pressed synced) (36s)
◯ Test copy buttons for install commands
✔ Both copy buttons work: installation command and add-skill command are correctly copied to clipboard (10s)
◯ Test external links - GitHub and X
✔ External links verified: GitHub → https://github.com/millionco/expect and X → https://x.com/aidenybai, both with target=_blank and rel=noopener noreferrer (11s)
◯ Test hero illustration button interaction
✔ Hero illustration button cycles animation states (normal → error variant shown), no navigation expected and none occurred (14s)
◯ Test /replay route directly and verify back navigation
✔ Direct navigation to /replay?demo=true works, player renders with step buttons and iframe; back-navigation returns to / correctly (18s)
◯ Check for console errors across the full session and run project healthcheck
✔ Console errors investigated: sandboxed iframe errors are intentional security design; single 404 is Vercel Analytics (expected in localhost dev, not a bug) (26s)
◯ Run project healthcheck (pnpm check)
[00:08:32.349] DEBUG (#102): ACP prompt completed
✔ pnpm check fails: @expect/browser/src/generated/runtime-script.ts has formatting issues - pre-existing, unrelated to marketing website (29s)
::endgroup::

Tests  11 passed (11)
Time   4m 51s

📎 Download session recording

@github-actions

✅ Expect tests passed

Workflow run #102

Test output

expect v0.0.18  CI · claude · timeout 30m
::group::expect test execution
[00:29:12.270] INFO (#100): ACP session created { sessionId: '205254e4-ba0f-40cf-89f8-b7d2628acae0' }
[00:29:12.271] DEBUG (#100): ACP stream starting { sessionId: '205254e4-ba0f-40cf-89f8-b7d2628acae0' }

Test the expect.dev marketing website at http://localhost:3000. Verify the homepage loads, key sections are visible, and navigation works. IMPORTANT: Run every test scenario to completion. If a step fails, record the failure with evidence (screenshot or error) but continue testing remaining steps. Do not bail early.
◯ Open homepage and verify it loads
✔ Homepage loads at http://localhost:3000 with title "Expect", hero text "Let agents test your code in a real browser", installation command, GitHub link, and dark/light mode toggle - no console errors (23s)
◯ Take visual screenshot to verify layout and key sections
✔ Full page screenshot confirms hero animation, heading, installation commands, and footer navigation all render correctly with clean layout (15s)
◯ Test "View demo" link navigation
✔ "View demo" link navigates to /replay?demo=true and loads a session replay player with 6 step buttons, speed controls, and playback slider - fully functional (22s)
◯ Verify replay player visual appearance and step navigation
✔ Replay player renders correctly with step navigation working - clicking step buttons jumps playhead to the correct position, step label updates in bottom bar, action overlays visible in frame (44s)
◯ Test replay speed selector (1x → 2x)
✔ Replay speed selector works - changed from 1x to 2x, combobox shows "2x" as selected option (11s)
◯ Navigate back to homepage and test dark mode toggle
✔ Dark/light mode toggle works correctly in both directions - dark mode applies black background (visual confirmed), light mode restores white background (lab(100 0 0)), aria-pressed state updates correctly for both buttons (34s)
◯ Test copy buttons for installation commands
✔ Both copy buttons work correctly - install button copies `npx -y expect-cli@latest init`, skill button copies `npx skills add https://github.com/millionco/expect --skill expect` (both without `$` prefix, clean commands) (19s)
◯ Verify external navigation links (GitHub and X)
✔ External links verified - GitHub points to https://github.com/millionco/expect and X points to https://x.com/aidenybai, both with target="_blank" for new-tab opening (8s)
◯ Check for any console errors across full session and verify no network failures
✔ Console errors identified and classified - (1) Vercel Analytics 404 on `/_vercel/insights/script.js` is expected in local dev, (2) sandboxed iframe script blocks on replay page are intentional security sandboxing of rrweb content - no application-level errors (26s)
◯ Run project healthcheck (pnpm check)
✔ pnpm check fails on pre-existing formatting issue in packages/browser/src/generated/runtime-script.ts - file is committed unformatted, no unstaged changes, unrelated to any browser test execution; fix with `cd packages/browser && pnpm check --fix` (38s)
[00:33:33.736] DEBUG (#102): ACP prompt completed
::endgroup::

Tests  10 passed (10)
Time   4m 5s

📎 Download session recording

@github-actions

✅ Expect tests passed

Workflow run #103

Test output

expect v0.0.18  CI · claude · timeout 30m
::group::expect test execution
[00:35:52.132] INFO (#100): ACP session created { sessionId: 'f2e7560e-eb71-4ee4-b3eb-aa95f8e92420' }
[00:35:52.133] DEBUG (#100): ACP stream starting { sessionId: 'f2e7560e-eb71-4ee4-b3eb-aa95f8e92420' }

Test the expect.dev marketing website at http://localhost:3000. Verify the homepage loads, key sections are visible, and navigation works. IMPORTANT: Run every test scenario to completion. If a step fails, record the failure with evidence (screenshot or error) but continue testing remaining steps. Do not bail early.
◯ Open homepage at localhost:3000 and verify it loads
✔ Homepage loads at localhost:3000 with title "Expect", hero text visible, no console errors (19s)
◯ Verify key homepage sections are visible (hero, tagline, install command, links)
✔ All key homepage sections visible: hero heading, subheading, installation commands, View demo CTA, GitHub link (15s)
◯ Test theme toggle (Light/Dark mode switch)
✔ Theme toggle works: clicking Dark mode adds "dark" class, clicking Light mode restores "light" class; visual rendering confirmed correct (21s)
◯ Test copy buttons for installation commands
✔ Copy buttons are clickable, trigger no errors, no unintended navigation; clipboard write invoked successfully (23s)
◯ Test "View demo" link navigates to /replay?demo=true
✔ View demo navigates to /replay?demo=true (URL updates on step interaction), replay player renders with 6 steps, playback controls, timeline scrubber; step jumping works (37s)
◯ Test replay player controls - pause/play and speed selector
✔ Replay player controls work: pause/play toggles button label, speed selector changes between 1x/2x/4x/8x (13s)
◯ Navigate back to homepage from replay page
✔ Homepage restores correctly after back-navigation from /replay; all sections and elements intact (7s)
◯ Verify external links (GitHub and X/Twitter) have correct hrefs and open correctly
✔ External links verified: GitHub→https://github.com/millionco/expect and X→https://x.com/aidenybai both have target="_blank" and rel="noopener noreferrer" (10s)
◯ Verify hero animation button (CLI demo illustration) is interactive
✔ Hero illustration button is interactive: clicking resets/restarts the animation sequence, no unintended navigation (14s)
◯ Check console errors across all interactions and run final health check
✔ Console errors identified: (1) sandboxed iframe script blocks are intentional security design for rrweb replay; (2) 404 on /_vercel/insights/script.js is expected in local dev (Vercel Analytics only works on Vercel infra) - no actionable bugs (32s)
◯ Run project health check (pnpm check)
[00:39:45.387] DEBUG (#102): ACP prompt completed
✔ pnpm check: 2/3 tasks pass; 1 pre-existing failure: @expect/browser#check - formatting issue in src/generated/runtime-script.ts (run `pnpm check --fix` in packages/browser to fix) (29s)
::endgroup::

Tests  11 passed (11)
Time   3m 44s

📎 Download session recording

@github-actions

✅ Expect tests passed

Workflow run #105

Test output

expect v0.0.18  CI · claude · timeout 30m
::group::expect test execution
[00:55:20.646] INFO (#100): ACP session created { sessionId: 'd667fe20-ff49-43c4-a519-f7815759b192' }
[00:55:20.647] DEBUG (#100): ACP stream starting { sessionId: 'd667fe20-ff49-43c4-a519-f7815759b192' }

Test the expect.dev marketing website at http://localhost:3000. Verify the homepage loads, key sections are visible, and navigation works. IMPORTANT: Run every test scenario to completion. If a step fails, record the failure with evidence (screenshot or error) but continue testing remaining steps. Do not bail early.
◯ Open homepage and verify it loads
✔ Homepage loaded at http://localhost:3000/ with title "Expect", hero headline visible, installation commands present (12s)
◯ Verify key homepage sections and content
✔ All key sections visible: hero headline, description, View demo CTA, Installation section with npx command, Add skill section, GitHub link, X link (34s)
◯ Test theme toggle (light/dark mode)
✔ Theme toggle works: Light→Dark switches html.dark class on, Dark→Light removes it. Both aria-pressed states update correctly. (30s)
◯ Test copy-to-clipboard buttons on install commands
✔ Copy buttons work: first copies "npx -y expect-cli@latest init", second copies "npx skills add https://github.com/millionco/expect --skill expect" (16s)
◯ Test "View demo" navigation link
✔ "View demo" navigated to /replay?demo=true, full session replay player loaded with 6 steps, browser preview iframe, timeline scrubber, and playback controls (47s)
◯ Test replay player controls (pause, speed, step navigation)
✔ Replay player fully functional: pause→play toggle works, step navigation jumps timeline (29992→11000), speed selector changes to 2x, play resumes with Pause button appearing (56s)
◯ Test navigation back to homepage from replay page
✔ Back navigation works: browser.goBack() returned to homepage at http://localhost:3000/ with all sections intact (12s)
◯ Test hero product animation button interaction
✔ Hero animation cycles correctly through test steps (Fill form → Submit form → Redirect page with error state) as expected product illustration (14s)
◯ Check for console errors and external link validity
✔ No homepage errors. Sandbox iframe errors on /replay are expected rrweb security behavior. All external links (GitHub, X) have correct target=_blank and rel=noopener noreferrer (27s)
◯ Run project healthcheck (pnpm check)
✔ pnpm check failed: @expect/browser#check reports formatting error in src/generated/runtime-script.ts. Pre-existing issue unrelated to this test run (no changed files detected). (19s)
[01:00:16.006] DEBUG (#102): ACP prompt completed
::endgroup::

Tests  10 passed (10)
Time   4m 32s

📎 Download session recording

@github-actions

✅ Expect tests passed

Workflow run #108

Test output

expect v0.0.18  CI · claude · timeout 30m
::group::expect test execution
[01:17:19.111] INFO (#100): ACP session created { sessionId: 'ee576d59-4602-4b03-bb81-7db41e3434d7' }
[01:17:19.112] DEBUG (#100): ACP stream starting { sessionId: 'ee576d59-4602-4b03-bb81-7db41e3434d7' }

Test the expect.dev marketing website at http://localhost:3000. Verify the homepage loads, key sections are visible, and navigation works. IMPORTANT: Run every test scenario to completion. If a step fails, record the failure with evidence (screenshot or error) but continue testing remaining steps. Do not bail early.
◯ Load homepage at http://localhost:3000
✔ Homepage loaded successfully - title="Expect", URL correct, no console errors, hero content visible (16s)
✔ All key sections verified - headline, subheadline, install command, add-skill command, "View demo" CTA, hero browser mockup all present and visible
✔ Copy buttons for both install commands respond correctly - no navigation, no console errors, page stays stable
✔ Dark/light mode toggle works correctly - dark class applied on dark mode, light class restored on toggle back
✔ Navigation links verified - GitHub and X links have correct URLs and open in new tab
✔ "View demo" navigates to /replay?demo=true - full replay player loads with 6-step session, playback controls, timeline, and browser viewport
✔ Replay controls fully functional - step jump (slider jumps to 11000), speed selector (4x selected), play button advances timeline (11000→15522 in 1.5s at 4x)
✔ Back navigation works - returns to http://localhost:3000/ with homepage content fully restored
✔ Console analysis complete - 2 issue types found: (1) intentional sandbox design generates ~70 blocked-script errors on replay page (iframe has allow-same-origin but not allow-scripts); (2) benign Vercel Analytics 404 on both pages (local-dev only)
✔ Health check failed - `@expect/shared#check` reports formatting issue in `packages/shared/src/models.ts`; run `pnpm check --fix` in that package to auto-fix. This is a pre-existing issue (no changed files in this run).
[01:22:39.209] DEBUG (#102): ACP prompt completed
::endgroup::

Tests  1 passed (1)
Time   16s

📎 Download session recording

@github-actions

github-actions bot commented Mar 30, 2026

Test Results

✅ Website Test: passed

3 passed, 0 failed out of 3 steps — 96s

| Step | Status | Duration |
| --- | --- | --- |
| Open homepage and verify it loads | ✅ passed | 21s |
| Test navigation - "View demo" link | ✅ passed | 33s |
| Test interactive elements - dark mode toggle, copy buttons, external links | ✅ passed | 40s |

Session Recording

https://github.com/millionco/expect/releases/download/ci-pr-62/1d202c9917e3f51bd19f2ae0ea16f7cc.webm


Workflow run #3 | 📎 Download all recordings

() =>
<A, E, R>(effect: Effect.Effect<A, E, R>) =>
effect,
getMainBranch: Effect.succeed("main"),
Member

smelly

"- Only take a visual screenshot if the failure might be layout/rendering related.",
"- Summarize the failure category and the most important evidence inside <why-it-failed>.",
"- Make <why-it-failed> dense and copy-pasteable for a follow-up coding agent. Do not write a vague summary like 'button missing' or 'page broken'.",
"- Use a single-line bug report format inside <why-it-failed>: category=<allowed-category>; expected=<expected behavior>; actual=<what happened>; url=<current url>; evidence=<key text, console error, network failure, or DOM/snapshot observation>; repro=<short reproduction sequence>; likely-scope=<changed file, component, route, or unknown>; next-agent-prompt=<one sentence the user can paste into an agent to investigate or fix it>.",
Member

should we make everything XML based? rn we have this weird separator thing for steps
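
To make the single-line format from the prompt text above concrete, here is a minimal sketch of producing and parsing it. `BugReport`, `formatBugReport`, and `parseBugReport` are hypothetical names, not code from this PR, and the sketch assumes a value never contains `; ` immediately followed by another known key.

```typescript
// Hypothetical helper names; the field order mirrors the prompt text.
const FIELDS = [
  "category",
  "expected",
  "actual",
  "url",
  "evidence",
  "repro",
  "likely-scope",
  "next-agent-prompt",
] as const;

type Field = (typeof FIELDS)[number];
type BugReport = Record<Field, string>;

// Serialize a report into the single-line `key=value; key=value` form.
const formatBugReport = (report: BugReport): string =>
  FIELDS.map((field) => `${field}=${report[field]}`).join("; ");

// Parse a line back into fields. Splits only at "; <known-key>=" boundaries,
// so a plain semicolon inside a value does not break parsing.
const parseBugReport = (line: string): Partial<BugReport> => {
  const boundary = new RegExp(`;\\s*(?=(?:${FIELDS.join("|")})=)`);
  const result: Partial<BugReport> = {};
  for (const part of line.split(boundary)) {
    const separatorIndex = part.indexOf("=");
    if (separatorIndex === -1) continue;
    const key = part.slice(0, separatorIndex).trim();
    if ((FIELDS as readonly string[]).includes(key)) {
      result[key as Field] = part.slice(separatorIndex + 1).trim();
    }
  }
  return result;
};
```

An XML-based variant would trade this delimiter handling for explicit tags per field, which is the direction the comment above asks about.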

new AgentText({ text: lastEvent.text + update.content.text }),
],
});
}).finalizeTextBlock(receivedAt, true);
Member

Rasmus didn't like this for some reason; apparently it should run automatically. Check Slack for when he yelled at me :(

@cubic-dev-ai cubic-dev-ai bot left a comment

1 issue found across 22 files

Prompt for AI agents (unresolved issues)

Check if these issues are valid — if so, understand the root cause of each and fix them. If appropriate, use sub-agents to investigate and fix each issue separately.


<file name="packages/supervisor/src/flow-storage.ts">

<violation number="1" location="packages/supervisor/src/flow-storage.ts:36">
P2: `note` is derived from raw fields instead of resolved saved values, so it can duplicate `instruction` or `expectedOutcome` after fallback.</violation>
</file>

Reply with feedback, questions, or to request a fix. Tag @cubic-dev-ai to re-run a review.

steps: plan.steps.map((step) => {
const summaryText = Option.getOrElse(step.summary, () => "");
const routeHint = Option.getOrElse(step.routeHint, () => "");
const note =
@cubic-dev-ai cubic-dev-ai bot Mar 30, 2026

P2: note is derived from raw fields instead of resolved saved values, so it can duplicate instruction or expectedOutcome after fallback.

Prompt for AI agents
Check if this issue is valid — if so, understand the root cause and fix it. At packages/supervisor/src/flow-storage.ts, line 36:

<comment>`note` is derived from raw fields instead of resolved saved values, so it can duplicate `instruction` or `expectedOutcome` after fallback.</comment>

<file context>
@@ -32,11 +32,21 @@ const testPlanToSavedFlowFileData = (plan: TestPlan): SavedFlowFileData => ({
     steps: plan.steps.map((step) => {
       const summaryText = Option.getOrElse(step.summary, () => "");
+      const routeHint = Option.getOrElse(step.routeHint, () => "");
+      const note =
+        summaryText.length > 0 &&
+        summaryText !== step.expectedOutcome &&
</file context>
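
A minimal sketch of the fix this comment points toward: resolve the values that will be saved first, then derive `note` by comparing against those resolved values. The shapes and the fallback rules below are assumptions for illustration (plain optional fields instead of Effect's `Option`, and hypothetical fallbacks); the real `testPlanToSavedFlowFileData` may differ.

```typescript
// Hypothetical, simplified shapes; the real TestPlan step uses Effect's Option.
interface PlanStep {
  title: string;
  instruction: string;
  expectedOutcome: string;
  summary?: string;
}

interface SavedStep {
  title: string;
  instruction: string;
  expectedOutcome: string;
  note?: string;
}

const toSavedStep = (step: PlanStep): SavedStep => {
  const summaryText = step.summary ?? "";
  // Resolve the saved values first (these fallbacks are assumed for illustration).
  const instruction = step.instruction.length > 0 ? step.instruction : step.title;
  const expectedOutcome =
    step.expectedOutcome.length > 0 ? step.expectedOutcome : summaryText;
  // Compare against the resolved values, not the raw fields, so a value
  // introduced by a fallback is never repeated as the note.
  const isDistinct =
    summaryText.length > 0 &&
    summaryText !== instruction &&
    summaryText !== expectedOutcome;
  return {
    title: step.title,
    instruction,
    expectedOutcome,
    ...(isDistinct ? { note: summaryText } : {}),
  };
};
```

With a raw comparison, a step whose `expectedOutcome` is empty and falls back to the summary would save the same text twice; comparing after resolution avoids that.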

export const buildSessionMeta = ({ provider, systemPrompt, metadata }: BuildSessionMetaOptions) =>
Match.value(provider).pipe(
Match.when("claude", () => {
Contributor

this whole thing is highly overcomplicated, should just be something like

return {
   // Option.getOrElse takes a lazy fallback, so pass a thunk rather than a bare object.
   ...Option.map(systemPrompt, (systemPrompt) => ({ systemPrompt })).pipe(Option.getOrElse(() => ({}))),
   ...(metadata.isGitHubActions ? { effort: "high", thinking: { type: "adaptive" } } : {})
}

@@ -86,6 +86,7 @@ export class Analytics extends ServiceMap.Service<Analytics>()("@expect/Analytic
const provider = yield* AnalyticsProvider;
const noTelemetryValue = yield* Config.option(Config.string("NO_TELEMETRY"));
const noTelemetryLegacy = yield* Config.option(Config.string("NO_TELEMTRY"));
Contributor

unrelated but why do we have legacy already? xd


const distinctId = yield* Effect.tryPromise(() => machineId()).pipe(Effect.orDie);
const projectId = hash(process.cwd());
const capture = telemetryDisabled
Contributor

Extremely hard to read; just put an early return inside `capture`: `if (telemetryDisabled) return;`
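
A minimal sketch of the early-return shape suggested here. The provider interface and `makeCapture` are hypothetical stand-ins; the real `Analytics` service is built with Effect and will look different.

```typescript
// Hypothetical minimal provider shape; the real AnalyticsProvider is an Effect service.
interface CaptureEvent {
  name: string;
  distinctId: string;
  properties: Record<string, unknown>;
}

interface AnalyticsProviderSketch {
  capture(event: CaptureEvent): void;
}

const makeCapture =
  (provider: AnalyticsProviderSketch, distinctId: string, telemetryDisabled: boolean) =>
  (name: string, properties: Record<string, unknown> = {}): void => {
    // Guard clause: bail out before doing any work when telemetry is off.
    if (telemetryDisabled) return;
    provider.capture({ name, distinctId, properties });
  };
```

The guard keeps a single definition of `capture` instead of choosing between two implementations with a conditional expression.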

title: string;
instruction: string;
expectedOutcome: string;
routeHint?: string;
Contributor

we shouldn't need SavedFlow and SavedFlowStep when we already have the ExecutedTestPlan?

return undefined;
};

const splitMarkerLines = (text: string, completeLinesOnly: boolean) => {
Contributor

this is tech debt.

there is literally no reason for this code...

i assume this is tech debt that's caused by this .finalizeTextBlock(receivedAt); changes that were made previously.

please fix the tech debt instead

we should ONLY CALL ExecutedTestPlan.addEvent(AcpSessionUpdate), there should be NO OTHER LOGIC.

if there is some other logic -> tech debt.

this remainingText is already in events key of ExecutedTestPlan if the addEvent is implemented correctly
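
A rough sketch of what "only call `addEvent`" could look like: the plan owns all merging logic, and the caller just folds updates into it. The event and update shapes and the class name are simplified, hypothetical stand-ins for `ExecutedTestPlan` and `AcpSessionUpdate`; step-marker handling is left out.

```typescript
// Hypothetical, simplified event and update shapes for illustration only.
type PlanEvent =
  | { kind: "agent-text"; text: string }
  | { kind: "tool-call"; name: string };

type SessionUpdate =
  | { type: "text-chunk"; text: string }
  | { type: "tool-call"; name: string };

class ExecutedPlanSketch {
  constructor(readonly events: ReadonlyArray<PlanEvent> = []) {}

  // Single entry point: every update goes through here and returns a new plan.
  addEvent(update: SessionUpdate): ExecutedPlanSketch {
    if (update.type === "tool-call") {
      return new ExecutedPlanSketch([
        ...this.events,
        { kind: "tool-call", name: update.name },
      ]);
    }
    const lastEvent = this.events[this.events.length - 1];
    if (lastEvent !== undefined && lastEvent.kind === "agent-text") {
      // Merge streamed chunks into the trailing text event.
      return new ExecutedPlanSketch([
        ...this.events.slice(0, -1),
        { kind: "agent-text", text: lastEvent.text + update.text },
      ]);
    }
    return new ExecutedPlanSketch([
      ...this.events,
      { kind: "agent-text", text: update.text },
    ]);
  }
}

// The caller is then just a fold over the incoming updates.
const applyUpdates = (updates: ReadonlyArray<SessionUpdate>): ExecutedPlanSketch =>
  updates.reduce((plan, update) => plan.addEvent(update), new ExecutedPlanSketch());
```

Because the accumulated text already lives in `events`, a caller has no need to track leftover text separately.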

new AgentText({ text: lastEvent.text + update.content.text }),
],
});
}).finalizeTextBlock(receivedAt, true);
Contributor

yeah please fix this haha XD

status: "active",
title: marker.title,
- startedAt: Option.some(DateTime.nowUnsafe()),
+ startedAt: Option.some(receivedAt),
Contributor

This is fine functionally, but when is `receivedAt` different from `DateTime.nowUnsafe()`?
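
One situation where the two could differ, offered as an assumption rather than something confirmed in this thread: if updates are stamped on arrival but processed later (for example after buffering), the processing-time clock reads later than the arrival time. A small sketch with an injected clock and hypothetical names:

```typescript
// Hypothetical names; `clock` is injected so the example is deterministic.
interface TimestampedUpdate {
  text: string;
  receivedAt: number;
}

interface ProcessedUpdate extends TimestampedUpdate {
  processedAt: number;
}

const makeReceiver = (clock: () => number) => {
  const queue: TimestampedUpdate[] = [];
  return {
    // Stamp the update at the moment it arrives.
    receive: (text: string): void => {
      queue.push({ text, receivedAt: clock() });
    },
    // Process later; the processing time may be well after arrival.
    drain: (): ProcessedUpdate[] =>
      queue.splice(0).map((update) => ({ ...update, processedAt: clock() })),
  };
};
```

If updates are always handled synchronously on arrival, the two timestamps would be effectively identical, which is the case the question is probing.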

sessionId: Option.none(),
prompt,
- systemPrompt: Option.none(),
+ systemPrompt: Option.some(systemPrompt),
Contributor

How does the system prompt get handled with ACP providers that don't support system prompts, like Codex?
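
One possible fallback, sketched under assumptions and not taken from this PR: when a provider cannot accept a separate system prompt, fold it into the user prompt. `PromptRequest`, `ProviderCapabilities`, and `supportsSystemPrompt` are hypothetical names.

```typescript
// Hypothetical request and capability shapes for illustration only.
interface PromptRequest {
  prompt: string;
  systemPrompt?: string;
}

interface ProviderCapabilities {
  supportsSystemPrompt: boolean;
}

const resolvePrompt = (
  request: PromptRequest,
  capabilities: ProviderCapabilities,
): PromptRequest => {
  // Nothing to do when there is no system prompt or the provider accepts one.
  if (request.systemPrompt === undefined || capabilities.supportsSystemPrompt) {
    return request;
  }
  // Fallback: prepend the system prompt to the user prompt.
  return { prompt: `${request.systemPrompt}\n\n${request.prompt}` };
};
```

Whether prepending is acceptable depends on how each provider weighs instructions in the user turn, so this is only a starting point for the discussion.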


3 participants