AI QA Automation — RealWorld

English · 한국어

A live-coding MVP in which an LLM driven by the Claude Agent SDK writes and runs Playwright E2E tests against demo.realworld.show. Agent activity, the generated TypeScript code, and the pass/fail result stream back to the browser in real time.

Quickstart

cp .env.example .env
# Edit .env to set ANTHROPIC_API_KEY=sk-ant-...

npm install                    # also installs the chromium browser via postinstall
npm run dev                    # http://localhost:3000

Open http://localhost:3000, optionally pick one of the 6 preset scenarios, and click Run.

Architecture (MVP)

Frontend: single static page (public/index.html + public/app.js), Tailwind via CDN, EventSource for SSE.
Server: Hono on Node 22 with tsx (no build).
Agent: Claude Agent SDK query() with three custom MCP tools (write_test_file, run_playwright, report_result). vendor/realworld-spec/SELECTORS.md is baked into the system prompt.
Test runner: Playwright spawned as a subprocess against runs/<runId>/test.spec.ts.
Concurrency: one run at a time (runLock). Requests that arrive while a run is in flight get HTTP 409.
No persistence: events are kept in memory and discarded 10 minutes after a run completes.
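
The one-run-at-a-time rule can be pictured as a single-slot lock. The following is a minimal sketch, not the project's actual runLock: the RunLock class, the tryAcquire method, and the result shape are hypothetical names chosen for illustration. Only the lock's purpose and the 409 status come from the list above.

```typescript
// Hypothetical single-slot lock; names are illustrative, not the project's code.
type AcquireResult =
  | { ok: true; runId: string; release: () => void }
  | { ok: false; status: 409; activeRunId: string };

class RunLock {
  private activeRunId: string | null = null;

  // Claim the slot if it is free; otherwise report the conflict.
  tryAcquire(runId: string): AcquireResult {
    if (this.activeRunId !== null) {
      return { ok: false, status: 409, activeRunId: this.activeRunId };
    }
    this.activeRunId = runId;
    return {
      ok: true,
      runId,
      release: () => {
        // Only the current holder clears the slot, so a stale release is a no-op.
        if (this.activeRunId === runId) this.activeRunId = null;
      },
    };
  }
}

const runLock = new RunLock();
const first = runLock.tryAcquire("run-1");
const second = runLock.tryAcquire("run-2"); // arrives while run-1 is in flight
console.log(first.ok, second.ok); // true false
```

In a route handler, one plausible pattern is to call tryAcquire at the top of POST /api/run, answer with the 409 status when ok is false, and call release in a finally block so a crashed run does not leave the slot occupied.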

For the full spec see docs/superpowers/specs/2026-05-07-ai-qa-automation-mvp-design.md. For the implementation walkthrough see docs/superpowers/plans/2026-05-07-ai-qa-automation-mvp.md.
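
To make the streaming and retention items in the architecture list concrete, here is a minimal sketch of an in-memory event buffer with a 10-minute retention window, plus a helper that serializes an event as a Server-Sent Events frame. RunEvent, RunEventStore, and toSseFrame are hypothetical names, and the three event types are assumed from the description of what is streamed; the actual project may structure this differently.

```typescript
// Illustrative only: RunEvent, RunEventStore, and toSseFrame are hypothetical names.
interface RunEvent {
  type: "activity" | "code" | "result"; // assumed event kinds
  data: unknown;
}

const RETENTION_MS = 10 * 60 * 1000; // 10 minutes after a run completes

class RunEventStore {
  private runs = new Map<string, { events: RunEvent[]; completedAt: number | null }>();

  append(runId: string, event: RunEvent): void {
    const run = this.runs.get(runId) ?? { events: [], completedAt: null };
    run.events.push(event);
    this.runs.set(runId, run);
  }

  // Mark a run finished; the retention window starts at this moment.
  complete(runId: string, now: number = Date.now()): void {
    const run = this.runs.get(runId);
    if (run) run.completedAt = now;
  }

  get(runId: string): RunEvent[] | undefined {
    return this.runs.get(runId)?.events;
  }

  // Drop completed runs whose window has elapsed; returns how many were removed.
  sweep(now: number = Date.now()): number {
    let removed = 0;
    this.runs.forEach((run, runId) => {
      if (run.completedAt !== null && now - run.completedAt >= RETENTION_MS) {
        this.runs.delete(runId);
        removed++;
      }
    });
    return removed;
  }
}

// Serialize one event as an SSE frame: an "event:" line, a "data:" line, then a blank line.
function toSseFrame(event: RunEvent): string {
  return `event: ${event.type}\ndata: ${JSON.stringify(event.data)}\n\n`;
}
```

Passing the clock in as a parameter keeps the retention logic testable without real timers; a server could call sweep from a periodic setInterval. JSON-encoding the payload keeps multi-line generated code on a single data line, which the SSE format requires.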

Demo notes

The target site (demo.realworld.show) is third-party. Run a quick health check before a live demo: curl -I https://demo.realworld.show.
Each run signs up a new fake user with a qa-<timestamp> username so repeated runs do not collide.
Hard caps: maxTurns=12, single retry, 60s test timeout. Adjust them in src/orchestrator.ts and playwright.config.ts.
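
The qa-<timestamp> naming rule from the notes above could be sketched as follows. Only the username pattern comes from this README; the makeTestUser helper, the TestUser shape, and the example.com email are assumptions made for illustration.

```typescript
// Hypothetical helper; the project may build its test identities differently.
interface TestUser {
  username: string;
  email: string; // assumed: derived from the username, on a reserved example domain
}

// Build a per-run identity from a millisecond timestamp.
function makeTestUser(now: number = Date.now()): TestUser {
  const username = `qa-${now}`;
  return { username, email: `${username}@example.com` };
}

const user = makeTestUser();
console.log(user.username);
```

A millisecond timestamp is unique only if two runs never start in the same millisecond. Because the server allows one run at a time, that condition holds here without adding a random suffix.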

Tutorial

Follow the full one-hour journey of building this MVP through conversational AI coding:

Vibe Coding Tutorial — 7 chapters + appendix covering brief → brainstorming → spec pivot → plan → subagent-driven execution → ship.

Chapters	Time	Cost (est.)	Key Topics
7 + appendix	~55 minutes wall clock	~$12–14 API	Superpowers skills, AskUserQuestion, RealWorld spec utilization, Claude Agent SDK + MCP tools, parallel subagent pipelining

Acceptance checklist (manual)

README quickstart works end-to-end on a clean clone in under 5 minutes.
Preset signup returns a PASS within 60 seconds against demo.realworld.show.
A purposely broken scenario ("click a button labeled 'Definitely Not Here'") returns a FAIL with a non-empty reason.
Submitting a second POST /api/run while one is in flight returns HTTP 409.
