
Launch feedback: what browser-agent failures should BrowserTrace capture? #3


BrowserTrace has reached its first public, launch-ready state. This issue collects workflow feedback from people building browser agents, not stars or upvotes.

Browser Use is the primary feedback path now. Stagehand, Skyvern, Playwright + LLM, and computer-use workflows are still supported as secondary integrations.

Try the current demo story first:

Browser Use tries to upload file:///tmp/browsertrace-report.html, navigates to the local file path instead, and the upload preview never appears. BrowserTrace replays the screenshot, URL, action, model output, status, and first red step so the failure is inspectable.
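
To see that exact failure locally, the demo command captures it and the local viewer replays it. A minimal sketch, assuming the bare browsertrace command opens the viewer as the install sequences further down suggest:

browsertrace demo    # capture the failing upload run described above
browsertrace list    # note the run ID of the demo trace
browsertrace         # replay it: screenshot, URL, action, model output, status, first red step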

Current comparison path:

browsertrace compare <failed_run_id> <success_run_id>
browsertrace compare <failed_run_id> <success_run_id> --json
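
If you paste comparison output into a comment, the JSON form can be trimmed first. The screenshot field below is only an assumption about the schema; check the real structure with jq before filtering:

browsertrace compare <failed_run_id> <success_run_id> --json > compare.json
jq 'keys' compare.json                                      # inspect the top-level fields first
jq 'del(.. | .screenshot?)' compare.json > shareable.json   # recursively drop any screenshot field before sharing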

Fastest local path from PyPI:

uvx --from "browsertrace[ui]" browsertrace doctor
uvx --from "browsertrace[ui]" browsertrace demo
uvx --from "browsertrace[ui]" browsertrace list
uvx --from "browsertrace[ui]" browsertrace

Persistent install from PyPI:

pip install "browsertrace[ui]"
browsertrace doctor
browsertrace demo
browsertrace

For public-safe trace sharing:

browsertrace export <run_id> --public -o public.html

--public omits prompt/model I/O, screenshots, and URLs from the standalone HTML export. For security-sensitive reports or changes, or anything that includes private trace data, follow the Security Policy before sharing details publicly: https://github.com/aaronlab/browsertrace/blob/main/SECURITY.md
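
One quick spot check before attaching a public export, assuming the demo run above: the redacted HTML should not contain the run's URLs, such as the local file path from the demo story.

browsertrace export <run_id> --public -o public.html
grep -c "file:///tmp/browsertrace-report.html" public.html   # expect 0 matches in the redacted export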

If you build with Browser Use, Stagehand, Skyvern, Playwright + LLMs, or computer-use stacks, please share:

  • What failed run was hardest for you to debug?
  • What data did you wish you had captured?
  • Which framework/runtime should BrowserTrace support better first?
  • Would a portable HTML export be enough for sharing failures, or do you need hosted links?
  • Which fields should be redacted before public sharing?
  • For failed-vs-good Browser Use runs, which final-result fields should comparison separate: extracted content, tool output, final result, retry/repair attempts, or one normalized summary?

Concrete traces, screenshots, sanitized snippets, and anonymized examples are especially useful.

For small docs fixes while discussing feedback, see the current good-first-issue queue and the First PR Recipe.
