Browser Testing Agent Loop
Agentic loop that exercises a web app like a new user, files bug reports with screenshots + reproduction steps.
🧠 Why it works
Traditional E2E test suites only catch regressions in scenarios a human has already thought of. An agent with a vision model catches the messy class of bugs nobody scripted: broken layouts, accessible-but-invisible elements, misleading copy. The report-then-continue pattern prevents one bug from blocking discovery of others, and the screenshot-plus-DOM snapshot gives developers a concrete repro artifact, not a hallucinated bug description.
⚙️ How it works
1) Spin up a headless browser (Playwright). 2) Navigate to the start URL; capture full-page screenshot + accessibility tree. 3) Feed both to the vision LLM with a system prompt: 'you're a QA engineer; choose the next action that explores a new area of this app'. 4) Parse returned action: click / type / navigate. 5) Execute; re-capture. If a condition from the user story is violated (404, network error, visual regression vs a baseline), emit a bug record with: title, reproduction steps from the action log, severity, attached screenshot. 6) Continue until the budget is hit or every link has been touched. 7) Output: bugs.jsonl + a browsable HTML report.
Description
A reusable skill wrapping Playwright + a vision-capable LLM. Given a URL and an optional user story, it navigates the app, tries variations of user flows, and files structured bug reports (title, steps, expected, actual, severity, screenshot). Acts as a tireless QA pass for pre-release builds.
Install this skill
A Claude skill is a skill.md file with YAML frontmatter and a markdown body.
Drop the file into your tool of choice — or pick a different format if you use Cursor, Windsurf, Copilot, or something else.
mkdir -p ~/.claude/skills/browser-testing-agent-loop \
&& curl -L https://aidiveforge.com/skill/browser-testing-agent-loop.skill-md \
-o ~/.claude/skills/browser-testing-agent-loop/skill.md
Save to ~/.claude/skills/browser-testing-agent-loop/skill.md
Recommended Use
Sign in to suggestTools and workflow packs this skill pairs well with. Forge picks are auto-generated from category + capability signals; Community picks are added by people who've used the pairing.
Forge Auto-recommended matches
Workflow packs
-
Podcast Production Automation post 31%
Capture, transcribe, edit, and distribute podcast episodes with automatic audio cleanup, timestamped transcripts, and multi-platform preparation—reducing manual editing from 5 hours to 1 hour per episode.
Why: caps 0/0 · io-match · name-mention
-
Multilingual Content Localization post 31%
Convert English content into multiple languages with culturally adapted audio and subtitled videos. Reach global audiences in 2-3 hours instead of weeks of manual translation.
Why: caps 0/0 · io-match · name-mention
Report compatibility
Tell the community which tool you used this skill with, and whether it worked.
Suggest a pairing
Recommend a tool or workflow pack that this skill works well with. Up to 5 recommendations per day.