Skip to main content
AIDiveForge AIDiveForge
🕷️

Browser Testing Agent Loop

Agents & Workflows · by AIDiveForge · Apr 17, 2026 · Advanced · ✓ 1 verified compat

Agentic loop that exercises a web app like a new user, files bug reports with screenshots + reproduction steps.

🧠 Why it works

Traditional E2E test suites only catch regressions in scenarios a human has already thought of. An agent with a vision model catches the messy class of bugs nobody scripted: broken layouts, accessible-but-invisible elements, misleading copy. The report-then-continue pattern prevents one bug from blocking discovery of others, and the screenshot-plus-DOM snapshot gives developers a concrete repro artifact, not a hallucinated bug description.

⚙️ How it works

1) Spin up a headless browser (Playwright). 2) Navigate to the start URL; capture full-page screenshot + accessibility tree. 3) Feed both to the vision LLM with a system prompt: 'you're a QA engineer; choose the next action that explores a new area of this app'. 4) Parse returned action: click / type / navigate. 5) Execute; re-capture. If a condition from the user story is violated (404, network error, visual regression vs a baseline), emit a bug record with: title, reproduction steps from the action log, severity, attached screenshot. 6) Continue until the budget is hit or every link has been touched. 7) Output: bugs.jsonl + a browsable HTML report.

Description

A reusable skill wrapping Playwright + a vision-capable LLM. Given a URL and an optional user story, it navigates the app, tries variations of user flows, and files structured bug reports (title, steps, expected, actual, severity, screenshot). Acts as a tireless QA pass for pre-release builds.

Install this skill

A Claude skill is a skill.md file with YAML frontmatter and a markdown body. Drop the file into your tool of choice — or pick a different format if you use Cursor, Windsurf, Copilot, or something else.

Download skill.md
mkdir -p ~/.claude/skills/browser-testing-agent-loop \
  && curl -L https://aidiveforge.com/skill/browser-testing-agent-loop.skill-md \
       -o ~/.claude/skills/browser-testing-agent-loop/skill.md

Save to ~/.claude/skills/browser-testing-agent-loop/skill.md

Recommended Use

Tools and workflow packs this skill pairs well with. Forge picks are auto-generated from category + capability signals; Community picks are added by people who've used the pairing.