Skip to main content
AIDiveForge AIDiveForge
Visit Agent-QA

Get This Tool

License: License: unverified
Local-run terms: Source code is publicly available on GitHub and can be self-hosted without vendor involvement. Users must bring their own LLM credentials or run local/open-source models.

Share This Tool

Compare This Tool
📋 Embed this tool on your site

Copy this code to embed a compact tool card:

Agent-QA

FreemiumOpen SourceAPISelf-HostedAgentic

Pricing

Free Tier
@vostride/agent-qa-subscription-auth required for certain features

Summary

Test suites that rely on hard-coded selectors break the moment a designer renames a button or restructures a modal — and the QA engineer spends Monday triaging failures that have nothing to do with actual regressions. agent-qa exists to absorb that maintenance overhead.

The tool lets you write test steps in plain language — 'Click on the Create issue icon', 'Verify that the created issue is shown' — and an agent translates those into browser actions at runtime, reading visible labels and screen state instead of fragile CSS selectors. After each run, it builds execution memory: observations about navigation contracts, UI quirks, and previously healed steps, which get injected into future runs so the agent stops rediscovering the same UI patterns. Self-healing means that when a component shifts, the agent iterates through recovery attempts rather than failing immediately. The ceiling appears when test logic branches on conditional application state — the YAML authoring model is built for linear flows, and complex branching sends teams back to scripting.

Bottom line: Pick agent-qa when your team's biggest QA problem is selector rot and maintenance overhead on stable user flows; look elsewhere when your test suite depends on conditional branching across application states that a linear natural-language step model cannot express.

Community Performance Report Card

No community ratings yet. Be the first to rate this tool!

Best For: Teams building web and mobile applications, QA engineers authoring tests in natural language, Projects requiring adaptive, self-healing test automation, Teams using custom or open-source LLM models

Community Benchmarks Community

No community benchmarks yet. Be the first to share a real-world data point.

  • Natural language test authoring against visible UI labels rather than DOM selectors, so a component rename or layout shift does not immediately break the test suite the way a hard-coded selector would.
  • Execution memory that accumulates across runs with trust scores and confirmation counts, which means the agent stops wasting run time rediscovering navigation patterns it has already mapped — later assertions stay focused on actual page behavior.
  • Self-healing iteration within a single run — when an action fails, the agent retries with updated screen state observation rather than failing the step immediately, so transient UI delays cause fewer false negatives.
  • Support for custom and open-source LLM models at the infrastructure level, so teams with data-residency requirements or API cost constraints can run inference locally without forking the tool.
  • Open-source codebase with self-hosted deployment option, which means teams are not locked into a vendor's uptime or data pipeline when running tests against internal staging environments.
  • The YAML step format is built for linear flows — action, verify, action, verify. Test scenarios that branch based on runtime application state (for example, different assertion paths depending on what a previous step returned from the server) have no native expression in the authoring model. Teams with conditional logic either maintain a parallel scripting layer or restructure tests into multiple flat suites, which defeats the maintenance advantage.
  • Execution memory is only as reliable as the trust scores the agent has accumulated. On a new application or after a major redesign, early runs produce low-confidence observations and the agent behaves closer to a first-run tool — the adaptive advantage appears after repeated runs against a stable-ish UI, not on day one.
  • Teams whose test requirements outgrow linear natural-language flows — particularly those already running Playwright or Cypress suites with custom fixtures, parameterized data, and programmatic assertions — will find agent-qa's authoring model too constrained and switch back to code-first frameworks where branching logic is a function call, not a workaround.

Community Reviews

No reviews yet. Be the first to share your experience.

About

Platforms
Web and mobile (Chromium, mobile drivers)
API Available
Yes
Self-Hosted
Yes
Last Updated
2026-06-01T03:18:42.492Z

Best For

Who it's for

  • Teams building web and mobile applications
  • QA engineers authoring tests in natural language
  • Projects requiring adaptive, self-healing test automation
  • Teams using custom or open-source LLM models

What it does well

  • Catch regressions before releases ship
  • Natural language test authoring for web and mobile applications
  • Self-healing test automation that adapts to UI changes
  • Building execution memory across test suites

Integrations

OpenAIAnthropicGeminiMCPGitHub ActionsDocker

Discussion Community

No discussion yet. Sign in to start the conversation.

Spotted incorrect or missing data? Join our community of contributors.

Sign Up to Contribute

Community Notes & Tips Community

Be the first to contribute. General notes, observations, gotchas, and tips from people who use this tool day-to-day.

Frequently Asked Questions

Is Agent-QA free?
Agent-QA is a paid tool. No permanent free tier is offered.
Is Agent-QA open source?
Yes. Agent-QA is open source — the source repository is at https://github.com/vostride/agent-qa.
Does Agent-QA have an API?
Yes. Agent-QA exposes a developer API. See the official documentation at https://vostride.com for details.
Can I self-host Agent-QA?
Yes. Agent-QA supports self-hosting on your own infrastructure.
What platforms does Agent-QA support?
Agent-QA is available on: Web and mobile (Chromium, mobile drivers).

Hours Saved & ROI Stories Community

Be the first to contribute. Concrete time/cost savings, with context. e.g. "Cut my code review backlog from 4h to 45m per week."

Agent-QA

Every test run against a live UI carries the risk that yesterday’s selector is today’s broken reference. agent-qa addresses this by replacing selector-based test scripts with natural language YAML files — steps written as human-readable actions and assertions — which an LLM-backed agent then executes by reading visible screen state, labels, and roles at runtime. The vendor describes the agent as planning steps autonomously, executing them via a browser, re-observing UI state, and iterating through multiple attempts within a single run when an action fails.

The differentiating feature is execution memory. After each run, agent-qa produces structured observations — tagged with trust scores and confirmation counts — about how the application behaves: which navigation elements stay visible, how a command palette scopes results, when a toolbar appears. These observations are categorized as product memory, suite memory, and test memory, and the docs describe them being added as context to future runs. The result is that an agent accumulates a working model of the UI over time, reducing redundant rediscovery and keeping assertions focused on actual behavior rather than UI scaffolding.

Agent-qa is built for teams running web and mobile test suites where the pain is maintenance, not test logic complexity. It is open-source with a public GitHub repository and the vendor states it supports custom and open-source LLM models — meaning teams not willing to send test data to a third-party API can run inference locally. The dashboard, CLI, and YAML authoring format are the primary surfaces. Where the model shows strain is in conditionally branching test flows: the step-by-step YAML format favors linear sequences, and test scenarios that fork based on runtime application state require workarounds outside the authoring model. Teams with that requirement will hit the ceiling before they finish the first complex suite.