Open Source Coding Assistants

As of June 2026, AIDiveForge tracks 33 open source coding assistants. Curated open source coding assistants tracked by AIDiveForge. Each project has a verified public source repository. Listings are verified against each tool's live website and re-checked regularly.

Last updated June 12, 2026 · 33 tools

1. Agent-QA
The tool lets you write test steps in plain language — 'Click on the Create issue icon', 'Verify that the created issue is shown' — and an agent translates those into browser actions at runtime, reading visible labels and screen state instead of fragile CSS selectors. After each run, it builds execution memory: observations about navigation contracts, UI quirks, and previously healed steps, which get injected into future runs so the agent stops rediscovering the same UI patterns. Self-healing means that when a component shifts, the agent iterates through recovery attempts rather than failing immediately. The ceiling appears when test logic branches on conditional application state — the YAML authoring model is built for linear flows, and complex branching sends teams back to scripting.
PaidOpen Source
2. AgentKitten
Orbit selects a task from a dependency-ordered backlog, hands it to the configured agent adapter, runs tests, lint, and type checks against the result, and only advances the orbit when those gates pass. Every run writes four artifacts: structured agent output, rubric scoring, an accept-or-iterate recommendation, and a human-readable progress log. The workflow is agent-neutral — Claude, Codex, Cursor, or any adapter you wire up behind the same contract. Where it breaks: Orbit is intentionally minimal, so teams expecting a hosted dashboard, a GUI, or built-in multi-agent parallelism will find precious little of that. The harness is a loop, not a platform.
FreeOpen Source
3. AI Pair Programmer for Emacs
CodeTutor is a free, open-source Emacs package that watches your file saves, gathers project context, and routes the diff to a local AI backend configured to respond like a senior engineer talking you through your own decision — not handing you the answer. The boundary is explicit by design: it will explain the concept, show a compact illustrative snippet, and recommend a next step, but it does not write into your files, produce patches, or hand you a paste-ready implementation. Architecture notes accumulate automatically in a `.codetutor/ARCHITECTURE.md` file as you work. This is early-stage, single-maintainer software with two commits on record — you are not buying into a mature product.
FreeOpen Source
4. AI-Engineering-Coach
The extension passively analyzes AI coding assistant activity across your workspace and surfaces usage metrics, prompt patterns, and code generation volume in a single dashboard — without requiring any API or cloud dependency. It covers any AI coding harness, not just Copilot, so teams running a mix of tools get consolidated signal instead of siloed logs. The anti-pattern detection flags weak prompting habits before they calcify across the team. Where it breaks: this is a read-only observer, not an enforcer. The docs describe an 'agentic readiness audit' framing, but no task is executed on your behalf — you get diagnostics, not automation.
FreeOpen Source
5. AICTL
Each 'orbit' is one task: the harness selects it from a dependency-ordered backlog, runs the agent, then requires passing tests, lint, and type checks before closing the loop — no proof, no progress. Every run produces structured JSON artifacts (agent output, rubric scoring, a human-readable progress log) that you can inspect or replay without re-running the agent. The deterministic replay demo runs without an API key, so you can see the full cycle before wiring in a real model. Orbit is intentionally small — no hosted infrastructure, no GUI — which keeps it auditable and keeps you in control, but also means everything outside the core loop is your problem to build.
FreeOpen Source
6. AutoMaxFix
AutoMaxFix runs a detect-reproduce-repair loop: it watches for test failures or runtime drift, surfaces one ticket at a time, lets an AI agent propose a patch, and stops cold until a human approves it. That deliberate stop is the point. The vendor describes it explicitly as 'the boring opposite of an autonomous agent' — one ticket, one patch attempt, one approval, one report. Every fix is logged with provenance so you can trace what changed and why. The ceiling arrives fast: the tool handles one ticket per execution, so teams running parallel failure streams will need external orchestration to manage the queue.
FreeOpen Source
7. Catcher
You describe tests in plain English, and Catcher's LLM-powered planner executes them in a real browser — no script authoring, no Selenium boilerplate. The vision-based fallback handles dynamic UIs where element selectors break, which is where most scripted test frameworks quietly start failing your CI. Because you supply the API key directly, LLM costs land on your own account — nothing is proxied through a vendor margin. The ceiling arrives when you need a test management dashboard, CI pipeline integrations, or a shared test artifact store across a team: the repo describes none of those, and you are building that infrastructure yourself.
FreeOpen Source
8. Cline
Open-source autonomous AI coding agent for VS Code and other IDEs, with human-in-the-loop approval, multi-provider support, and MCP extensibility.
FreeOpen Source
9. Code Review Graph
The tool builds a dependency graph of your codebase locally, then exposes that graph through MCP so Claude Code, Cursor, or any compatible assistant can ask targeted questions: which files are affected by this change, what is the impact radius, which communities cluster around this module. For large monorepos, this is the difference between a useful review context and a truncated one. The analysis runs entirely on your machine — no source code leaves the environment. The gap shows up when you need deep semantic understanding beyond structural imports; graph topology tells you what calls what, not whether the logic is correct.
FreeOpen Source
10. Codeep
Codeep is an open-source, terminal-native autonomous agent that reads your project structure, plans a sequence of steps, edits files, runs shell commands, and checks its own output against your build and test suite before declaring done. You describe the goal; it handles the steps. The self-verification loop — where it catches a broken typecheck and fixes it without prompting — is the part that separates it from a glorified shell wrapper. The ceiling appears on projects where the agent's context window fills before it has mapped the full dependency graph; community reports suggest large monorepos with deep cross-module dependencies push that limit faster than single-service repos. At that point, teams either scope tasks more tightly or reach for a dedicated sub-agent delegation pattern.
FreeOpen Source
11. Coherence
Coherence scans the links between code, docs, architectural decision records, tests, metrics, generated files, and API endpoints — and flags where those links have snapped. It runs locally, deterministically, with no external API calls by default, which means it fits inside a pre-commit hook or CI pipeline without sending your codebase anywhere. The checks are rule-based, not LLM-driven, so results are repeatable run-to-run. Where it breaks: Coherence detects drift but does not fix it, so the remediation loop is still manual. Teams with loosely structured repos get limited signal until they invest time defining what relationships Coherence should track.
FreeOpen Source
12. CoreTex
Orbit pulls one dependency-ordered task at a time from your backlog, hands it to whichever coding agent you connect, then refuses to mark it done unless tests, lint, and type checks pass. Every run writes four JSON or markdown artifacts: what the agent returned, how the work scored against a rubric, a human-readable mission log, and a recommendation to accept, iterate, or stop. The agent-neutral contract means you can swap Claude for Codex behind the same harness and compare structured artifacts instead of vibes. The ceiling appears fast on large repos: Orbit is intentionally small, so teams needing parallel agent execution, complex branching between task types, or CI integration will find themselves extending the harness manually.
FreeOpen Source
13. Enhanced Copy
The tool is a Chrome extension paired with an SDK: site owners author a prompt once, the extension wraps it around whatever the user selects, and the user pastes the whole package — prompt, selected content, source URL, content type — into whatever AI tool they already have open. There is no AI inference happening inside the extension itself; it is a copy-pipe, not an agent. That constraint is also the ceiling: it works for one-shot prompt-plus-content workflows, but the moment your use case requires routing output back into a system, chaining steps, or persisting results, the tool has no mechanism to do any of that. Teams needing those patterns wire this into a broader stack or stop here and reach for something that runs the model itself.
FreeOpen Source
14. Gito
Orbit wraps any JSON-speaking coding agent — Claude, Codex, Cursor, or your own — inside a loop that selects a dependency-ordered task, runs the agent, demands validation proof, and records every artifact before advancing. The output is structured JSON showing what the agent returned, rubric scoring for task focus and diff signal, and a human-readable mission log. Where it breaks: Orbit is intentionally small, which means teams that need hosted execution, a GUI, or a first-class CI/CD plugin will hit the boundary fast and find themselves wiring their own glue code. Teams experimenting with multiple agent frameworks get the most from it; teams shipping to production pipelines at scale will need to extend it.
FreeOpen Source
15. Knobkit
The vendor describes a scaffold-to-running-app path measured in seconds, not setup sessions. The core model is intentional minimalism: widgets plus handlers, nothing else wired by default. That constraint is exactly why it works for quick local demos — and exactly why it breaks when a project grows past a single-file scope. No API surface means automation or external orchestration is off the table. Teams that outgrow the single-file model migrate the logic into a conventional TypeScript stack and keep only the widget declarations, if they keep anything.
FreeOpen Source
16. Kodus AI
Kodus runs as an agent that watches pull requests across GitHub, GitLab, Bitbucket, and Azure Repos, posts inline comments, and can convert unresolved suggestions directly into tracked issues in Jira, Linear, or Notion. You write review rules in plain language — no DSL, no YAML policy files — and the agent applies them on every diff. Because you supply your own API keys and can self-host the full stack via Docker Compose, token costs are billed directly to your LLM provider, not marked up through Kodus. The ceiling appears when your rules grow complex enough that plain-language enforcement becomes ambiguous; at that point, teams either tighten the rule wording iteratively or accept occasional false-positive comments that engineers learn to dismiss.
PaidOpen SourceFree Trial · 14 days
17. KugelAudio
Orbit wraps agent runs in a controlled loop: pick a task from a dependency-ordered backlog, hand it to whichever agent backend you have configured, run tests and lint against the output, and write inspectable JSON artifacts before the task is ever marked complete. If the agent cannot pass the validation gate, the orbit does not close — no silent failures, no optimistic merges. The artifact trail covers what the agent returned, how the run scored against a rubric, and a human-readable recommendation to accept, iterate, or stop. It runs fully self-hosted with no hosted option and no API key required for the replay demo.
FreeOpen Source
18. LocalCode
Type what you want, get a suggested command, approve it, and it runs — no API key, no network request, no telemetry. All inference runs on Apple Silicon through the Foundation Models framework, which means your file paths, hostnames, and search terms never travel anywhere. The workflow is strictly one-shot: one prompt, one command suggestion, one approval gate. There is no session memory, no chaining, and no multi-step automation. Teams that want anything beyond single-command suggestions will hit the ceiling of what this proof-of-concept was designed to do.
FreeOpen Source
19. MandoCode
MandoCode is a .NET CLI agent that reads your project, proposes diffs, and applies changes across files — the full plan-search-edit loop, entirely on your machine. It is built on Semantic Kernel and RazorConsole, which renders a Spectre.Console terminal UI using Razor components and a virtual DOM. The agent is designed around C# and .NET codebases, so the file understanding and diff proposals are tuned for that ecosystem. Web search is available without a key but the vendor states a free Tavily key improves reliability. The ceiling appears when you push outside .NET: community reports on the GitHub page are thin, and the tool's own framing is explicit about its target audience.
FreeOpen Source
20. Memex
Orbit runs as a local harness that pulls one dependency-ordered task at a time, hands it to whichever coding agent you configure, then runs your tests, lint, and type checks before recording the result. Every run writes structured JSON artifacts — what the agent returned, how the output scored against a rubric, and a human-readable recommendation to accept, iterate, or stop. The audit trail is durable and replayable without an API key, which makes it usable in air-gapped environments. The tooling is intentionally minimal, so teams building on top of it will write their own adapter glue for agents that do not speak the expected JSON contract. Orbit does not manage the agent itself — it manages what the agent must prove.
FreeOpen Source
21. Mimirs
The vendor's own benchmark on a real project shows a prompt that consumed 380K tokens and took ~12 seconds dropping to 91K tokens and ~3 seconds after indexing — a 76% reduction. Mimirs gives Claude Code, Cursor, and compatible MCP clients a persistent, searchable memory layer for your codebase, stored entirely on your machine. It auto-generates a wiki and dependency graphs so your agent navigates structure instead of guessing at it. The ceiling appears on teams whose workflows require cloud sync, multi-machine access, or shared memory across developers — none of which a local-only architecture supports. Those teams end up pairing this with a hosted solution or abandoning it for one.
FreeOpen Source
22. Nanocode-CLI
The tool runs entirely in your terminal, talks to whatever LLM you point it at — local or remote — and edits files using line-and-hash anchors that reject a write if the target code has already drifted. That last detail matters more than it sounds: most agents will cheerfully overwrite a file that changed between the read and the write. nanocode refuses. The tradeoff is scope — the codebase is intentionally small, the feature surface is narrow, and teams who need a visual canvas, IDE integration, or a rich plugin ecosystem will hit the ceiling fast. For a restricted environment or a developer who wants to read every line of the agent loop before trusting it, that ceiling is the point.
FreeOpen Source
23. NodeCartel
Orbit wraps each coding agent run in a bounded loop: one task, validation gates (tests, lint, type checks), and a fixed set of JSON artifacts recording exactly what the agent returned, what the checks proved, and what should happen next. It is agent-neutral — Claude, Codex, Cursor, or any CLI that speaks JSON fits behind the same contract. The dependency-aware backlog means tasks run in order and only advance when the previous orbit closes cleanly. Where it stops: Orbit has no API and no dashboard, so teams that need live metrics or cross-run analytics build those themselves on top of the artifact files.
FreeOpen Source
24. Opencode
OpenCode is an open-source coding agent that runs in your terminal, a desktop app, or an IDE extension, connecting to 75+ LLM providers including local models. You can spin up multiple agents on the same project in parallel, share debug sessions via a link, and log in with your existing GitHub Copilot or ChatGPT Plus credentials rather than paying again. The no-data-storage architecture makes it viable in privacy-sensitive environments where cloud-only tools are ruled out. The ceiling shows up when you need validated, consistent model performance out of the box — that lives behind the paid Zen add-on, not in the free tier.
PaidOpen Source
25. Pi Coding Agent
Pi runs in a loop with full tool-calling access — read, write, edit, bash — and surfaces four modes: interactive TUI, print/JSON for scripting, RPC, and an SDK for deeper integration. Sessions are stored as trees, so you can rewind to any prior message, fork from that point, and share the entire branch as a rendered URL. The extension and skills system lets you load context on-demand rather than stuffing everything into the system prompt at startup — which the docs describe as a deliberate choice to stay token-efficient. Where Pi stops short is also deliberate: sub-agents and plan mode are not included by default, so teams that need multi-agent parallelism or structured planning build or install extensions themselves. That tradeoff keeps the core minimal, but it means the complexity budget shifts from the tool to you.
FreeOpen Source
26. Runway
Orbit wraps agent runs in bounded execution cycles: one task selected from a dependency-ordered backlog, real test and lint gates that must pass before the task closes, and a structured artifact trail left after every run. You get four output files — agent result, rubric evaluation, a human-readable progress log, and an accept/iterate/stop recommendation — so you can audit what happened instead of re-running it from memory. The deterministic replay demo runs without an API key, which means you can inspect the full loop before wiring in Claude, Codex, or any other JSON-speaking CLI. The tool is intentionally scoped: it handles the harness, not the agent. Teams that need the agent itself to do more will hit that boundary fast.
PaidOpen Source
27. SIMD Agent
Orbit is an MIT-licensed open-source harness that wraps any JSON-speaking CLI agent — Claude, Codex, Cursor, or otherwise — in a bounded loop: select one task from a dependency-aware backlog, run the agent, gate on real validation (tests, lint, type checks), and write inspectable artifacts before closing the orbit. Every run produces four JSON/markdown files recording what the agent returned, how the output scored against a rubric, whether to accept or iterate, and a human-readable mission log. The harness is intentionally small, so there is precious little abstraction to hide behind — what you see is what runs. Teams with strict audit requirements get durable, reviewable evidence without instrumenting the agent itself. The trade-off is that Orbit is a harness framework, not a turnkey product: you bring the agent, the backlog structure, and the validation suite.
FreeOpen Source
28. Skills
Orbit is a CLI harness that wraps any JSON-speaking coding agent — Claude, Codex, Cursor, or your own — in a bounded loop: one task selected from a dependency-ordered backlog, executed by the agent, then checked against tests, lint, and type validation before the orbit closes. If the agent cannot prove the work, the run does not advance. Every orbit writes structured JSON artifacts and a human-readable progress log, so you are reviewing evidence rather than re-reading diffs and guessing. The harness runs entirely locally, requires no API key for the replay demo, and is MIT licensed. Where it breaks: teams whose validation needs go beyond tests and lint — custom scoring rubrics, multi-step human approval workflows, or large parallel backlogs — will find the intentionally small surface area a ceiling rather than a feature.
FreeOpen Source
29. Stagewise
Open-source agentic IDE with embedded frontend coding agent that runs in your browser on localhost.
PaidOpen Source
30. Supertonic
Orbit structures agent execution around a single concept: one task, one orbit, bounded by real checks — tests, lint, type validation — and recorded in inspectable JSON artifacts before anything advances. The vendor describes it as agent-neutral: Claude, Codex, Cursor, or any JSON-speaking CLI slots in behind the same contract, so teams can swap agents and compare output artifacts instead of gut feelings. The architecture is intentionally small, which means the harness is easy to verify and replay, but it also means Orbit does not ship workflow UI, cloud hosting, or a managed backlog service. Teams with complex multi-agent pipelines or a need for a hosted dashboard will be assembling those pieces themselves. Where it shines is the messy middle: failing tests handed to an agent, with proof required before the task closes.
FreeOpen Source
31. Unspaghettit
Orbit wraps each coding-agent invocation in a bounded loop: it selects a dependency-ordered task from a backlog, runs the agent, then gates advancement on passing tests, lint, and type checks — not on the agent's self-report. Every run writes structured JSON artifacts and a human-readable progress log, so you can inspect what changed and why a task closed or stalled. The deterministic replay demo runs without an API key, which means you can verify the harness behavior before committing any agent credits. The ceiling appears when your workflow needs anything beyond CLI-compatible agents — there is no API and no visual interface.
FreeOpen Source
32. Wandesk
Wandesk is a free, open-source desktop application that generates functional local apps — calorie trackers, invoice generators, expense trackers — from natural language prompts, running entirely on your machine. The agent core handles code generation and execution autonomously, so a non-technical user can request a reading list manager and get a working desktop utility, not a code snippet to paste somewhere. Native integrations with Claude Code and Codex mean developers can wire the tool into repository workflows without an intermediary layer. The ceiling appears when your generated app needs persistent state across multiple interconnected tools or when branching logic between agent steps grows beyond a single-purpose utility. Teams building anything that resembles a product rather than a personal utility will hit that ceiling and reach for a dedicated app framework instead.
FreeOpen Source
33. WinkTerm
Orbit wraps each coding-agent run in a bounded loop: one task selected from a dependency-ordered backlog, executed by whatever CLI agent you hand it, then validated through tests, lint, and type checks before the orbit closes. Every run writes structured JSON artifacts — what the agent returned, how the diff scored, whether the reviewer should accept or iterate. This is not an agent itself; it is the scaffold that keeps agents accountable. The ceiling appears when your workflow needs dynamic replanning or multi-agent coordination across parallel tasks — Orbit's contract is deliberately single-focus, and teams that outgrow that boundary are maintaining a layer above the harness.
FreeOpen Source

Listings on this page are sourced and verified by the AIDiveForge data pipeline. AIDiveForge is editorially independent — no money changes hands for inclusion.