CLI Coding Agents With an API

As of June 2026, AIDiveForge tracks 8 cli coding agents with an api. Curated cli coding agents with an api tracked by AIDiveForge. Listings are verified against each tool's live website and re-checked regularly.

Last updated June 10, 2026 · 8 tools

1. Antigravity 2.0
The vendor describes Project IDX as a browser-based IDE where agents handle multi-step coding tasks end-to-end: writing code, executing it, observing what breaks in a live preview, and self-correcting before handing back control. Multi-model support means you are not locked to a single provider when one model handles your stack better than another. The free tier exists but carries usage caps that surface quickly on longer agentic runs — teams hitting those caps mid-task face a hard stop, not a graceful queue. Browser-based architecture removes local setup friction but also removes offline access and the deep editor customization that engineers who have spent years tuning their environment tend to miss.
Paid
2. Blackbox AI
The platform routes requests through Claude, Codex, Grok, and its own models behind one encrypted endpoint, so you're not juggling separate subscriptions or API keys when you need to swap models mid-project. The Chairman multi-agent workflow runs parallel agents — refactor, test-gen, deploy, review — then scores and merges their outputs without you in the loop for every handoff. That architecture holds well for greenfield tasks and legacy modernization where the scope is well-defined. Where it gets unsteady is on tasks requiring judgment calls mid-execution: agents push forward, and catching a wrong turn in a 47-file refactor after the PR is staged costs more time than the automation saved.
Paid
3. Codeep
Codeep is an open-source, terminal-native autonomous agent that reads your project structure, plans a sequence of steps, edits files, runs shell commands, and checks its own output against your build and test suite before declaring done. You describe the goal; it handles the steps. The self-verification loop — where it catches a broken typecheck and fixes it without prompting — is the part that separates it from a glorified shell wrapper. The ceiling appears on projects where the agent's context window fills before it has mapped the full dependency graph; community reports suggest large monorepos with deep cross-module dependencies push that limit faster than single-service repos. At that point, teams either scope tasks more tightly or reach for a dedicated sub-agent delegation pattern.
FreeOpen Source
4. Dropstone 1.5
Dropstone coordinates swarm agents that map dependencies, verify cross-system impact, and generate fixes — without requiring you to hand-hold each step. The persistent memory layer means context from last Tuesday's refactor session is still live on Friday. For teams modernizing legacy systems or untangling multi-language monorepos, that continuity is the difference between useful suggestions and noise. The ceiling appears when branching logic across agents grows complex enough that the autonomous recovery loop starts producing confident-looking fixes that miss upstream side effects. At that point, teams add manual checkpoints — which is exactly what they were trying to avoid.
Paid
5. Pi Coding Agent
Pi runs in a loop with full tool-calling access — read, write, edit, bash — and surfaces four modes: interactive TUI, print/JSON for scripting, RPC, and an SDK for deeper integration. Sessions are stored as trees, so you can rewind to any prior message, fork from that point, and share the entire branch as a rendered URL. The extension and skills system lets you load context on-demand rather than stuffing everything into the system prompt at startup — which the docs describe as a deliberate choice to stay token-efficient. Where Pi stops short is also deliberate: sub-agents and plan mode are not included by default, so teams that need multi-agent parallelism or structured planning build or install extensions themselves. That tradeoff keeps the core minimal, but it means the complexity budget shifts from the tool to you.
FreeOpen Source
6. Runway
Orbit wraps agent runs in bounded execution cycles: one task selected from a dependency-ordered backlog, real test and lint gates that must pass before the task closes, and a structured artifact trail left after every run. You get four output files — agent result, rubric evaluation, a human-readable progress log, and an accept/iterate/stop recommendation — so you can audit what happened instead of re-running it from memory. The deterministic replay demo runs without an API key, which means you can inspect the full loop before wiring in Claude, Codex, or any other JSON-speaking CLI. The tool is intentionally scoped: it handles the harness, not the agent. Teams that need the agent itself to do more will hit that boundary fast.
PaidOpen Source
7. Tabby
Open-source, self-hosted AI coding assistant with code completion, chat, and agentic automation.
Free
8. Uktics
The vendor describes an agentic system that monitors repositories, detects broken builds and failing tests, generates patches autonomously, and submits pull requests for human review before anything merges. The human-approval gate is structural, not optional — the agent cannot merge without a sign-off, which matters for regulated or high-stakes codebases. The tool also handles routine work: dependency upgrades, security pattern enforcement, and code refactors across multiple repos. Budget controls and daily usage limits gate expensive operations by subscription tier, so cost surprises are bounded. Note: the scraped page content returned data for an unrelated product; all claims here are drawn from the validator context and structured tool data provided.
Paid

Listings on this page are sourced and verified by the AIDiveForge data pipeline. AIDiveForge is editorially independent — no money changes hands for inclusion.