BATTLE REPORT · Q2 2026 · FULL SCOPE EDITION
AI CODING
AGENT WAR
24 Contenders · Every Category · One Winner
Terminal / CLI Agents · Cloud / Async Agents · AI-Native IDEs · IDE Extensions · Open Source / BYOK · Browser / Vibe Coding
Revised pass: benchmark caveats clarified, stale pricing and packaging corrected, navigation rebuilt, keyboard/search controls added, and weaker claims softened where the public evidence was thin.
Overall read
Hybrid stacks win
No single tool owns every workflow. The strongest setups combine a deep agent, a fast day-to-day editor, and an async backlog worker.
Benchmark caution
Not every score is apples-to-apples
This revision separates official benchmark results, vendor-reported numbers, and product-level observations so the page reads more honestly.
What changed
Packaging fixed
Several cards had stale pricing, bundling, or plan details. Those were updated or softened to match current public product pages.
How to use this page
Filter, jump, deep-link
Use the search box, quick filters, category tabs, and card permalinks. Press / to focus search and E to expand or collapse visible cards.
SWE-Bench Verified · Q2 2026
THE SCOREBOARD

Real bug-fix benchmarks matter, but this category mixes agent harness results, model-only results, and vendor-reported scores. This pass keeps the scoreboard because it is useful, but it now treats the numbers as directional rather than perfectly comparable.

Read this carefully: Claude Code is an agent result on SWE-bench Verified. Cline’s 80.8% figure is vendor-reported for a specific model/configuration. Gemini 3’s 76.2% figure is a model result, often cited around Antigravity but not equivalent to an audited Antigravity product benchmark. Codex’s 77.3% figure is from Terminal-Bench 2.0, which is a different benchmark family.
Claude Code (Anthropic) · 80.9% ★
Cline + Claude Sonnet 4.5 · 80.8%
78.0%
Codex CLI · Terminal-Bench 2.0 · 77.3%†
Antigravity (Google, Gemini 3 Pro) · 76.2%
GPT-5.2 model (OpenAI) · 69.0%
GitHub Copilot (GPT-4o backend) · ~44%
Windsurf SWE-1.5 model (Cognition) · 40.1%‡

★ = strongest currently published signal in this page’s scope. † = Terminal-Bench 2.0 rather than SWE-bench Verified. ‡ = vendor/model claim rather than a directly comparable agent harness result. When a product does not publish a rigorous public benchmark, the page weights workflow fit, packaging, deployment model, and current product maturity more heavily.

Method & confidence
HOW THIS PASS WAS AUDITED

The original page had strong structure and a useful taxonomy. The weak points were mostly comparability, stale packaging/pricing, and a handful of claims that were phrased more confidently than the public evidence justified.

Confidence model

Claims were tightened into three buckets so the page reads more cleanly:

  • Official docs / pricing pages
  • Vendor-reported performance
  • Observed UX / category fit

Scoring philosophy

Public benchmark numbers matter, but workflow architecture matters too: terminal vs IDE, sync vs async, hosted vs local, and procurement or compliance friction all change the real winner.

What was corrected

  • Outdated plan names, bundling, and starting prices.
  • Benchmark language that implied stronger comparability than the underlying data supported.
  • Several adoption, ranking, and market-share claims that were too brittle or thinly sourced.

Still moving fast

Agent products change packaging, model access, and feature gates very quickly. Treat this as a high-quality snapshot, not a permanent ranking. Re-audit quarterly.

Full Combat Analysis
THE COMBATANTS

Entries stay in the original structure, but copy now separates hard evidence from category judgment more clearly. Use the controls above to narrow by workflow instead of reading top-to-bottom.

S — Best in class, no caveats
A — Excellent, clear use case leader
B — Solid, competitive, specific niche
C — Limited scope or notable caveats
V — Vibe/browser tier (different category)
Terminal / CLI Agents
Claude Code (Anthropic) · Terminal + IDE Orchestrator Agent
Terminal-First · Sync · MCP · Claude Pro $20 · Max from $100 · API pay-as-you-go
S TIER
Official docs checked · Vendor claims marked when relevant
SWE-bench Verified 80.84%
Context up to 1M
Surfaces terminal · IDE · desktop · web
Tooling MCP + subagents
Access Claude Pro / Max or API
Claude Code remains the strongest publicly benchmarked agent in this lineup. The revision here removes shakier market-share and ranking claims and keeps the case anchored to what is easier to verify: top-tier agent performance on SWE-bench Verified, broad Claude Code availability across terminal / IDE / desktop / browser surfaces, MCP support, and recently documented subagent workflows. The practical read is unchanged: it is still the safest pick for ambiguous, multi-file, architecture-heavy work.
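As a concrete sketch of what that MCP extensibility looks like in practice, a project-scoped `.mcp.json` registers external tool servers for the agent to call. The server name, npm package, and environment variable below are hypothetical placeholders:

```json
{
  "mcpServers": {
    "docs-search": {
      "command": "npx",
      "args": ["-y", "@example/docs-mcp-server"],
      "env": { "DOCS_ROOT": "./docs" }
    }
  }
}
```

Checked-in project configs like this are also how teams give every contributor's sessions the same tool surface, which is where much of the ecosystem leverage comes from.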
Strengths
  • Highest clearly published agent benchmark result in this comparison set
  • MCP ecosystem and subagent patterns make it unusually extensible
  • Available across terminal, IDE, desktop, and browser surfaces
  • Context can stretch to very large codebases where smaller windows start collapsing
  • Excellent at planning, tool use, and code review loops on hard problems
Weaknesses
  • Heavy usage can get expensive quickly, especially on Max or raw API billing
  • Still primarily synchronous: it rewards an operator who stays in the loop
  • Terminal-first workflow is excellent for power users and less friendly for casual users
  • Model choice is narrower than fully BYOK open-source stacks
Combat Verdict: Still the technical front-runner here, but now framed more honestly: best for hard engineering work, not automatically best for every budget, editor preference, or procurement environment.
Codex (OpenAI) · Desktop App + CLI Agent
Desktop App · CLI · Model Router · Included with ChatGPT plans · usage/credits can still apply
A TIER
Official docs checked · Vendor claims marked when relevant
Terminal-Bench 2.0 77.3%
Surfaces app · CLI · web · IDE
Plugins 90+ in current app builds
Model routing GPT-5.x Codex + GPT-5.x
Entry point ChatGPT account
Codex is no longer best understood as “just a CLI.” The same agent now spans the ChatGPT app, CLI, web, and editor integrations. That makes the product story stronger than the original draft suggested. This pass also corrects the pricing language: access is bundled with eligible ChatGPT plans, but depending on workspace setup and volume, usage or credits can still matter. The core thesis holds: excellent value, excellent GitHub adjacency, very low activation energy for people already living in ChatGPT.
Strengths
  • Strong value when you already pay for a qualifying ChatGPT plan
  • One agent across app, web, CLI, and editor surfaces keeps context transfer simple
  • Fast, reliable on bounded multi-step engineering tasks
  • Good plugin/tool story and strong GitHub-centric workflow
  • Terminal-Bench result makes it one of the stronger publicly signaled options
Weaknesses
  • Benchmark is from Terminal-Bench 2.0, so it is not perfectly comparable with SWE-bench entries
  • Complex open-ended reasoning still tends to trail Claude-backed stacks
  • Cost clarity can depend on plan type, workspace policy, and extra usage
  • Less open-ended extensibility than the broadest MCP or BYOK ecosystems
Combat Verdict: Best value pick for users already inside ChatGPT’s ecosystem, especially if they want one agent that follows them from app to terminal to editor.
Gemini CLI (Google) · Open-Source Terminal Agent
Official docs checked · Vendor claims marked when relevant
Context 1M tokens
License Apache 2.0
Free quota 60 req/min · 1000/day
Tools built-in + MCP
Modalities text + images + PDFs
Gemini CLI is stronger than the original draft implied in one important respect: it is not just “the free CLI.” It is also an open-source, multimodal, 1M-context terminal agent with built-in tools and MCP support. The benchmark language has been tightened, though. Public Gemini 3 model results are promising, but they should not be read as a direct audited product score for Gemini CLI itself. Even with that caveat, this remains one of the best entry points for large-context terminal work.
Strengths
  • Legitimate free entry point with unusually generous personal-account quotas
  • 1M token context is useful for whole-repo exploration and wide architectural prompts
  • Open-source and extensible rather than black-box only
  • Multimodal prompts are genuinely useful for screenshots, diagrams, and PDFs
  • Cloud Shell availability lowers setup friction for Google-centric workflows
Weaknesses
  • Public model scores do not automatically equal product-level agent reliability
  • Hard multi-file refactors still tend to be more consistent on Claude-backed stacks
  • Google now has multiple overlapping coding surfaces, which can make positioning confusing
  • Enterprise governance and audit posture are less central than in some enterprise-first tools
Combat Verdict: The best free terminal option in this page, and a better one than the original copy gave it credit for.
Aider · Git-Native Terminal Agent
Official docs checked · Vendor claims marked when relevant
License MIT
Model Support Any BYOK
Workflow Every edit = a commit
Audit Trail Full git history
Aider operates on a foundational principle: every AI edit is a commit. Every session is a reviewable, revertible branch. For developers who want AI assistance that respects existing git workflows, nothing matches this. Bring your own API key — Claude, GPT, Gemini, any provider. Runs entirely in the terminal. Best for solo developers and small teams who prioritize full auditability, no black boxes, and complete cost transparency. No proprietary model advantage, but no vendor lock-in either.
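The audit-trail model is easy to see with plain git. This sketch simulates the commit-per-edit pattern by hand rather than invoking aider itself; file names and commit messages are illustrative:

```shell
# Simulate aider's commit-per-edit audit trail with plain git.
# Aider automates this loop; here each step is done by hand.
set -e
repo=$(mktemp -d)
cd "$repo"
git init -q
git config user.email dev@example.com
git config user.name dev

echo 'print("hello")' > app.py
git add app.py
git commit -qm "feat: initial app"         # human commit

echo 'print("hello, world")' > app.py
git commit -qam "aider: expand greeting"   # one AI edit = one commit

git revert -n HEAD                         # unwanted? revert cleanly
git commit -qm "revert: undo AI edit"

git log --oneline                          # full, replayable history
```

The practical payoff: `git log` gives a reviewable trail, and any AI change can be reverted commit-by-commit instead of untangling one large mixed diff.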
Strengths
  • Every edit is a commit — maximum audit trail and rollback capability
  • Free tool — only pay LLM provider rates
  • BYOK: works with Claude, GPT, Gemini, Ollama, any provider
  • MIT licensed — true open source, zero vendor dependency
  • Terminal-native; composes with all Unix tooling
Weaknesses
  • No proprietary model advantage — ceiling is the BYOK model ceiling
  • No IDE integration — terminal only
  • Raw API costs can add up on heavy usage
  • Less polished UX than commercial alternatives
Combat Verdict: The terminal purist's pick. If you want an immutable AI audit trail and no vendor lock-in, Aider is the answer. Paired with a strong model such as Claude Sonnet over the API, it approaches top-tier agent capability at raw API cost.
OpenCode · Open-Source Terminal Agent
Official docs checked · Vendor claims marked when relevant
License MIT
Model support 75+ providers
Workflow plan-first + approvals
Usage model BYOK or managed
Status fast-moving OSS
OpenCode stays in the lineup because the architecture is compelling: terminal-native, broad provider support, approval-based execution, and a plan-first posture that fits cautious engineering teams. The original version leaned too hard on unaudited popularity numbers, so those were removed here. The real story is simpler and more durable: it is one of the more credible open-source challengers in the terminal-agent lane.
Strengths
  • Very broad provider support gives teams genuine model flexibility
  • Plan-first and approval-based execution reduce surprise edits
  • MIT licensing keeps the stack portable and auditable
  • Interesting option for teams that want terminal power without vendor lock-in
Weaknesses
  • Still less battle-tested than the strongest commercial leaders
  • Capability ceiling depends heavily on the model you bring
  • Community momentum is high, but production maturity is still catching up
Combat Verdict: Worth watching closely, but the case is stronger as a fast-rising open-source contender than as a finished market winner.
Cloud / Async Agents
Jules (Google) · Async Cloud Agent
Official docs checked · Vendor claims marked when relevant
Mode async cloud agent
Repo flow GitHub → secure cloud VM → PR
Extras audio changelogs
Automation Jules CLI + API
Status public beta
Jules remains the clearest async specialist in the field. The pricing text is simplified here because the most stable public signal is that Jules is available in beta with usage limits, while deeper packaging can shift as Google evolves the product. What matters most is the workflow: queue work, let it run in a secure cloud VM, and review a pull request later. That keeps Jules in a distinct category from pair-programming agents.
Strengths
  • Async-by-default workflow is genuinely different and genuinely useful
  • CLI and API make it easier to connect with broader engineering workflows
  • PR-centric output fits existing review and CI patterns well
  • Great for scoped backlog items, dependency chores, and repetitive bug work
  • Free beta availability lowers experimentation cost
Weaknesses
  • Not the right fit for tight iterative debugging loops
  • Success depends heavily on prompt clarity and task scoping
  • Benchmark evidence is thinner than for the strongest synchronous agents
  • Feature packaging is still evolving as the product matures
Combat Verdict: Still the best backlog-clearing specialist here. Think contractor, not copilot.
Devin (Cognition) · Autonomous Cloud Agent
Official docs checked · Vendor claims marked when relevant
Mode fully autonomous cloud agent
Environment IDE + browser + terminal + shell
Planning interactive review before execution
Repo docs Devin Wiki
Output PRs for defined tasks
Devin is still the most autonomy-forward product in the page, but this revision tones down the certainty around a few of its splashier numbers. Where the original copy treated merge-rate and cost language as settled facts, this version frames them more cautiously. The durable point is that Devin is strongest when you can hand off bounded, verifiable work into an isolated environment and review the result later.
Strengths
  • Full sandboxed environment enables deeper autonomous execution than most rivals
  • Interactive planning and Devin Wiki help reduce some handoff ambiguity
  • Useful for repetitive, bounded, reviewable task classes
  • Parallel autonomous sessions can create real throughput on maintenance work
Weaknesses
  • Economics are still harder to predict than flat-seat editor tools
  • Autonomy shines on well-scoped tasks and degrades faster on ambiguous work
  • Marketing narrative can outrun what day-to-day teams can safely delegate
  • Requires stronger review discipline than the product demos can make it seem
Combat Verdict: Best understood as a delegation tool for clearly specified work, not a universal replacement for interactive development.
Autonomy note: cloud-autonomous agents look strongest in demos and clearly scoped tasks. Review overhead remains a real part of safe production use.
AI-Native IDEs
Cursor · AI-Native Editor
Official docs checked · Vendor claims marked when relevant
Models OpenAI · Anthropic · Gemini · xAI · Cursor
Workflow Composer 2 + Auto mode
Reach trusted by 50%+ of Fortune 500
Review Bugbot + MCP support
Base VS Code fork
Cursor still feels like the most polished AI-native editor in the mainstream lane. This pass removes brittle user-count and drama-heavy copy and keeps the case grounded in what Cursor publicly emphasizes today: a wide model menu, Auto mode, Composer 2, MCP support, and broad enterprise penetration. The net assessment barely changes. Cursor remains one of the best daily-driver editors for VS Code-oriented teams.
Strengths
  • Excellent editor feel and integration for people who live in VS Code
  • Auto mode and broad model access reduce manual routing friction
  • Composer 2 keeps multi-file edits inside a strong editing experience
  • Bugbot and MCP support improve team-scale review and extensibility
  • Official messaging around enterprise traction is very strong
Weaknesses
  • Still fundamentally tied to the VS Code family rather than true multi-IDE coverage
  • Hard architectural or ambiguous tasks still push many users toward Claude Code
  • Pricing and packaging have shifted enough that teams should verify before standardizing
  • Best-in-class daily flow does not necessarily equal best-in-class autonomous depth
Combat Verdict: Still one of the best day-to-day editors in the market; just not the final answer for every hard engineering problem.
Windsurf (Cognition) · Multi-IDE AI Editor
Official docs checked · Vendor claims marked when relevant
Users 1M+
Enterprise 4,000+ customers
Coverage JetBrains + VS Code + more
Memory Cascade
Vendor speed claim up to 950 tok/s
Windsurf remains the strongest “not just VS Code” mainstream editor in this page. This revision removes shakier ranking language and reframes the speed story as vendor-claimed rather than universal fact. What survives the audit is still substantial: broad IDE coverage, persistent Cascade context, aggressive product velocity, and a clear appeal for JetBrains-heavy shops that do not want to be second-class citizens.
Strengths
  • Broad IDE coverage is a real strategic advantage over VS Code-only rivals
  • Cascade persistent memory is meaningful in ongoing project work
  • Strong product velocity and serious enterprise presence
  • Compelling option for JetBrains-centric teams that want a more agentic editor
  • Very fast model options remain attractive for iterative workflows
Weaknesses
  • Vendor-reported model speed and benchmark numbers are not the same as independently comparable agent scores
  • Product positioning relative to Devin and Cognition’s broader stack is still a little messy
  • Raw reasoning quality still depends heavily on which model path you use
  • Teams should test stability in their actual IDE mix before standardizing
Combat Verdict: Still the default recommendation for many JetBrains users, with the caveat that some of its most dramatic numbers need to be read as product-marketing signals, not settled truth.
Performance note: extremely high speed figures and SWE-1.5 benchmark references are part of Windsurf/Cognition’s product narrative. Treat them as important signals, but not clean substitutes for independently comparable agent benchmarks.
Zed · Open-Source High-Performance Editor
Official docs checked · Vendor claims marked when relevant
Latency Sub-50ms
Rendering GPU-accelerated
Collab Real-time multi-user
Models Multi (cloud + local)
Editor Free + open source
Zed remains the speed-first outlier in this field. The biggest audit fix here is pricing: the old $20/month framing was stale. Zed’s current positioning is a lighter paid AI layer on top of a fast open-source editor rather than a direct one-to-one pricing mirror of Cursor. That makes the value story cleaner for people who primarily care about editor performance and collaboration.
Strengths
  • Rust-native architecture and GPU rendering still make it feel unusually fast
  • Real-time collaboration is built into the editor rather than bolted on later
  • Open-source editor base keeps entry cost low
  • Interesting option for users who do not want a VS Code fork
  • Supports both cloud and local AI workflows
Weaknesses
  • Extension and ecosystem depth still trail the biggest editor platforms
  • AI workflow depth is improving but not as agent-heavy as the leaders
  • Migration cost is real if your team is deeply standardized on VS Code or JetBrains
Combat Verdict: Still the best “I care about the editor itself” pick, especially after correcting the pricing story.
Antigravity (Google) · Agentic Development Platform
Official docs checked · Vendor claims marked when relevant
Status Google preview
Model context Gemini 3.1 Pro rollout
Positioning agentic development platform
Workflow prompt-to-build flows
Benchmark note Gemini 3 model scored 76.2%
Antigravity stays in the page as a high-upside preview, but its description needed the heaviest audit. The old copy treated product lineage, rankings, and benchmark language too confidently. This revision narrows the claim set to what is easier to support publicly: Google is positioning Antigravity as an agentic development surface tied to Gemini 3-era capabilities, and Gemini 3 model benchmarks are strong. That is promising context, not the same thing as a fully audited product benchmark or mature daily-driver verdict.
Strengths
  • Google is clearly investing in this area rather than treating it as a side experiment
  • Preview access makes it cheap to monitor and test
  • Gemini 3-era model capability gives the product interesting upside
Weaknesses
  • Too early and too fluid to rank confidently against mature daily-driver tools
  • Published evidence is stronger for the underlying model family than for the product UX itself
  • Preview products often change packaging and positioning quickly
Combat Verdict: A serious watchlist product, not yet a stable recommendation.
Benchmark note: the 76.2% figure belongs to Gemini 3 model reporting. It is useful context around Antigravity, but it does not by itself prove Antigravity product quality.
IDE Extensions & Plugins
GitHub Copilot · IDE Extension + Platform Assistant
Official docs checked · Vendor claims marked when relevant
Users 26M+ developers
Surfaces editor · command line · GitHub
Modes chat · completion · agent
Coverage VS Code · JetBrains · Visual Studio
Free tier available
GitHub Copilot is still the distribution king. The audit pass mostly corrects stale scale language and simplifies the go-to-market story: Copilot now spans editor, command line, and GitHub surfaces, and a real free tier exists for individuals. The technical ceiling may not excite the most demanding users, but Copilot remains unusually easy to deploy across mainstream organizations.
Strengths
  • Enormous distribution and very low organizational friction
  • Works across editor, command line, and GitHub workflow surfaces
  • Free tier lowers the barrier to entry for individuals
  • Broad IDE support makes standardization easier
  • For many teams, it is the easiest “good enough everywhere” default
Weaknesses
  • Advanced reasoning and deeper agent loops still trail the strongest specialist tools
  • Power users often outgrow it for harder multi-file or architecture-heavy work
  • Procurement ease can mask that the technical frontier has moved beyond it
  • Feature depth can vary across the many surfaces and IDEs it supports
Combat Verdict: Still the enterprise-safe floor. Add a stronger specialist beside it when the work gets harder.
Cline · Open-Source IDE Agent
Official docs checked · Vendor claims marked when relevant
Installs 5.0M+
GitHub 60.5k stars
SWE-bench note 80.8% vendor-reported
License Apache-2.0
Extensibility MCP-first
Cline keeps its place as one of the strongest open and extensible agents in the page. This revision sharpens the benchmark wording: the headline score is worth noticing, but it should be read as a vendor-reported configuration result rather than a fully equivalent official agent benchmark. Even with that caveat, Cline’s appeal is strong: open, extensible, broad surface support, and no subscription markup on model usage.
Strengths
  • One of the strongest open-source agent ecosystems currently available
  • MCP-first posture makes it highly extensible
  • No provider markup keeps costs transparent
  • Now spans more than just the classic VS Code extension story
  • Large install base and community energy are real strengths
Weaknesses
  • Benchmark headline should be read with vendor-reporting caveats
  • Usage costs still depend on the model/provider you choose
  • Open flexibility can mean more operational setup than tightly integrated products
  • Experience quality can vary more by configuration than in fixed-stack tools
Combat Verdict: Still one of the best open routes into serious coding agents, with the benchmark language now properly caveated.
Benchmark note: the 80.8% figure is widely cited by the project and should be read as vendor-reported for a specific model/configuration, not as a directly audited official product score.
Augment Code · Large-Codebase Specialist
Official docs checked · Vendor claims marked when relevant
Core idea Context Engine
Surfaces IDE → CLI → review
Security SOC 2 Type II · ISO 42001
Plans Indie / Standard / Max
Focus large codebases
Augment Code still reads as a serious large-codebase specialist, but the pricing and plan language in the original version was out of date. This pass aligns the card to current public tiers and emphasizes the more durable product claims: Context Engine, broad workflow coverage from IDE to review, and enterprise-oriented security posture. It remains especially relevant when ordinary editor agents lose the thread in very large repos.
Strengths
  • Context Engine is still one of the strongest product stories for very large codebases
  • Covers more of the engineering workflow than “just an IDE extension”
  • Enterprise security/compliance posture is explicitly part of the product story
  • VS Code and JetBrains support keeps it relevant to mixed teams
Weaknesses
  • Current public tiers are no longer the simple free-to-$50 story the original draft described
  • Value depends heavily on whether your repos are truly large/complex enough to need it
  • Teams should pilot performance on their own codebase rather than trusting category reputation alone
Combat Verdict: Still a strong specialist for very large repos, now with corrected plan framing and less hype-driven language.
Tabnine · Enterprise / Air-Gap Specialist
Official docs checked · Vendor claims marked when relevant
Deployment Cloud / On-Prem / Air-Gap
Compliance SOC 2, GDPR, HIPAA
Custom Training Train on your codebase
Free Tier Basic completions
Note *S-Tier for compliance-regulated orgs
Tabnine remains important because deployment flexibility is still rare in this market. The original draft overstated the “only option” angle; this revision softens that to the more defensible claim that Tabnine is one of the few mainstream coding-AI platforms seriously built around air-gapped, VPC, and on-prem deployment patterns. That keeps it squarely relevant for regulated environments even when raw frontier capability is not top-tier.
Strengths
  • Very strong deployment flexibility across SaaS, VPC, on-prem, and air-gapped modes
  • Enterprise context and governance story is central rather than incidental
  • Useful when data residency matters more than absolute frontier performance
  • Broad IDE coverage helps in conservative enterprise environments
Weaknesses
  • Capability ceiling can trail frontier cloud-first rivals
  • Pricing and procurement are more enterprise-shaped than individual-friendly
  • “Best” depends almost entirely on the compliance problem you are solving
Combat Verdict: A niche leader rather than a universal one. For strict residency and air-gap requirements, it stays highly relevant.
Continue · Open-Source Agent + PR Checks
Official docs checked · Vendor claims marked when relevant
License open source
New angle AI checks on every PR
CLI cn coding agent
Local works with Ollama/local models
Fit policy + customization
Continue needed a framing update more than a ranking update. The old copy described it mostly as a local-model IDE extension. That is still true, but the product has broadened: Continue now pushes an open coding agent/CLI and source-controlled AI checks on pull requests. That makes it more relevant for teams who want policy, review automation, and self-hosted or customizable AI infrastructure rather than just a chat sidebar.
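One hypothetical wiring of that PR-check idea is a CI job that runs the `cn` agent against each pull request. The workflow file, install step, flags, and secret name below are illustrative sketches, not Continue's documented syntax:

```yaml
# .github/workflows/ai-review.yml -- hypothetical example, not official Continue config
name: ai-review
on: pull_request
jobs:
  review:
    runs-on: ubuntu-latest
    steps:
      - uses: actions/checkout@v4
      - name: Install Continue CLI        # package name assumed from public docs
        run: npm install -g @continuedev/cli
      - name: Run AI check on the PR      # prompt and flags illustrative; verify against current docs
        env:
          CONTINUE_API_KEY: ${{ secrets.CONTINUE_API_KEY }}   # secret name illustrative
        run: cn -p "Review the changes in this pull request for bugs and policy violations"
```

Because the workflow file itself lives in the repo, the check is versioned and reviewable like any other code, which is the point of the source-controlled approach.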
Strengths
  • Strong open-source customization story for teams that want policy and repeatability
  • Good fit for local models, self-hosting, and controlled environments
  • PR-check workflow makes it more than just an editor add-on
  • CLI plus editor support creates flexible deployment patterns
Weaknesses
  • Less turnkey than the most polished commercial tools
  • Best results often require more active setup and configuration
  • Local-model-first usage still involves a capability trade-off
Combat Verdict: A better fit for customizable workflows and controlled environments than the original card suggested.
Amazon Q Developer (AWS) · AWS-Native Coding Agent
Official docs checked · Vendor claims marked when relevant
AWS context IAM · CDK · CloudFormation · Lambda
Free tier 50 agent chats/mo
Transformation up to 1,000 lines/mo free
Pro $19/user/mo
Fit AWS-heavy teams
Amazon Q Developer understands CloudFormation, Lambda, CDK, and IAM in a way that general-purpose agents don't have baked in. At $19/user/month it undercuts GitHub Copilot Business, and the free tier provides 50 agentic chats per month for evaluation. The excellence is narrow, though: performance is deep on AWS-specific work and drops off sharply for non-AWS codebases or general programming tasks.
Strengths
  • Native AWS context: IAM, CDK, CloudFormation, Lambda — unmatched
  • $19/user/mo — undercuts Copilot Business
  • Free tier: 50 agentic chats/month
  • Deep integration with AWS services ecosystem
Weaknesses
  • General coding capability mediocre outside AWS context
  • Very narrow excellence — not a general-purpose replacement
  • Limited IDE support compared to Copilot
Combat Verdict: A-tier in AWS-heavy environments, and still much less compelling as a broad all-purpose coding agent.
Junie (JetBrains) · JetBrains-Native Agent
Official docs checked · Vendor claims marked when relevant
Supported IDEs IntelliJ · PyCharm · WebStorm · GoLand + more
Access via JetBrains AI plans
Providers OpenAI · Anthropic · Google · xAI · OpenRouter
Mode JetBrains-native agent
Status maturing fast
Junie is more credible than the original card made it sound, but it also was not accurately priced. Junie is no longer best described as “bundled with JetBrains subscriptions”; it now sits inside JetBrains AI plan tiers and can also work with several BYOK providers. That makes the story clearer: this is JetBrains’ native answer for users who want to stay inside JetBrains products without installing a separate editor brand.
Strengths
  • Native JetBrains experience with no editor migration required
  • Supports multiple frontier providers through JetBrains AI and BYOK paths
  • Broad coverage across major JetBrains IDEs
  • Natural option for teams already standardized on JetBrains tooling
Weaknesses
  • Still catching up to the most mature specialist editor agents
  • Packaging has changed enough that teams need to verify current plan fit
  • Best value depends on how much you already rely on JetBrains AI more broadly
Combat Verdict: A legitimate JetBrains-native option, with the biggest fix here being the corrected pricing and access model.
Kilo Code · Open-Source Multi-Surface Agent
Official docs checked · Vendor claims marked when relevant
Kilo Code had an incomplete card before. It is not just a lightweight VS Code extension: current positioning spans JetBrains and CLI surfaces as well, with multiple agent modes and optional cloud/platform layers. That still does not move it above Cline in this ranking, but it makes the product more complete and more interesting than the original text implied.
Strengths
  • Extremely broad model routing and provider flexibility
  • Open-source base with multiple surfaces, not just a single extension story
  • Useful for users who want experimentation and breadth over curation
Weaknesses
  • Still earlier and less proven than stronger open-source rivals
  • Breadth can come at the expense of clear opinionated workflow design
  • Most users will not need the full model-selection sprawl it exposes
Combat Verdict: More capable and broader than the original card suggested, but still a watchlist-tier choice beside stronger open-source leaders.
Browser / Vibe Coding Tier

⚡ VIBE CODING TIER — Different Category, Different Rules

These tools compete with each other, not with Claude Code or Cursor. They serve non-developers, rapid prototypers, and designers. Judged on: zero-to-deployed speed, UX, and target audience. Not benchmarked against SWE-Bench.

Bolt.new · Browser App Builder
Official docs checked · Vendor claims marked when relevant
Bolt.new remains one of the fastest browser-based idea-to-app tools. The main audit change is the free-tier wording: current public messaging emphasizes both a daily and monthly token limit rather than just “1M per month.” The overall conclusion stands: excellent for fast prototypes, weaker once a project becomes a long-lived custom codebase.
Strengths
  • Zero local setup — browser-native full Node.js stack
  • Free token allowance (daily and monthly limits) is genuinely useful for prototyping
  • Fastest zero-to-deployed for simple web apps
Weaknesses
  • Not for complex, custom codebases
  • Non-trivial projects quickly hit quality ceiling
  • Limited custom configuration vs. real IDEs
Combat Verdict: Still one of the fastest prototype builders in the browser, with the free-tier wording corrected.
Official docs checked · Vendor claims marked when relevant
Lovable specializes in React + Supabase full-stack apps generated from natural language. Strong UI quality and database integration. Synchronous — you're involved after each prompt. Popular with founders building MVPs without engineering teams. Best compared to Bolt.new: Lovable has stronger default UI aesthetics; Bolt.new has broader stack support.
Strengths
  • Strong default UI quality for generated apps
  • React + Supabase integration well-optimized
  • Popular with non-technical founders
Weaknesses
  • Synchronous — you must supervise each prompt
  • Less flexible than Bolt.new for custom stacks
  • Quality ceiling hit quickly on complex features
Combat Verdict: Still a strong React/Supabase vibe-coding pick for founders shipping MVPs.
Official docs checked · Vendor claims marked when relevant
Replit stays relevant because it is still the most complete general browser IDE in this page. This pass slightly tightens the pricing and plan language: Core now starts around $20/month on annual billing, with a higher month-to-month path. The wider read is unchanged: more complete environment than pure vibe builders, less specialized than the strongest narrow app generators.
Strengths
  • 50+ languages in one browser environment
  • Real terminal + database + deployment bundled
  • Best for education and learners
  • Generous free tier
Weaknesses
  • AI assistance less opinionated than specialized vibe coders
  • Performance limited by browser/cloud infrastructure
Combat Verdict: The education and learner platform. Not the vibe-coding front-runner, but the most complete browser development environment for students and quick experiments.
Official docs checked · Vendor claims marked when relevant
v0 remains a narrow but extremely effective tool. The main update here is pricing/plan clarity. The product story is still the same: not a general coding agent, but one of the best ways to turn prompts into polished React/Tailwind/shadcn-style UI and then move the result into a real application.
Strengths
  • Best React + Tailwind + shadcn/ui component quality
  • Perfect bridge from design to Next.js code
  • Tight Vercel ecosystem integration
Weaknesses
  • Narrow scope — UI components, not full apps
  • Assumes React/Tailwind/Vercel stack
Combat Verdict: The best React UI component generator alive. Narrow scope, exceptional execution. Vercel/Next.js developers should use this before reaching for anything else for UI scaffolding.
Official docs checked · Vendor claims marked when relevant
Google Opal is still too early for a confident product ranking, but the card now reflects what is easier to verify publicly: Opal is a Google Labs tool for building, editing, and sharing AI mini-apps with natural language, it has expanded availability beyond a tiny pilot, and Google has been adding more dynamic “agent step” workflow capabilities. That makes it worth watching as a lightweight app-builder surface, not yet something to rank as a mature engineering platform.
Strengths
  • Google backing and fast iteration pace make it worth monitoring
  • Natural-language mini-app construction has clear non-developer appeal
  • Agent-step workflow additions make it more than a static toy
Weaknesses
  • Still early and not mature enough for a strong engineering recommendation
  • Positioning relative to Google’s other coding/build products is still evolving
Combat Verdict: A credible watchlist entry, not yet a stable ranked choice.
Head-to-Head
FEATURE MATRIX

This matrix is intentionally coarse. It is for narrowing candidates, not settling edge-case procurement decisions. Product pages should still be checked before buying or standardizing.

Tool (Company) | Category | SWE-Bench | Free | Async | Multi-IDE | Terminal | MCP | Multi-Agent | Context | Tier
Claude Code (Anthropic) | Terminal/CLI | 80.9% (official agent result) | No free tier | Sync-first | IDE + app | Up to 1M | S
Cline (OSS) | Extension / OSS | 80.8% (vendor-reported) | ✓ BYOK | Sync-first | VS Code + JB + CLI | via CLI / tools | depends on setup | Model-dependent | A
Codex (OpenAI) | CLI / app / web | 77.3% (Terminal-Bench 2.0) | Included on plans | Some async features | editor + app | partial | Model-dependent | A
Cursor (Anysphere) | AI IDE | N/A public | Hobby free | Mostly sync | VS Code family | Not terminal-first | Project-scoped | A
Windsurf (Cognition) | AI IDE | 40.08% (vendor model result) | Free tier | Mostly sync | ✓ broad IDE support | Not terminal-first | Persistent project memory | A
GitHub Copilot (GitHub) | Extension | N/A public | ✓ Free tier | some agent flows | ✓ broad IDE support | CLI support | basic-to-mid | Standard project scope | B
Jules (Google Labs) | Cloud / Async | N/A public | Beta free limits | ✓ async-first | GitHub-centered | CLI/API adj. | N/A focus | parallel queues | Task scoped | B
Gemini CLI (Google) | Terminal/CLI | Gemini 3: 76.2% (model result) | ✓ generous entry | Sync-first | CLI + Cloud Shell | tool-dependent | 1M | B
Devin (Cognition) | Cloud / Autonomous | N/A public | Paid usage | Cloud only | sandbox terminal | Not core story | Repo/task scoped | B
Aider (OSS) | Terminal/CLI | N/A public | ✓ BYOK | Sync-first | terminal only | tooling-dependent | No native multi-agent | Model-dependent | B
OpenCode (OSS) | Terminal/CLI | N/A public | ✓ BYOK | Sync-first | terminal + managed path | varies | emerging | Model-dependent | B
Antigravity (Google) | Preview IDE | Gemini 3 context only (not product score) | ✓ preview | evolving | preview surface | not terminal-first | evolving | unknown | Large-model context | B
Zed (Zed) | AI IDE | N/A public | ✓ free editor | Sync-first | cross-platform editor | not terminal-first | partial | No | Project/editor scoped | B
Augment Code | Extension / enterprise | N/A public | Paid plans | Mostly sync | VS Code + JetBrains | CLI + review flows | task/orchestration | Context Engine | B
Tabnine | Extension / enterprise | N/A public | Enterprise-oriented | Mostly sync | broad IDE support | No | No | No | Enterprise context | C*
Continue (OSS) | Extension / CLI | N/A public | ✓ self-host/open | depends on setup | VS Code + JetBrains | CLI available | No native MCP focus | policy/check based | Model/self-hosted | B
Amazon Q Developer (AWS) | Extension | N/A public | ✓ free tier | Mostly sync | limited | not terminal-first | No | No | AWS-native context | C*
Junie (JetBrains) | JetBrains-native | N/A public | via AI plans | Mostly sync | JetBrains native | not terminal-first | partial | evolving | IDE/project scoped | C
Kilo Code (OSS) | Extension / CLI | N/A public | ✓ OSS/BYOK | Mostly sync | VS Code + JB + CLI | CLI | varies | evolving | Model-dependent | C
— BROWSER / VIBE BUILDER TIER — separate comparison axis —
Bolt.new (StackBlitz) | Browser builder | N/A | ✓ free tier | Interactive | browser only | browser terminal | No | No | Prompt/app scoped | V
Lovable | Browser builder | N/A | ✓ free tier | Interactive | browser only | No | No | No | Prompt/app scoped | V
Replit | Browser IDE | N/A | ✓ free tier | long-running cloud tasks | browser only | browser terminal | No | platform features | Workspace scoped | V
v0 (Vercel) | Browser/UI | N/A | ✓ free tier | Interactive | browser only | No | No | No | UI/component scoped | V
Google Opal (Google) | Browser mini-apps | N/A | ✓ preview | Interactive | browser only | No | No | agent steps emerging | Mini-app scoped | V

C* = can rise dramatically for compliance-driven orgs. This table is deliberately lossy: it compresses packaging, deployment, and maturity into one view so you can narrow choices fast.
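One practical way to use a deliberately lossy matrix like this is to encode its rows and filter on hard constraints before weighing softer judgments. A sketch with a handful of rows compressed to booleans (simplified from the table above; verify against product pages before deciding):

```python
from dataclasses import dataclass

@dataclass
class Tool:
    name: str
    category: str
    free: bool       # usable free tier or BYOK path
    async_ok: bool   # supports async / queued task flows
    terminal: bool   # terminal-first or strong CLI surface

# A few rows from the matrix, compressed to booleans.
TOOLS = [
    Tool("Claude Code", "Terminal/CLI", free=False, async_ok=False, terminal=True),
    Tool("Cline", "Extension / OSS", free=True, async_ok=False, terminal=True),
    Tool("Jules", "Cloud / Async", free=True, async_ok=True, terminal=False),
    Tool("Gemini CLI", "Terminal/CLI", free=True, async_ok=False, terminal=True),
    Tool("Aider", "Terminal/CLI", free=True, async_ok=False, terminal=True),
]

def shortlist(tools, **required):
    """Keep only tools whose attributes match every required=value pair."""
    return [t.name for t in tools
            if all(getattr(t, k) == v for k, v in required.items())]

print(shortlist(TOOLS, free=True, terminal=True))  # ['Cline', 'Gemini CLI', 'Aider']
```

Hard constraints (budget, deployment surface) eliminate most of the field quickly; the remaining handful is where the per-card verdicts actually matter.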

Deployment Guide
PICK YOUR WEAPON

These picks were rewritten to emphasize actual deployment fit instead of absolutist winners. Several categories now reflect tied or conditional outcomes more honestly.

Complex Multi-File Refactors

Claude Code. Best public agent benchmark signal in this set, plus deep tool use and large-context reasoning. This is still the safest answer for hard, ambiguous engineering tasks.

Best IDE (VS Code users)

Cursor. Best all-around daily-driver experience for teams already happy in the VS Code ecosystem.

Best IDE (JetBrains users)

Still the strongest broad agentic option for JetBrains-heavy teams, though Junie is improving and worth evaluating if you want native JetBrains packaging.

Async Backlog Clearing

Jules. Still the clearest async specialist: queue a task, let it run in the cloud, and review the PR later.

Best Free CLI Option

Gemini CLI. Free entry, 1M context, open-source, multimodal, and now clearly stronger than "just the cheap option."

Best Value (Already on ChatGPT+)

Codex. Strong value if you already live on an eligible ChatGPT plan and want one agent that spans app, web, terminal, and editor.

Enterprise / Corporate Compliance

GitHub Copilot. Best for lowest-friction enterprise rollout. If the constraint is residency or air-gap rather than procurement, see Tabnine instead.

Air-Gap / Strict Data Residency

Tabnine. One of the few mainstream options seriously built for VPC, on-prem, and air-gapped deployment patterns.

Open Source / Full Transparency

Cline for extensible agent workflows, Aider for brutally clean git-native auditability.
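Aider's git-native workflow means every AI change lands as an ordinary commit; by default aider also appends "(aider)" to the author name on commits it writes, which makes the history auditable with plain git. A minimal sketch of that audit step (the sample log lines are hypothetical, and the marker is configurable if your setup differs):

```python
def ai_commits(log_lines: list[str], marker: str = "(aider)") -> list[str]:
    """Filter `git log --pretty='%h %an: %s'` output down to commits
    attributed to the AI. Aider appends '(aider)' to the author name
    by default; adjust the marker if your configuration differs."""
    return [line for line in log_lines if marker in line]

# Hypothetical history: one aider-authored commit, one hand-written one.
history = [
    "a1b2c3d Jane Doe (aider): fix null check in parser",
    "d4e5f6a Jane Doe: hand-written refactor",
]
print(ai_commits(history))  # only the (aider)-attributed line survives
```

This is the "auditability" claim in concrete form: AI contributions stay separable from human ones with nothing more exotic than a log filter.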

Fully Autonomous Delegation

Devin. Best when work is tightly scoped, reviewable, and cheap to verify after the fact.

Large Codebase (100K+ files)

Augment Code. Its Context Engine remains one of the clearer product stories for very large repos where generic editor agents start losing the thread.

Speed (Iterative Vibe Coding)

Windsurf keeps the edge for teams that care about very fast iterative loops, with the caveat that its dramatic speed numbers are vendor-claimed.

Rapid MVP / Zero Setup

Bolt.new. Still one of the fastest browser-native ways to go from prompt to running app.

React/Next.js UI Components

v0. Still the cleanest specialized tool for polished React/Tailwind/shadcn-style UI scaffolding.

Power User Stack (2026)

Claude Code + (Cursor or Windsurf) + Jules
Heavy reasoning + daily-editor flow + async backlog clearing is still the most defensible modern stack pattern.
Final Verdict
THE WINNER
CHAMPION
OVERALL TECHNICAL WINNER — Q2 2026
Anthropic · Best hard-problem agent in this comparison

Claude Code still wins the technical-heavyweight lane. The strongest reason is the simplest one: it has the clearest top-end public agent benchmark signal in this page, and its product architecture still aligns best with difficult, ambiguous, multi-file work.

The more honest conclusion is that the market no longer has a single universal winner. Cursor and Windsurf can be better daily editors. Jules can be better for async backlog throughput. Codex can be the best value if you already live inside ChatGPT. Gemini CLI is the best free terminal on-ramp. Tabnine can be the right answer when deployment policy dominates raw capability.

So the right takeaway is stack design, not tool worship. Pick one deep reasoning agent, one editor surface you actually enjoy living in, and one async or browser layer only if your workflow benefits from it.

If you must choose one flagship recommendation for hard engineering work, it is still Claude Code. The revised page simply makes the tradeoffs clearer, and it strips away a few market-share and ranking claims that were weaker than the rest of the evidence.

Primary references
AUDIT NOTES & SOURCE PATH

This pass prioritized official pricing pages, official product docs, official GitHub repositories, and first-party launch/update posts. Where a performance number came from a vendor rather than a neutral benchmark page, the wording was softened accordingly.

Google previews / watchlist