One-Line Summary: As of mid-2026 the agent-harness market has split into roughly four categories — coding-IDE harnesses, terminal coding harnesses, orchestration platforms, and headless/agentic-OS harnesses — each represented by 2–4 dominant products with overlapping but distinct positioning.

Prerequisites: What is an AI harness, harness vs. framework vs. SDK

What Is the Harness Landscape?

A year ago this was a list of curiosities. By 2026 it is a market. Roughly:

  • Coding-IDE harnesses — Cursor, Windsurf, Zed-with-AI, Replit Agent. Live inside an editor. Optimized for keystrokes-per-task and tight feedback loops.
  • Terminal coding harnesses — Claude Code, Codex CLI, Aider, Continue. Live in $SHELL. Optimized for repository-scale changes, hooks, and CI integration.
  • Orchestration platforms — ruflo, OpenHands, AutoGPT-X, Devin-class. Run multiple agents over long horizons. Optimized for autonomy and persistent goals.
  • Headless / agentic-OS harnesses — emerging category running agents on their own machines or in dedicated VMs. SWE-agent, Devbox-style products, AGI-OS prototypes.

Frontier labs increasingly ship a first-party harness alongside their model: Anthropic ships Claude Code, OpenAI ships Codex CLI, and Google ships ADK plus Gemini CLI. This co-shipping is not a coincidence — the harness is where the user actually meets the model and where retention is won.

How to Read the Landscape

Three axes capture most of the differences:

  1. Where it lives — IDE, terminal, browser, headless.
  2. How autonomous — single-step (you confirm everything) → autopilot (the harness runs without you).
  3. Single- vs multi-agent — does it spawn sub-agents, run swarms, federate across machines.

A harness sits at one point on each axis. Cursor: IDE / single-step / single-agent. Claude Code: terminal / single-step-or-autopilot / single+sub-agents. Ruflo: terminal-or-headless / autopilot / multi-agent + federation.

Why It Matters

Choosing a harness is a stack decision. Switching costs are real — your .cursorrules are not your CLAUDE.md are not your .ruflo/agents. Plugins, slash commands, hooks, sub-agent definitions are mostly per-harness. MCP partly mitigates this on the tool side, but configuration, memory, and topology stay locked in.

Key Technical Details

  • Native model: Cursor and Continue are model-agnostic; Claude Code is Anthropic-first; Codex CLI is OpenAI-first; ruflo is Claude-first but multi-provider.
  • Open-source vs proprietary: Aider, OpenHands, Continue are open. Claude Code, Codex CLI, Cursor are closed.
  • Pricing models: Subscription (Cursor, Replit), per-token (Claude Code, Codex CLI), free + BYO key (Aider, Continue, OpenHands), platform (ruflo).
  • Hook/plugin ecosystem maturity (subjective, May 2026): Claude Code > ruflo > Codex CLI > Cursor > rest.
  • MCP support: Universal among 2026 harnesses; the lone outliers are early-stage open projects.
  • Multi-agent: ruflo and OpenHands are first-class; Claude Code supports sub-agents but not swarms; Cursor and Codex CLI are mostly single-agent.

How Harnesses & Frameworks Implement This

CategoryExamplesBest at
IDE-codingCursor, Windsurf, Zed-AI, Replit AgentTight inner loop, editing UX
Terminal-codingClaude Code, Codex CLI, Aider, ContinueRepo-scale changes, CI, scripting
Orchestrationruflo, OpenHands, Devin-class, AutoGPT-XLong-horizon autonomy
Headless / agentic-OSSWE-agent, AGI-OS prototypesBackground agents, autonomous workflows

Connections to Other Concepts

  • claude-code-as-harness.md, codex-cli-and-cursor-as-harnesses.md, ruflo-architecture-tour.md — The reference harnesses, expanded.
  • claude-code-vs-codex-vs-cursor.md — Side-by-side IDE-vs-terminal comparison.
  • langgraph-vs-autogen-vs-crewai.md — The framework counterpart.
  • choosing-your-harness-stack.md — Capstone decision framework.

Further Reading

  • Codex Blog, "Claude Multi-Agent Ecosystem" (2026) — A useful third-party survey.
  • The maintainers' READMEs of each listed harness — the most current positioning is always there.