One-Line Summary: As of mid-2026 the agent-harness market has split into roughly four categories — coding-IDE harnesses, terminal coding harnesses, orchestration platforms, and headless/agentic-OS harnesses — each represented by 2–4 dominant products with overlapping but distinct positioning.
Prerequisites: What is an AI harness, harness vs. framework vs. SDK
What Is the Harness Landscape?
A year ago this was a list of curiosities. By 2026 it is a market. Roughly:
- Coding-IDE harnesses — Cursor, Windsurf, Zed-with-AI, Replit Agent. Live inside an editor. Optimized for keystrokes-per-task and tight feedback loops.
- Terminal coding harnesses — Claude Code, Codex CLI, Aider, Continue. Live in
$SHELL. Optimized for repository-scale changes, hooks, and CI integration. - Orchestration platforms — ruflo, OpenHands, AutoGPT-X, Devin-class. Run multiple agents over long horizons. Optimized for autonomy and persistent goals.
- Headless / agentic-OS harnesses — emerging category running agents on their own machines or in dedicated VMs. SWE-agent, Devbox-style products, AGI-OS prototypes.
Frontier labs increasingly ship a first-party harness alongside their model: Anthropic ships Claude Code, OpenAI ships Codex CLI, and Google ships ADK plus Gemini CLI. This co-shipping is not a coincidence — the harness is where the user actually meets the model and where retention is won.
How to Read the Landscape
Three axes capture most of the differences:
- Where it lives — IDE, terminal, browser, headless.
- How autonomous — single-step (you confirm everything) → autopilot (the harness runs without you).
- Single- vs multi-agent — does it spawn sub-agents, run swarms, federate across machines.
A harness sits at one point on each axis. Cursor: IDE / single-step / single-agent. Claude Code: terminal / single-step-or-autopilot / single+sub-agents. Ruflo: terminal-or-headless / autopilot / multi-agent + federation.
Why It Matters
Choosing a harness is a stack decision. Switching costs are real — your .cursorrules are not your CLAUDE.md are not your .ruflo/agents. Plugins, slash commands, hooks, sub-agent definitions are mostly per-harness. MCP partly mitigates this on the tool side, but configuration, memory, and topology stay locked in.
Key Technical Details
- Native model: Cursor and Continue are model-agnostic; Claude Code is Anthropic-first; Codex CLI is OpenAI-first; ruflo is Claude-first but multi-provider.
- Open-source vs proprietary: Aider, OpenHands, Continue are open. Claude Code, Codex CLI, Cursor are closed.
- Pricing models: Subscription (Cursor, Replit), per-token (Claude Code, Codex CLI), free + BYO key (Aider, Continue, OpenHands), platform (ruflo).
- Hook/plugin ecosystem maturity (subjective, May 2026): Claude Code > ruflo > Codex CLI > Cursor > rest.
- MCP support: Universal among 2026 harnesses; the lone outliers are early-stage open projects.
- Multi-agent: ruflo and OpenHands are first-class; Claude Code supports sub-agents but not swarms; Cursor and Codex CLI are mostly single-agent.
How Harnesses & Frameworks Implement This
| Category | Examples | Best at |
|---|---|---|
| IDE-coding | Cursor, Windsurf, Zed-AI, Replit Agent | Tight inner loop, editing UX |
| Terminal-coding | Claude Code, Codex CLI, Aider, Continue | Repo-scale changes, CI, scripting |
| Orchestration | ruflo, OpenHands, Devin-class, AutoGPT-X | Long-horizon autonomy |
| Headless / agentic-OS | SWE-agent, AGI-OS prototypes | Background agents, autonomous workflows |
Connections to Other Concepts
claude-code-as-harness.md,codex-cli-and-cursor-as-harnesses.md,ruflo-architecture-tour.md— The reference harnesses, expanded.claude-code-vs-codex-vs-cursor.md— Side-by-side IDE-vs-terminal comparison.langgraph-vs-autogen-vs-crewai.md— The framework counterpart.choosing-your-harness-stack.md— Capstone decision framework.
Further Reading
- Codex Blog, "Claude Multi-Agent Ecosystem" (2026) — A useful third-party survey.
- The maintainers' READMEs of each listed harness — the most current positioning is always there.