OpenHands

agent open-source multi-agent

This item was not updated in last three versions of the Radar. Should it have appeared in one of the more recent editions, there is a good chance it remains pertinent. However, if the item dates back further, its relevance may have diminished and our current evaluation could vary. Regrettably, our capacity to consistently revisit items from past Radar editions is limited.

Mar 2026

Trial

OpenHands (formerly OpenDevin) is the leading open-source autonomous software engineering agent — achieving 72% on SWE-bench Verified (resolving real GitHub issues) and shipping a production-ready SDK in late 2025. With 64K+ GitHub stars and an MIT licence, it is the top open-source alternative to commercial agents like Devin.

Architecture Deep Dive → OpenHands Architecture Breakdown — event-sourced SDK design, CodeAct execution paradigm, Docker sandbox isolation, and multi-agent orchestration patterns.

Why It's in Trial

OpenHands crossed from research project to production platform in 2025. The key shift was the V1 SDK release — a modular, stateless, event-sourced architecture designed for running thousands of agents concurrently in the cloud rather than just locally.

What makes it stand out:

SWE-bench Verified: 72% resolution rate using Claude Sonnet 4.5 with extended thinking — top-tier performance on the industry's standard benchmark for autonomous coding agents
Model agnostic: Bring any LLM — Claude, GPT-5-Codex, Gemini, or local models via Ollama
Sandboxed execution: Agents run inside Docker containers — file writes, shell commands, and web access are isolated from your host system
Web UI + GitHub integration: Resolve GitHub issues directly from the UI; no terminal required
AMD partnership: Optimised for running on AMD Radeon workstations with open-source local models — a strong option for air-gapped/on-premises deployments
MIT licence: Full auditability, no proprietary components in the core

The Production SDK (V1)

The V1 SDK is composable: four packages — SDK, Tools, Workspace, Server — with clear separation. The event-sourced state model means a crashed agent can be resumed from any point in its execution history. This matters at scale: when running 100 parallel agents, individual failures shouldn't require manual recovery.

Compared to Devin

	OpenHands	Devin
Licence	MIT (open source)	Proprietary SaaS
Pricing	Free (bring your own LLM API)	~$500/seat/month
SWE-bench Verified	72%	~13% (original claim)
Self-hosted	✅ Yes	❌ No
Model choice	Any	Cognition's models

When OpenHands Struggles

Large monorepos with complex build systems. OpenHands clones your repo into a Docker container and works from there. For repos with multi-minute build times or complex dependency graphs (Bazel, Gradle multi-project), the agent spends significant time and tokens on build failures that a human would avoid by knowing the local environment. Claude Code's "work in your environment" approach handles this better because it inherits your local toolchain.
Tasks requiring deep domain context. The 72% SWE-bench score means 28% of real GitHub issues are not resolved. The failures tend to cluster on tasks requiring understanding of project-specific conventions, implicit architectural constraints, or multi-file coordination across distant parts of the codebase. If your issues routinely require reading 10+ files to understand context, expect lower success rates.
Ops overhead of self-hosting. "Free" is misleading — you need to provision Docker infrastructure, manage API keys, monitor agent runs, and handle the inevitable cases where an agent gets stuck in a loop burning tokens. For small teams without DevOps capacity, Devin's managed SaaS may actually be cheaper when you account for ops time.

For Engineering Managers

OpenHands is the answer to "can we get Devin-level capabilities without the $500/seat cost?" For teams willing to host it on their own infrastructure and bring their own LLM API keys, the performance and functionality are comparable at a fraction of the price. But factor in the ops cost of self-hosting — if your team doesn't have Docker infrastructure expertise, the "free" label is deceptive.

Getting Started

docker pull ghcr.io/all-hands-ai/openhands:main

docker run -it --rm \
  -e LLM_API_KEY=your-anthropic-key \
  -e LLM_MODEL=claude-sonnet-4-6 \
  -p 3000:3000 \
  ghcr.io/all-hands-ai/openhands:main

Open http://localhost:3000 and start assigning tasks.

Key Characteristics

Property	Value
Interface	Web UI + API
Licence	MIT
Execution	Docker sandboxes
SWE-bench Verified	72% (Claude Sonnet 4.5)
Provider	All Hands AI (community)
Website	all-hands.dev
GitHub	All-Hands-AI/OpenHands