Technology RadarTechnology Radar

OpenAI Codex (Cloud Agent)

agentide
Trial

OpenAI Codex is a multi-surface coding agent — cloud web app, open-source CLI, IDE extension, and GitHub bot — powered by the GPT-5.x-Codex model family. Each cloud task runs in an air-gapped sandbox with no internet access, and deep GitHub integration enables issue-to-PR and automatic PR review workflows. More than a million developers use it weekly.

Architecture Deep Dive → OpenAI Codex Architecture Breakdown — three-layer design, agent loop with prompt construction, dual sandbox model (kernel-level CLI vs. air-gapped cloud), AGENTS.md configuration, GitHub integration, and comparison with Claude Code, Copilot, and Devin.

Note: This is not the original Codex from 2021 (the code completion model). This is the 2025 relaunch — a fully agentic product that reuses the Codex name.

Why It's in Trial

OpenAI re-entered the agentic coding space in 2025 with a product that competes directly with Claude Code, GitHub Copilot, and Devin. Usage grew 5x since January 2026 — one of OpenAI's fastest-growing products.

Key capabilities:

  • Parallel cloud tasks: Multiple sandboxed instances work simultaneously on different tasks in your codebase
  • GitHub integration: Tag @codex on any GitHub issue or PR to trigger the agent; opt in to automatic PR reviews on every new PR
  • AGENTS.md support: Open standard (Linux Foundation / AAIF) for guiding agent behaviour — works across Codex, Cursor, Copilot, Windsurf, Gemini CLI, and others
  • Air-gapped sandbox: Cloud containers have no internet access during task execution — only your repo and pre-installed dependencies
  • Open-source CLI: Apache-2.0 licensed, written in Rust, 67K+ GitHub stars — the CLI uses OS-level kernel enforcement (Landlock + seccomp on Linux, Seatbelt on macOS), making it the only major coding agent with kernel-level sandboxing enabled by default
  • Codex GitHub Action: Run Codex in CI/CD for automated code review, migrations, and quality gates

Trial rather than Adopt because Claude Code leads SWE-bench Verified (80.8% vs Codex's competitive but lower score), the Codex product line is still iterating rapidly, and AGENTS.md cross-tool portability is an advantage for multi-tool teams but not a settled pattern yet.

When Codex is the right choice: Teams already paying for ChatGPT (it's included), multi-tool shops that value AGENTS.md portability, and security-conscious teams that want kernel-level sandboxing by default.

When to look elsewhere: If you need the highest autonomous success rate (Claude Code), if you want an editor-integrated experience (Cursor), or if your team is primarily on GitHub and wants the tightest CI/CD integration (Copilot).

Key Characteristics

Property Value
Interface Cloud web, CLI, IDE extension, Desktop app, GitHub bot
Underlying model GPT-5.3-Codex (Cloud), configurable (CLI)
CLI language Rust (Apache-2.0)
Sandbox OS-level: Landlock + seccomp (Linux), Seatbelt (macOS)
Pricing Included with ChatGPT Plus ($20/mo), Pro ($200/mo), Team, Business, Enterprise
GitHub openai/codex
Website openai.com/codex
Developer docs developers.openai.com/codex

Codex Security (Research Preview)

OpenAI launched Codex Security in March 2026 (formerly known internally as Aardvark) — an AI security agent built into the Codex ecosystem. It runs a three-stage pipeline: identification (scans the repo and generates a project-specific threat model), validation (sandbox-tests findings to filter false positives), and remediation (generates a concrete patch for review).

In its first 30 days of beta, Codex Security scanned more than 1.2 million commits, identifying 792 critical and 10,561 high-severity findings. OpenAI reported an 84% reduction in overall noise and a 50% drop in false-positive rates versus earlier approaches. The tool also discovered 14 CVEs in foundational open-source projects including libssh, PHP, Chromium, OpenSSH, and GnuTLS. (The Hacker News, OpenAI announcement)

Available to ChatGPT Pro, Enterprise, Business, and Edu customers. Comes weeks after Anthropic launched the analogous Claude Code Security.

Further Reading

Trial

OpenAI Codex (2025 relaunch) is a cloud-based software engineering agent powered by codex-1 (a coding-optimised version of o3/GPT-5) that can handle complex tasks in parallel cloud sandboxes — with deep GitHub integration for issue-to-PR workflows.

Note: This is not the original Codex from 2021 (the code completion model). This is a new agentic product reusing the Codex name.

Why It's in Trial

OpenAI re-entered the agentic coding space seriously in 2025 with a product that competes directly with Claude Code and GitHub Copilot Workspace. Daily usage grew more than 10x since August 2025, and it's one of OpenAI's fastest-growing products.

Key capabilities:

  • Parallel tasks: Multiple sandboxed instances can work simultaneously on different tasks in your codebase
  • GitHub integration: Tag @codex on any GitHub issue or PR to trigger the agent
  • Automatic PR reviews: Opt in to have Codex automatically review every new PR
  • AGENTS.md support: Place guidelines in your repo to control how the agent reviews and modifies code
  • Air-gapped sandbox: The cloud container has no internet access during task execution — it can only access your repo and pre-installed dependencies (a security feature)

Access Points

  • Codex Cloud: Web interface at openai.com/codex
  • Codex CLI: npm install -g @openai/codex — runs locally in your terminal
  • IDE extension: Available for VS Code, Cursor, and VS Code forks
  • SDK: TypeScript SDK for embedding in your own tools

Included With

ChatGPT Plus, Pro, Business, Edu, and Enterprise subscriptions.

Key Characteristics

Property Value
Interface Web, CLI, IDE extension
Underlying model codex-1 (o3/GPT-5 variant optimised for coding)
Strengths GitHub integration, parallel execution, PR review
Provider OpenAI
Announcement Introducing Codex
Website openai.com/codex