OpenAI Codex (Cloud Agent)

Mar 2026

Trial

OpenAI Codex is a multi-surface coding agent — cloud web app, open-source CLI, IDE extension, and GitHub bot — powered by the GPT-5.x-Codex model family. Each cloud task runs in an air-gapped sandbox with no internet access, and deep GitHub integration enables issue-to-PR and automatic PR review workflows. More than a million developers use it weekly.

Architecture Deep Dive → OpenAI Codex Architecture Breakdown — three-layer design, agent loop with prompt construction, dual sandbox model (kernel-level CLI vs. air-gapped cloud), AGENTS.md configuration, GitHub integration, and comparison with Claude Code, Copilot, and Devin.

Note: This is not the original Codex from 2021 (the code completion model). This is the 2025 relaunch — a fully agentic product that reuses the Codex name.

Why It's in Trial

OpenAI re-entered the agentic coding space in 2025 with a product that competes directly with Claude Code, GitHub Copilot, and Devin. Usage grew 5x since January 2026 — one of OpenAI's fastest-growing products.

Key capabilities:

Parallel cloud tasks: Multiple sandboxed instances work simultaneously on different tasks in your codebase
GitHub integration: Tag @codex on any GitHub issue or PR to trigger the agent; opt in to automatic PR reviews on every new PR
AGENTS.md support: Open standard (Linux Foundation / AAIF) for guiding agent behaviour — works across Codex, Cursor, Copilot, Windsurf, Gemini CLI, and others
Air-gapped sandbox: Cloud containers have no internet access during task execution — only your repo and pre-installed dependencies
Open-source CLI: Apache-2.0 licensed, written in Rust, 67K+ GitHub stars — the CLI uses OS-level kernel enforcement (Landlock + seccomp on Linux, Seatbelt on macOS), making it the only major coding agent with kernel-level sandboxing enabled by default
Codex GitHub Action: Run Codex in CI/CD for automated code review, migrations, and quality gates

Trial rather than Adopt because Claude Code leads SWE-bench Verified (80.8% vs Codex's competitive but lower score), the Codex product line is still iterating rapidly, and AGENTS.md cross-tool portability is an advantage for multi-tool teams but not a settled pattern yet.

When Codex is the right choice: Teams already paying for ChatGPT (it's included), multi-tool shops that value AGENTS.md portability, and security-conscious teams that want kernel-level sandboxing by default.

When to look elsewhere: If you need the highest autonomous success rate (Claude Code), if you want an editor-integrated experience (Cursor), or if your team is primarily on GitHub and wants the tightest CI/CD integration (Copilot).

Key Characteristics

Property	Value
Interface	Cloud web, CLI, IDE extension, Desktop app, GitHub bot
Underlying model	GPT-5.3-Codex (Cloud), configurable (CLI)
CLI language	Rust (Apache-2.0)
Sandbox	OS-level: Landlock + seccomp (Linux), Seatbelt (macOS)
Pricing	Included with ChatGPT Plus ($20/mo), Pro ($200/mo), Team, Business, Enterprise
GitHub	openai/codex
Website	openai.com/codex
Developer docs	developers.openai.com/codex

Codex Security (Research Preview)

OpenAI launched Codex Security in March 2026 (formerly known internally as Aardvark) — an AI security agent built into the Codex ecosystem. It runs a three-stage pipeline: identification (scans the repo and generates a project-specific threat model), validation (sandbox-tests findings to filter false positives), and remediation (generates a concrete patch for review).

In its first 30 days of beta, Codex Security scanned more than 1.2 million commits, identifying 792 critical and 10,561 high-severity findings. OpenAI reported an 84% reduction in overall noise and a 50% drop in false-positive rates versus earlier approaches. The tool also discovered 14 CVEs in foundational open-source projects including libssh, PHP, Chromium, OpenSSH, and GnuTLS. (The Hacker News, OpenAI announcement)

Available to ChatGPT Pro, Enterprise, Business, and Edu customers. Comes weeks after Anthropic launched the analogous Claude Code Security.

Why It's in Trial

OpenAI re-entered the agentic coding space seriously in 2025 with a product that competes directly with Claude Code and GitHub Copilot Workspace. Daily usage grew more than 10x since August 2025, and it's one of OpenAI's fastest-growing products.

Key capabilities:

Parallel tasks: Multiple sandboxed instances can work simultaneously on different tasks in your codebase
GitHub integration: Tag @codex on any GitHub issue or PR to trigger the agent
Automatic PR reviews: Opt in to have Codex automatically review every new PR
AGENTS.md support: Place guidelines in your repo to control how the agent reviews and modifies code
Air-gapped sandbox: The cloud container has no internet access during task execution — it can only access your repo and pre-installed dependencies (a security feature)

Access Points

Codex Cloud: Web interface at openai.com/codex
Codex CLI: npm install -g @openai/codex — runs locally in your terminal
IDE extension: Available for VS Code, Cursor, and VS Code forks
SDK: TypeScript SDK for embedding in your own tools

Included With

ChatGPT Plus, Pro, Business, Edu, and Enterprise subscriptions.

Key Characteristics

Property	Value
Interface	Web, CLI, IDE extension
Underlying model	codex-1 (o3/GPT-5 variant optimised for coding)
Strengths	GitHub integration, parallel execution, PR review
Provider	OpenAI
Announcement	Introducing Codex
Website	openai.com/codex

OpenAI Codex (Cloud Agent)

Why It's in Trial

Key Characteristics

Codex Security (Research Preview)

Further Reading

Why It's in Trial

Access Points

Included With

Key Characteristics