OpenAI launches GPT-5.5 and unified desktop agent application
OpenAI released its GPT-5.5 frontier model alongside a new unified desktop application that merges ChatGPT, Codex, and Atlas for agentic workflows.
On April 23, 2026, OpenAI released GPT-5.5, a new frontier model optimized for autonomous multi-step tasks. The rollout includes a unified desktop application that merges ChatGPT, Codex, and Atlas into a single workspace. This architectural shift enables the model to maintain context across web browsing, writing code, and automating general software operations.
Model Variants and Pricing
OpenAI introduced two primary model tiers alongside a specialized processing mode called GPT-5.5 Thinking. The standard GPT-5.5 model is available immediately to ChatGPT Plus, Pro, Business, and Enterprise users. The higher-performance GPT-5.5 Pro utilizes parallel test-time compute for complex data science and legal research, restricted to the Pro ($200 per month), Business, and Enterprise tiers.
API access is scheduled for a future update. The general API supports a 1 million token context window.
| Model | Input Price (per 1M tokens) | Output Price (per 1M tokens) |
|---|---|---|
| GPT-5.5 | $5.00 | $30.00 |
| GPT-5.5 Pro | $30.00 | $180.00 |
Unified Desktop Application
The launch of the desktop client centralizes OpenAI’s previous standalone tools. By combining ChatGPT with the Codex coding agent and the Atlas browser agent, developers can execute cross-application workflows without manually moving data between windows. Within the Codex environment, the model operates with a restricted 400,000 token context window.
Benchmark Results
OpenAI reports that GPT-5.5 achieved state-of-the-art results on 14 technical benchmarks. The model consistently outperformed Anthropic’s Claude Opus 4.7 and Google Gemini 3.1 Pro in agentic evaluations.
| Benchmark | GPT-5.5 Score | Claude Opus 4.7 Score |
|---|---|---|
| Terminal-Bench 2.0 | 82.7% | 69.4% |
| OSWorld-Verified | 78.7% | 78.0% |
| FrontierMath Tier 4 | 39.6% (Pro variant) | 22.9% |
The standard model also scored 73.1% on Expert-SWE, an internal coding benchmark measuring tasks that average 20 hours of human work. On GDPval, which tests economic tasks across 44 fields, the model set a new record at 84.9%.
Infrastructure and Training
Internally codenamed Spud, the model was trained and serves on NVIDIA GB200 and GB300 NVL72 infrastructure. The architecture is more token-efficient than GPT-5.4, delivering equivalent AI coding workflows at half the compute cost of competing models. OpenAI utilized GPT-5.5 during development to write new load-balancing heuristics for its own serving infrastructure, resulting in a 20 percent increase in token generation speeds.
Security testing evaluated biological and cyber threats. Under OpenAI’s Preparedness Framework, the cybersecurity capability rating is classified as High.
If you build systems that rely on cross-application context, the integration of Codex and Atlas within a single desktop environment changes how you structure multi-step automation. You should evaluate the GPT-5.5 API pricing structure against your current retrieval setups to determine if the 1 million token context window offsets the higher output token costs.
Get Insanely Good at AI
The book for developers who want to understand how AI actually works. LLMs, prompt engineering, RAG, AI agents, and production systems.
Keep Reading
How to Implement Multi-Agent Coordination Patterns
Learn five production-grade architectural patterns for multi-agent systems to optimize performance, hierarchy, and context management in AI engineering.
Google Launches Workspace Intelligence and Workspace MCP Server
Google announced a new agentic AI layer for Workspace apps alongside hardware updates and a Model Context Protocol server for developers.
Microsoft Deploys Agent Mode and Claude 3.5 Sonnet to Office
Microsoft is rolling out Agent Mode across Word, Excel, and PowerPoint, adding Claude 3.5 Sonnet and in-canvas task execution to Microsoft 365.
Scaling Ecom-RLVE for Verifiable AI Shopping Agents
The new Ecom-RLVE framework replaces subjective AI judging with algorithmic verification to train reliable e-commerce agents through adaptive RL environments.
AI Agents Get Post-Quantum Networking in Cloudflare Mesh
Cloudflare Mesh introduces a secure fabric for AI agents, users, and nodes, replacing legacy VPNs with identity-based, post-quantum encrypted connectivity.