3nm Trainium3 Chips Pivot AWS to Direct Merchant Silicon
Amazon Web Services is shifting its semiconductor strategy, selling its 3nm Trainium3 and Inferentia3 AI chips directly to external data center operators.
Amazon Web Services has transitioned from a cloud-only hardware operator to a direct merchant silicon vendor. During a June 17 investor briefing, Amazon CEO Andy Jassy confirmed the company will begin selling its custom AI chips as standalone hardware to external data centers. The strategic shift breaks the previous exclusivity of Trainium and Inferentia processors to AWS infrastructure, turning the cloud provider into a direct competitor against Nvidia, AMD, and Intel.
Jassy framed the policy change as a $50 billion revenue opportunity. For enterprise customers and independent colocation facilities, the move unlocks access to Amazon’s proprietary silicon without requiring workloads to live inside an AWS data center.
Trainium3 and Nitromorph Interconnect
The commercial hardware catalog leads with the AWS Trainium3 processor. Manufactured on a 3nm process, the chip utilizes a modular tile architecture designed for foundation model training. This physical design allows external operators to scale deployments from individual server racks up to localized UltraClusters.
AWS is pairing the silicon sales with licensing for its Nitromorph high-speed interconnect technology. By offering Nitromorph externally, third-party data centers can link Trainium3 clusters with the same bandwidth and latency profiles previously restricted to AWS Availability Zones.
For production AI inference workloads, AWS is also selling the Inferentia3 processor. This chip focuses on low-latency, high-throughput execution of pre-trained models.
Hardware Pricing and Availability
AWS targets the Nvidia H200 and B200 deployment market with aggressive cost claims. While specific per-unit MSRPs are not yet public, AWS states the total cost of ownership for training on its hardware will run 35-40% lower than comparable Nvidia architectures.
Rollout happens in two phases:
- Q3 2026: Validation Kits ship to select colocation providers like Equinix and Digital Realty.
- January 2027: General availability for enterprise hardware purchasing.
Decoupling the Software Stack
Selling physical chips requires a software ecosystem that does not assume an AWS cloud environment. To support this, AWS open-sourced the Neutron-SDK. This compiler and runtime suite bridges the gap between common machine learning frameworks and the proprietary hardware.
Developers looking to scale PyTorch training or run JAX workloads can use the Neutron-SDK to execute code directly on standalone Trainium hardware. The SDK ensures that teams building custom models avoid vendor lock-in at the cloud level, isolating their dependency strictly to the silicon and compiler layers.
If you operate private infrastructure to comply with data sovereignty requirements, this hardware availability changes how you reduce LLM API costs. Large-scale AI labs, including Anthropic, have already signaled plans to deploy Trainium3 within localized, private clusters to keep proprietary training data strictly on-premises while leveraging modern 3nm chip efficiencies.
Get Insanely Good at AI
The book for developers who want to understand how AI actually works. LLMs, prompt engineering, RAG, AI agents, and production systems.
Keep Reading
How to Route GPU GitHub Actions to Hugging Face Jobs
Offload your training and GPU-heavy CI workloads to Hugging Face Jobs using their new ephemeral GitHub runners and action integrations.
AI Exploit Chains Prompt Cloudflare's New Defense Architecture
Cloudflare detailed a four-layer security architecture designed to counter rapid exploit chain construction by frontier AI models like Claude Mythos.
Cloudflare Rebuilds CLI on Vite Following VoidZero Acquisition
Cloudflare acquired VoidZero, bringing the Rust-based Vite build ecosystem internally to unify local development environments with global edge runtimes.
Gemini Enterprise Demand Drives $30B SpaceX GPU Contract
Google has signed a $30 billion agreement to rent 110,000 NVIDIA GPUs from SpaceX at $920 million per month to meet demand for its Gemini Enterprise platform.
$650M Backs Groq's Neocloud Pivot After $20B Nvidia Deal
Following a $20 billion licensing agreement with Nvidia, Groq is raising $650 million to transition into an AI inference service provider dubbed Groq 2.0.