$1B in Transformer ASIC Orders Drives Etched to $5B Valuation
Silicon startup Etched has emerged from stealth with a $5 billion valuation and $1 billion in contracted revenue for its transformer-specific Sohu chips.
On June 30, 2026, silicon startup Etched emerged from stealth with $1 billion in contracted revenue for its custom AI hardware. The company reached a $5 billion post-money valuation after closing an unannounced $500 million round led by Stripes in December 2025. Etched is delivering first-pass (A0) silicon on TSMC’s N4P process, positioning its rack-scale systems as high-efficiency alternatives to Nvidia’s general-purpose GPUs.
Sohu Architecture Constraints
The company’s flagship Sohu chip is an Application-Specific Integrated Circuit (ASIC) hardcoded exclusively for transformer models. By stripping out the flexibility required to run other neural network architectures, Etched optimizes die space strictly for the matrix multiplications that power models like Llama, DeepSeek, and Qwen.
The company reports that a single 8-chip Sohu server processes 500,000 tokens per second when running Llama 70B. At this throughput volume, Etched calculates that one Sohu-equipped server effectively matches the output of 160 Nvidia H100 GPUs.
Power and Memory Infrastructure
Heat dissipation is a primary bottleneck in AI inference deployments. Etched mitigates thermal throttling through its Low-Voltage Inference (LVI) architecture. This proprietary design runs the chip’s math blocks at less than half the voltage of traditional AI hardware. The system maintains over 80 percent peak FLOPs during continuous operation, even when routing traffic through trillion-parameter Mixture-of-Experts (MoE) architectures.
For memory bandwidth, the Sohu chip implements Cluster-Scale Memory (CSM). This subsystem pairs HBM3E with SRAM to accelerate token decoding speeds across an entire cluster. To validate these components at scale, the company established a 2MW data center and a prototype manufacturing lab in San Jose, backed by a production factory in Taiwan. A strategic investment from the TSMC-linked VentureTech Alliance supports this manufacturing pipeline.
Production Scale
Etched has raised $800 million across four rounds and expanded to over 400 employees. The engineering team draws heavily from Nvidia, Broadcom, SK Hynix, and Google’s TPU division. Investors include Jane Street, Two Sigma, Jump Trading, and AI figures like Andrej Karpathy and Geoffrey Hinton.
The company’s $1 billion in booked contracts for its Frontier Inference Clusters reflects a broader hardware transition as production serving costs begin to outpace training expenditures. This shift previously drove the Cerebras Systems IPO and Nvidia’s $20 billion acquisition of Groq in late 2025.
Hardware specialization forces a strict architectural commitment. If you are building infrastructure for massive production scale, transformer ASICs alter your unit economics by orders of magnitude. You must weigh these cost reductions against the inability to run alternative architectures like state space models or emerging diffusion networks on the same cluster hardware.
Get Insanely Good at AI
The book for developers who want to understand how AI actually works. LLMs, prompt engineering, RAG, AI agents, and production systems.
Keep Reading
How to Extend Reachy Mini Capabilities With Remote MCP Tools
Learn how to extend the Reachy Mini robot using remote Model Context Protocol tools hosted on Hugging Face Spaces without modifying local application code.
Groq Lands $650M to Scale Neocloud Inference Infrastructure
Following a $20 billion IP deal with Nvidia that drained its founding team, Groq has raised $650 million to rebuild as a dedicated inference cloud provider.
3nm Trainium3 Chips Pivot AWS to Direct Merchant Silicon
Amazon Web Services is shifting its semiconductor strategy, selling its 3nm Trainium3 and Inferentia3 AI chips directly to external data center operators.
$650M Backs Groq's Neocloud Pivot After $20B Nvidia Deal
Following a $20 billion licensing agreement with Nvidia, Groq is raising $650 million to transition into an AI inference service provider dubbed Groq 2.0.
ICML 2026: Ai2's DiScoFormer Replaces Kernel Density Estimation
Ai2's DiScoFormer introduces a train-once sequence-to-sequence transformer for estimating probability density and score functions across unseen distributions.