Untrained Tasks Now Possible via π0.7 Robotic Brain
Physical Intelligence unveils π0.7, a foundation model enabling robots to solve novel, complex problems through compositional generalization.
Physical Intelligence released π0.7, a steerable robotic foundation model capable of untrained task execution. The April 2026 release demonstrates compositional generalization, allowing robots to combine fragmented knowledge from disparate training data to solve novel physical problems. For developers building robotic control systems, this shifts the production paradigm from training specialized models for individual tasks to prompting a single multi-robot foundation model.
Architecture and Training Logic
π0.7 builds on the company’s previous Vision-Language-Action (VLA) architectures. The model ingests Internet-scale vision-language pretraining data alongside specialized multi-robot datasets. It processes this multimodal input and directly outputs low-level motor commands.
The core capability relies on compositional generalization rather than rote memorization. During internal testing, the model successfully operated an air fryer despite minimal relevant training data. The closest episodes in the dataset showed a robot pushing an air fryer basket and interacting with the appliance in a home setting, plus data from the open-source DROID dataset on a different robot platform. The model synthesized these fragmented behaviors to complete the novel air fryer task with step-by-step language coaching. If you build multi-agent systems or physical robotic controllers, this capability drastically reduces the burden of gathering exhaustive task-specific demonstration data.
Performance Benchmarks
In general-purpose internal testing, π0.7 matched the performance of previous specialist models. Specialist models are traditionally trained to execute exactly one function. π0.7 handled diverse physical operations without task-specific fine-tuning.
Demonstrated tasks include:
- Making coffee
- Folding laundry
- Assembling boxes
- Cleaning entire kitchens and bedrooms using mobile manipulators
The model’s performance matches that of fine-tuned specialist models across dexterous tasks, while also being able to compose and recombine skills to solve new problems. Validating these physical models across varying environments requires rigorous methodology, similar to how you evaluate and test AI agents in software interfaces.
Hardware Decoupling and System Upgrades
Physical Intelligence positions π0.7 as a universal operating system for robotics. The underlying architecture is designed to decouple the artificial brain from specific hardware bodies.
This release builds on several rapid architectural iterations. The earlier π*0.6 specialist models utilized reinforcement learning via the Recap algorithm to improve success rates and throughput on individual tasks. π0.7 distills experience from those RL-trained specialists into a single general-purpose model, matching or exceeding their performance without task-specific fine-tuning. Implementing similar long-horizon task execution in your own software applications often requires dedicated architectural work to add memory to AI agents.
These technical deployments parallel a massive influx of capital. Reports from late March 2026 indicate Physical Intelligence is in talks to raise $1 billion at an $11 billion valuation, with investors including Founders Fund, Lightspeed Venture Partners, Thrive Capital, and Lux Capital. This would follow a $600 million round at a $5.6 billion valuation in November 2025, led by CapitalG.
If you deploy robotic hardware, you should evaluate steerable foundation models against your existing specialist stack. The ability to generate reliable low-level motor commands from a generalized VLA model means you can prioritize hardware integration and prompt orchestration over collecting thousands of manual training demonstrations.
Get Insanely Good at AI
The book for developers who want to understand how AI actually works. LLMs, prompt engineering, RAG, AI agents, and production systems.
Keep Reading
How to Fine-Tune Cosmos Predict 2.5 for Robotics With LoRA
Learn how to adapt NVIDIA's 2B and 14B Cosmos Predict 2.5 world foundation models using parameter-efficient fine-tuning methods like LoRA and DoRA.
Google's Gemini Robotics-ER 1.6 Gives Robots Better Brains
DeepMind's Gemini Robotics-ER 1.6 upgrades embodied AI with multi-angle success detection, industrial gauge reading, and superior spatial reasoning.
GENE-26.5 Gives Hardware-Agnostic Robots Human-Scale Dexterity
The French robotics startup Genesis AI has released GENE-26.5, a hardware-agnostic foundation model paired with a custom human-scale robotic hand.
AI Therapy App The Path Hits 95 on Vera-MH Safety Benchmark
Founded by Tony Robbins and Calm alumni, The Path launched with $14.3 million in seed funding to build specialized foundation models for clinical AI therapy.
Cosmos 3 Open Omnimodel Merges World Simulation and Action
NVIDIA released Cosmos 3, an open-weight omnimodel that unifies vision reasoning, world simulation, and action prediction for physical AI applications.