Ai Agents 3 min read

Untrained Tasks Now Possible via π0.7 Robotic Brain

Physical Intelligence unveils π0.7, a foundation model enabling robots to solve novel, complex problems through compositional generalization.

Physical Intelligence released π0.7, a steerable robotic foundation model capable of untrained task execution. The April 2026 release demonstrates compositional generalization, allowing robots to combine fragmented knowledge from disparate training data to solve novel physical problems. For developers building robotic control systems, this shifts the production paradigm from training specialized models for individual tasks to prompting a single multi-robot foundation model.

Architecture and Training Logic

π0.7 builds on the company’s previous Vision-Language-Action (VLA) architectures. The model ingests Internet-scale vision-language pretraining data alongside specialized multi-robot datasets. It processes this multimodal input and directly outputs low-level motor commands.

The core capability relies on compositional generalization rather than rote memorization. During internal testing, the model successfully operated an air fryer despite minimal relevant training data. The closest episodes in the dataset showed a robot pushing an air fryer basket and interacting with the appliance in a home setting, plus data from the open-source DROID dataset on a different robot platform. The model synthesized these fragmented behaviors to complete the novel air fryer task with step-by-step language coaching. If you build multi-agent systems or physical robotic controllers, this capability drastically reduces the burden of gathering exhaustive task-specific demonstration data.

Performance Benchmarks

In general-purpose internal testing, π0.7 matched the performance of previous specialist models. Specialist models are traditionally trained to execute exactly one function. π0.7 handled diverse physical operations without task-specific fine-tuning.

Demonstrated tasks include:

  • Making coffee
  • Folding laundry
  • Assembling boxes
  • Cleaning entire kitchens and bedrooms using mobile manipulators

The model’s performance matches that of fine-tuned specialist models across dexterous tasks, while also being able to compose and recombine skills to solve new problems. Validating these physical models across varying environments requires rigorous methodology, similar to how you evaluate and test AI agents in software interfaces.

Hardware Decoupling and System Upgrades

Physical Intelligence positions π0.7 as a universal operating system for robotics. The underlying architecture is designed to decouple the artificial brain from specific hardware bodies.

This release builds on several rapid architectural iterations. The earlier π*0.6 specialist models utilized reinforcement learning via the Recap algorithm to improve success rates and throughput on individual tasks. π0.7 distills experience from those RL-trained specialists into a single general-purpose model, matching or exceeding their performance without task-specific fine-tuning. Implementing similar long-horizon task execution in your own software applications often requires dedicated architectural work to add memory to AI agents.

These technical deployments parallel a massive influx of capital. Reports from late March 2026 indicate Physical Intelligence is in talks to raise $1 billion at an $11 billion valuation, with investors including Founders Fund, Lightspeed Venture Partners, Thrive Capital, and Lux Capital. This would follow a $600 million round at a $5.6 billion valuation in November 2025, led by CapitalG.

If you deploy robotic hardware, you should evaluate steerable foundation models against your existing specialist stack. The ability to generate reliable low-level motor commands from a generalized VLA model means you can prioritize hardware integration and prompt orchestration over collecting thousands of manual training demonstrations.

Get Insanely Good at AI

Get Insanely Good at AI

The book for developers who want to understand how AI actually works. LLMs, prompt engineering, RAG, AI agents, and production systems.

Keep Reading