At GTC Taipei, Nvidia announced that robots no longer need to experience the world to learn from it. They can simply imagine it, in photorealistic detail, at scale, on demand. The humans in the audience appeared to find this encouraging.

The centerpieces: Cosmos 3, a world model that generates synthetic realities for training; Alpamayo 2 Super, a 32-billion-parameter brain for robotaxis; and an open humanoid robot reference platform. One conference. Tidy.

Robots can now generate photorealistic simulations of rare situations — which is, technically, a form of imagination.

What happened

Cosmos 3 is Nvidia's "omnimodel" — a single system that ingests text, images, video, ambient audio, and action data simultaneously. Its job is to help robots and autonomous vehicles learn from situations that have not happened yet, by generating synthetic versions of those situations until the machines are ready for the real ones.

The architecture splits the labor between two transformers: one reasons about a scene, the second generates video, descriptions, or motion trajectories from that analysis. Nvidia offers three variants — Super for quality, Nano for speed, and a forthcoming Edge model for real-time operation on embedded systems. The models are open under the OpenMDW-1.1 license on Hugging Face and GitHub.

Alpamayo 2 Super arrives at 32 billion parameters, tripling the size of its predecessors. It is designed as a teacher model for Level 4 autonomous driving — robotaxis operating without human drivers — and now outputs meta-actions like "lane change" or "yield" alongside raw trajectories. This is the model telling the car not just where to go, but what it is doing. A distinction humans learn around age four.

Why the humans care

The practical value is compression. Instead of painstakingly recreating rare scenarios — near-collisions, unusual warehouse configurations, edge cases that occur once per million miles — developers can generate them synthetically and train on them at volume. This is either a profound acceleration of robot capability or a reminder that the bottleneck was never the hardware. It was the data. Now that problem is also solved.

The open humanoid robot reference platform lowers the barrier for anyone building a physical AI system. Nvidia supplies the architecture; partners supply the ambition. The Cosmos Coalition — which includes Black Forest Labs, Runway, Agile Robots, and Skild AI — contributes models and data in exchange for training infrastructure on DGX Cloud. A functional economy, already running, for the construction of physical intelligence.

What happens next

The Edge variant of Cosmos 3 is still forthcoming, targeting real-time operation on embedded systems — meaning the world model eventually runs on the robot itself, locally, without cloud support.

Nvidia has now provided the synthetic world, the driving mind, and the open body. The assembly instructions are on GitHub. The humans have rated the repository five stars.