NVIDIA Cosmos is presented as a platform purpose-built for physical Artificial Intelligence. It combines pretrained multimodal world foundation models, guardrails, and an accelerated data processing and curation pipeline to help developers simulate, reason, and generate synthetic data for downstream workflows. The site emphasizes Cosmos as a developer-facing stack for autonomous vehicles, robots, and video analytics agents, and points users to GitHub and Hugging Face where Cosmos models, tokenizers, and resources are openly available under the NVIDIA Open Model License.
The Cosmos model family contains three core world foundation models. Cosmos Predict is a world state prediction model that can generate up to 30 seconds of continuous video from multimodal inputs to support forecasting and scenario planning. Cosmos Transfer is a multi-control style transfer model that scales single simulations across environments and lighting conditions, ingesting physics-based simulator outputs from tools such as CARLA or NVIDIA Isaac Sim to accelerate controllable synthetic data generation. Cosmos Reason is a reasoning vision language model designed to answer text queries about image and video inputs, applying prior knowledge, physics understanding, and common-sense reasoning to annotate, critique, or create prompts for the other models.
Data and tooling are highlighted as integral to the platform. NVIDIA Cosmos Curator is a framework for filtering, annotating, and deduplicating large sensor datasets, and Cosmos Dataset Search lets developers query and retrieve targeted scenarios for post-training. The site links to the Cosmos Cookbook and PyTorch post-training scripts to customize models, and references complementary NVIDIA technologies including NeMo Curator, the Cosmos tokenizer, NVIDIA NeMo, NIM microservices, and NVIDIA DGX Cloud. For infrastructure, the page recommends NVIDIA RTX PRO 6000 Blackwell Series servers and the NVIDIA Blackwell GB200 for peak performance on post-training and inference workloads. The ecosystem section lists partners and adopters across robotics, autonomous vehicle, and vision AI industries and points developers to demos, sessions, and news to get started with Cosmos.
