Physical artificial intelligence with NVIDIA Cosmos

NVIDIA Cosmos is a platform for physical Artificial Intelligence that provides open world foundation models, guardrails, and an accelerated data pipeline for developers building robots, autonomous vehicles, and video analytics agents. Models, tooling, and datasets are available on GitHub and Hugging Face under the NVIDIA Open Model License.

NVIDIA Cosmos is presented as a platform purpose-built for physical Artificial Intelligence. It combines pretrained multimodal world foundation models, guardrails, and an accelerated data processing and curation pipeline to help developers simulate, reason, and generate synthetic data for downstream workflows. The site emphasizes Cosmos as a developer-facing stack for autonomous vehicles, robots, and video analytics agents, and points users to GitHub and Hugging Face where Cosmos models, tokenizers, and resources are openly available under the NVIDIA Open Model License.

The Cosmos model family contains three core world foundation models. Cosmos Predict is a world state prediction model that can generate up to 30 seconds of continuous video from multimodal inputs to support forecasting and scenario planning. Cosmos Transfer is a multi-control style transfer model that scales single simulations across environments and lighting conditions, ingesting physics-based simulator outputs from tools such as CARLA or NVIDIA Isaac Sim to accelerate controllable synthetic data generation. Cosmos Reason is a reasoning vision language model designed to answer text queries about image and video inputs, applying prior knowledge, physics understanding, and common-sense reasoning to annotate, critique, or create prompts for the other models.

Data and tooling are highlighted as integral to the platform. NVIDIA Cosmos Curator is a framework for filtering, annotating, and deduplicating large sensor datasets, and Cosmos Dataset Search lets developers query and retrieve targeted scenarios for post-training. The site links to the Cosmos Cookbook and PyTorch post-training scripts to customize models, and references complementary NVIDIA technologies including NeMo Curator, the Cosmos tokenizer, NVIDIA NeMo, NIM microservices, and NVIDIA DGX Cloud. For infrastructure, the page recommends NVIDIA RTX PRO 6000 Blackwell Series servers and the NVIDIA Blackwell GB200 for peak performance on post-training and inference workloads. The ecosystem section lists partners and adopters across robotics, autonomous vehicle, and vision AI industries and points developers to demos, sessions, and news to get started with Cosmos.

58

Impact Score

Best artificial intelligence video generators for every creator

Leading artificial intelligence video tools like Sora, Veo 3, Adobe Firefly, Runway and Midjourney target different needs, from free social clips to commercially safe productions, but all come with legal and ethical tradeoffs. Choosing the right platform means balancing price, creative control, output quality and how each service handles your data and copyrights.

UK mps open inquiry into artificial intelligence and edtech in education

UK mps have launched a cross party inquiry into how artificial intelligence and education technology are reshaping learning across early years, schools, colleges and universities, and how government should balance innovation with safeguards. The education committee will examine opportunities to improve teaching and workload alongside risks around inequality, privacy, safeguarding and assessment.

Contact Us

Got questions? Use the form to contact us.

Contact Form

Clicking next sends a verification code to your email. After verifying, you can enter your message.