NVIDIA and AWS announced an expanded strategic collaboration at AWS re:Invent focused on artificial intelligence (AI) infrastructure, interconnects, cloud services and physical AI. AWS will support NVIDIA NVLink Fusion to deploy custom silicon, including next-generation Trainium4 chips for inference and agentic model training, Graviton CPUs and the Nitro System virtualization infrastructure. Using NVLink Fusion, AWS plans to combine the NVIDIA NVLink scale-up interconnect and the NVIDIA MGX rack architecture with its custom silicon to increase performance and accelerate time to market for cloud-scale AI capabilities.
The hardware expansion includes immediate availability of the NVIDIA Blackwell architecture on AWS, including NVIDIA HGX B300 and NVIDIA GB300 NVL72 GPUs, with NVIDIA RTX PRO 6000 Blackwell Server Edition GPUs expected in the coming weeks. AWS also launched AWS AI Factories, a dedicated infrastructure offering that pairs NVIDIA Blackwell GPUs, NVIDIA Spectrum-X Ethernet switches and the full-stack NVIDIA accelerated computing platform to give customers sovereign deployment options and compliance controls. Public sector organizations are highlighted as beneficiaries, with the unified architecture intended to let customers train and deploy massive models while maintaining control of proprietary data.
The partnership also deepens software and tooling integrations. NVIDIA Nemotron open models, including Nemotron Nano 2 and Nemotron Nano 2 VL, are integrated with Amazon Bedrock to enable production-scale generative and agentic applications. Amazon OpenSearch Service now offers serverless GPU acceleration for vector index building powered by NVIDIA cuVS, with early adopters seeing up to 10x faster vector indexing at a quarter of the cost. AWS and NVIDIA are also co-engineering agent tooling and observability through Strands Agents, NVIDIA NeMo Agent Toolkit and Amazon Bedrock AgentCore, and bringing NVIDIA Cosmos world foundation models to Amazon EKS and AWS Batch for robotics simulation and synthetic data generation. The companies noted a sustained collaboration spanning more than 15 years and emphasized global deployments and sovereign cloud commitments, backed by award recognition from AWS.
