NVIDIA and AWS expand full-stack partnership for Artificial Intelligence compute platform

NVIDIA and AWS expanded integration around Artificial Intelligence infrastructure at AWS re:Invent, announcing support for NVIDIA NVLink Fusion with Trainium4, Graviton and the Nitro System. the move aims to unify NVIDIA scale-up interconnect and MGX rack architecture with AWS custom silicon to speed cloud-scale Artificial Intelligence deployments.

NVIDIA and AWS announced an expanded strategic collaboration at AWS re:Invent focused on Artificial Intelligence infrastructure, interconnects, cloud services and physical Artificial Intelligence. AWS will support NVIDIA NVLink Fusion to deploy custom silicon including next-generation Trainium4 chips for inference and agentic model training, Graviton CPUs and the Nitro System virtualization infrastructure. using NVLink Fusion, AWS plans to combine NVIDIA NVLink scale-up interconnect and the NVIDIA MGX rack architecture with its custom silicon to increase performance and accelerate time to market for cloud-scale Artificial Intelligence capabilities.

the hardware expansion includes immediate availability of NVIDIA Blackwell architecture on AWS, including NVIDIA HGX B300 and NVIDIA GB300 NVL72 GPUs, with NVIDIA RTX PRO 6000 Blackwell Server Edition GPUs expected in the coming weeks. AWS also launched AWS AI Factories, a dedicated infrastructure offering that pairs NVIDIA Blackwell GPUs, NVIDIA Spectrum-X Ethernet switches and the full-stack NVIDIA accelerated computing platform to give customers sovereign deployment options and compliance controls. public sector organizations are highlighted as beneficiaries, with the unified architecture intended to let customers train and deploy massive models while maintaining control of proprietary data.

the partnership extends deep software and tooling integrations. NVIDIA Nemotron open models, including Nemotron Nano 2 and Nemotron Nano 2 VL, are integrated with Amazon Bedrock to enable production-scale generative and agentic applications. Amazon OpenSearch Service now offers serverless GPU acceleration for vector index building powered by NVIDIA cuVS, with early adopters seeing up to 10x faster vector indexing at a quarter of the cost. AWS and NVIDIA are also co-engineering agent tooling and observability through Strands Agents, NVIDIA NeMo Agent Toolkit and Amazon Bedrock AgentCore, and bringing NVIDIA Cosmos world foundation models to Amazon EKS and AWS Batch for robotics simulation and synthetic data generation. the companies noted a sustained collaboration spanning more than 15 years and emphasized global deployments and sovereign cloud commitments backed by award recognition from AWS.

68

Impact Score

the state of artificial intelligence and DeepSeek strikes again

the download highlights a new MIT Technology Review and Financial Times feature on the uneven economic effects of Artificial Intelligence and a roundup of major technology items, including DeepSeek’s latest model claims and an Amsterdam welfare Artificial Intelligence investigation.

Contact Us

Got questions? Use the form to contact us.

Contact Form

Clicking next sends a verification code to your email. After verifying, you can enter your message.