NVIDIA Blackwell Dominates MLPerf Inference Benchmarks

NVIDIA Blackwell's performance in MLPerf Inference V5.0 sets new records, showcasing cutting-edge capabilities in artificial intelligence.

In the latest MLPerf Inference V5.0 benchmarks, NVIDIA's Blackwell platform set new records for artificial intelligence inference. NVIDIA made its first submission using the GB200 NVL72 system, a rack-scale solution designed for AI reasoning. The system connects 72 NVIDIA Blackwell GPUs so that they act as a single massive GPU, and it achieved up to 30x higher throughput on the Llama 3.1 405B benchmark than NVIDIA's H200 NVL8 submission in the same round.

The NVIDIA Blackwell platform is designed for AI factories, which turn raw data into real-time insights. The goal is to deliver accurate, fast answers to as many users as possible at the lowest cost. Innovations across the stack, in silicon, networking, and software, let larger models with billions of parameters deliver those answers in real time while keeping cost and compute use in check.

The NVIDIA Hopper platform also performed strongly, with full-stack optimizations significantly improving its throughput on the Llama 2 70B benchmark. These continued gains show that NVIDIA's existing platforms keep increasing in value as models grow more complex and users expect more responsive experiences. Submissions from several partners, together with MLPerf's peer-reviewed benchmarking process, further underline the reach of NVIDIA's AI technologies.


