Nvidia has claimed the top spot on the 31st Graph500 breadth-first search list with a benchmark result of 410 trillion traversed edges per second (TEPS), delivered on a commercially available cluster hosted by cloud provider CoreWeave. The record-setting run took place in a CoreWeave data center in Dallas and used 8,192 Nvidia H100 GPUs to process a graph containing 2.2 trillion vertices and 35 trillion edges. According to Nvidia, this result is more than double that of comparable Graph500 entries, including systems operated by national laboratories, and it highlights the potential of the company's accelerated computing stack for large-scale graph workloads.
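For context, the Graph500 benchmark measures how quickly a system can expand a breadth-first-search frontier across a huge sparse graph, with TEPS counting the edges inspected per second along the way. The following minimal single-GPU CUDA sketch of one level-synchronous expansion step is purely illustrative; the tiny graph, kernel, and variable names are assumptions for this article, not the distributed code behind the record run, which spreads this work across thousands of GPUs.

```cuda
#include <cstdio>
#include <vector>
#include <cuda_runtime.h>

// One level-synchronous BFS expansion step over a graph in CSR form.
// Every neighbor inspected in the inner loop is one "traversed edge",
// the unit behind the TEPS metric.
__global__ void bfs_expand(const int *row_off, const int *cols,
                           const int *frontier, int frontier_size,
                           int *levels, int *next_frontier, int *next_size,
                           int level) {
    int i = blockIdx.x * blockDim.x + threadIdx.x;
    if (i >= frontier_size) return;
    int v = frontier[i];
    for (int e = row_off[v]; e < row_off[v + 1]; ++e) {
        int u = cols[e];
        // Claim unvisited vertices atomically so each enters the next frontier once.
        if (atomicCAS(&levels[u], -1, level + 1) == -1)
            next_frontier[atomicAdd(next_size, 1)] = u;
    }
}

int main() {
    // Tiny undirected example graph: edges 0-1, 0-2, 1-3, 2-3, 3-4.
    std::vector<int> row_off = {0, 2, 4, 6, 9, 10};
    std::vector<int> cols    = {1, 2, 0, 3, 0, 3, 1, 2, 4, 3};
    int n = 5;

    int *d_row, *d_cols, *d_levels, *d_front, *d_next, *d_next_size;
    cudaMalloc((void **)&d_row, row_off.size() * sizeof(int));
    cudaMalloc((void **)&d_cols, cols.size() * sizeof(int));
    cudaMalloc((void **)&d_levels, n * sizeof(int));
    cudaMalloc((void **)&d_front, n * sizeof(int));
    cudaMalloc((void **)&d_next, n * sizeof(int));
    cudaMalloc((void **)&d_next_size, sizeof(int));
    cudaMemcpy(d_row, row_off.data(), row_off.size() * sizeof(int), cudaMemcpyHostToDevice);
    cudaMemcpy(d_cols, cols.data(), cols.size() * sizeof(int), cudaMemcpyHostToDevice);

    std::vector<int> levels(n, -1);
    levels[0] = 0;                                   // vertex 0 is the BFS source
    cudaMemcpy(d_levels, levels.data(), n * sizeof(int), cudaMemcpyHostToDevice);
    int src = 0, frontier_size = 1;
    cudaMemcpy(d_front, &src, sizeof(int), cudaMemcpyHostToDevice);

    // Expand one frontier per iteration until no new vertices are discovered.
    for (int level = 0; frontier_size > 0; ++level) {
        cudaMemset(d_next_size, 0, sizeof(int));
        bfs_expand<<<(frontier_size + 127) / 128, 128>>>(
            d_row, d_cols, d_front, frontier_size,
            d_levels, d_next, d_next_size, level);
        cudaMemcpy(&frontier_size, d_next_size, sizeof(int), cudaMemcpyDeviceToHost);
        int *tmp = d_front; d_front = d_next; d_next = tmp;  // swap frontiers
    }

    cudaMemcpy(levels.data(), d_levels, n * sizeof(int), cudaMemcpyDeviceToHost);
    for (int v = 0; v < n; ++v) printf("vertex %d -> BFS level %d\n", v, levels[v]);
    return 0;
}
```

The atomic compare-and-swap ensures each vertex joins the next frontier exactly once, while every adjacency-list entry scanned counts toward the TEPS figure; the record system performs this kind of expansion over 35 trillion edges spread across 8,192 GPUs.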
The company emphasizes that efficiency is as important as raw speed. While a comparable top 10 Graph500 system used about 9,000 nodes, the Nvidia and CoreWeave configuration reached its result with just over 1,000 nodes, which the company says delivers 3x better performance per dollar. Nvidia illustrates the scale by noting that if every person on Earth had 150 friends, the resulting social graph would contain about 1.2 trillion edges, and the demonstrated system could search all such relationships in about three milliseconds. The achievement relies on Nvidia's integrated platform, spanning Nvidia CUDA software, Spectrum-X networking, H100 GPUs and a new active messaging library designed to minimize hardware footprint while maximizing throughput.
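Those two figures are consistent with each other. A back-of-the-envelope check (the constants below are assumptions taken from the claims above: roughly 8 billion people, 150 relationships each, and the 410-trillion-TEPS rate) lands on roughly three milliseconds:

```cuda
#include <cstdio>

int main() {
    // Assumed inputs, taken from the claims above: ~8 billion people,
    // 150 relationships each, and a traversal rate of 410 trillion TEPS.
    const double people = 8.0e9;
    const double relationships_each = 150.0;
    const double teps = 410.0e12;

    const double edges = people * relationships_each;   // ~1.2e12 edges
    const double seconds = edges / teps;                 // ~2.9e-3 s

    printf("edges: %.2e, search time: %.1f ms\n", edges, seconds * 1e3);
    return 0;
}
```

Run as ordinary host code, this prints roughly 1.2e12 edges and a search time of about 2.9 ms.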
Graph500 breadth-first search is a long-standing industry benchmark for navigating sparse, irregular graphs, such as those representing social networks, banking relationships or cybersecurity data. Traditional approaches to very large graph processing have relied on CPU-based systems, where moving graph data between nodes creates communication bottlenecks at trillion-edge scales. To overcome this, developers have used active messages that process data in place, but these techniques were originally designed for CPUs and are constrained by CPU throughput.
Nvidia reengineered this model around GPUs with a custom framework built on InfiniBand GPUDirect Async (IBGDA) and the NVSHMEM parallel programming interface, enabling GPU-to-GPU active messages and allowing hundreds of thousands of GPU threads to send messages concurrently. By running active messaging entirely on GPUs and exploiting the parallelism and memory bandwidth of H100 devices on CoreWeave's infrastructure, the system doubled the performance of comparable runs while using a fraction of the hardware and cost. Nvidia argues that this approach opens a new path for high-performance computing fields that also depend on sparse data structures, such as fluid dynamics and weather forecasting, letting developers scale their largest applications on commercially available infrastructure with technologies like NVSHMEM and IBGDA.
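To make the idea of GPU-initiated messaging concrete, here is a minimal sketch of the communication primitive underneath such a design: each GPU thread deposits a payload directly into a peer processing element's symmetric buffer with a device-side NVSHMEM put, with no CPU on the data path (the mechanism IBGDA accelerates across InfiniBand). A real active-message layer would add handlers that consume these payloads on the receiving GPU; the ring pattern, buffer sizes and names below are illustrative assumptions, not Nvidia's library.

```cuda
#include <cstdio>
#include <vector>
#include <cuda_runtime.h>
#include <nvshmem.h>
#include <nvshmemx.h>

// Each GPU thread writes a small payload directly into a peer PE's symmetric
// "mailbox" with a device-side one-sided put; the CPU never touches the data path.
__global__ void send_messages(int *mailbox, int slots, int my_pe, int n_pes) {
    int tid = blockIdx.x * blockDim.x + threadIdx.x;
    if (tid >= slots) return;
    int target = (my_pe + 1) % n_pes;                 // message the next PE in a ring
    // One-sided put into slot `tid` of the target PE's mailbox.
    nvshmem_int_p(&mailbox[tid], my_pe * slots + tid, target);
}

int main() {
    nvshmem_init();
    int my_pe = nvshmem_my_pe();
    int n_pes = nvshmem_n_pes();
    cudaSetDevice(nvshmem_team_my_pe(NVSHMEMX_TEAM_NODE));   // one GPU per PE

    const int slots = 1024;
    // Symmetric allocation: the buffer exists on every PE and is remotely writable.
    int *mailbox = (int *)nvshmem_malloc(slots * sizeof(int));

    cudaStream_t stream;
    cudaStreamCreate(&stream);
    send_messages<<<(slots + 255) / 256, 256, 0, stream>>>(mailbox, slots, my_pe, n_pes);
    nvshmemx_barrier_all_on_stream(stream);           // complete all puts before reading
    cudaStreamSynchronize(stream);

    // Verify: this PE's mailbox was filled by its left neighbor in the ring.
    std::vector<int> received(slots);
    cudaMemcpy(received.data(), mailbox, slots * sizeof(int), cudaMemcpyDeviceToHost);
    int left = (my_pe - 1 + n_pes) % n_pes;
    printf("PE %d of %d: slot 0 = %d (expected %d from PE %d)\n",
           my_pe, n_pes, received[0], left * slots, left);

    nvshmem_free(mailbox);
    nvshmem_finalize();
    return 0;
}
```

A sketch like this would typically be built with nvcc against the NVSHMEM headers and libraries (with relocatable device code enabled) and launched with one process per GPU, for example via the nvshmemrun launcher; at benchmark scale, the same pattern lets hundreds of thousands of threads per GPU issue such messages concurrently.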
