Microsoft shader execution reordering delivers major ray tracing gains on latest GPUs

Microsoft’s shader execution reordering demo shows up to a 90% ray tracing performance boost on Intel Arc B-Series and about an 80% uplift on Nvidia GeForce RTX 5080 'Blackwell' in independent testing, signaling significant headroom for future games.

Microsoft has updated its Agility SDK to version 1.619, introducing DirectX Shader Model 6.9 and new DirectX 12 enhancements that center on a feature called shader execution reordering, or SER. With SER, the application programming interface gives applications the ability to dynamically sort rays for highly optimized parallel execution, which improves ray tracing performance by aligning similar workloads and reducing idle hardware cycles. The technology is being showcased in a dedicated Direct3D 12 demonstration that highlights how much ray tracing performance can improve when ray processing is reordered more intelligently.

In Microsoft testing, Intel Arc B-Series GPUs, which include ‘Battlemage’ discrete graphics and Xe3-based integrated graphics in ‘Panther Lake,’ managed to achieve a 90% framerate increase in the technology demonstration, indicating that ray tracing on this architecture benefits significantly from more efficient ray scheduling. The company also tested the Nvidia GeForce RTX 4090 with shader execution reordering enabled, and the GeForce RTX 4090 with SER scored a 40% improvement over the default ray sorting in the previous execution model, underscoring that even high end existing hardware can see substantial uplift from the new approach.

Independent testing from Osvaldo Pinali Doederlein on X showed that the GeForce RTX 5080 ‘Blackwell’ GPU scored about an 80% improvement running this demo, providing external confirmation that SER can dramatically raise ray tracing performance on next generation Nvidia hardware. Microsoft has built the D3D12RaytracingHelloShaderExecutionReordering demo as a minimal implementation of the technology, allowing developers and enthusiasts to test their own hardware and directly measure the performance improvement. These results suggest that as game engines adopt shader execution reordering on DirectX 12, players with compatible Intel and Nvidia GPUs could see substantially higher ray traced frame rates without changing hardware.

68

Impact Score

Intel unveils Xeon 6+ Clearwater Forest with 288 efficiency cores on 18A

Intel is introducing its most core-dense Xeon 6+ processor, Clearwater Forest, combining 12 compute chiplets on the Intel 18A node with advanced 2.5D and 3D packaging to deliver up to 288 efficiency cores per socket. The design targets high parallel workloads with expanded cache, high memory bandwidth, and extensive PCIe 5.0 and CXL 2.0 connectivity.

TSMC N2 node capacity booked through 2027 as customers face long delays

TSMC’s next generation N2 process node is already heavily booked through the end of 2027, forcing customers to lock in allocation early or face extended delays and higher costs. The squeeze is reshaping product roadmaps across the semiconductor industry, including for TSMC’s largest customer, NVIDIA.

OpenAI strikes Pentagon deal that sidelines anthropic’s harder line

OpenAI has reached a rushed agreement to let the US military use its Artificial Intelligence models in classified settings, relying on existing law and internal safety rules rather than hard contractual bans on controversial uses. The deal contrasts sharply with Anthropic’s failed attempt to impose explicit red lines on autonomous weapons and mass surveillance, which triggered a fierce backlash from the Pentagon.

Contact Us

Got questions? Use the form to contact us.

Contact Form

Clicking next sends a verification code to your email. After verifying, you can enter your message.