Microsoft shader execution reordering delivers major ray tracing gains on latest GPUs

Microsoft’s shader execution reordering demo shows up to a 90% ray tracing performance boost on Intel Arc B-Series and about an 80% uplift on Nvidia GeForce RTX 5080 'Blackwell' in independent testing, signaling significant headroom for future games.

Microsoft has updated its Agility SDK to version 1.619, introducing DirectX Shader Model 6.9 and new DirectX 12 enhancements that center on a feature called shader execution reordering, or SER. With SER, the application programming interface gives applications the ability to dynamically sort rays for highly optimized parallel execution, which improves ray tracing performance by aligning similar workloads and reducing idle hardware cycles. The technology is being showcased in a dedicated Direct3D 12 demonstration that highlights how much ray tracing performance can improve when ray processing is reordered more intelligently.

In Microsoft testing, Intel Arc B-Series GPUs, which include ‘Battlemage’ discrete graphics and Xe3-based integrated graphics in ‘Panther Lake,’ managed to achieve a 90% framerate increase in the technology demonstration, indicating that ray tracing on this architecture benefits significantly from more efficient ray scheduling. The company also tested the Nvidia GeForce RTX 4090 with shader execution reordering enabled, and the GeForce RTX 4090 with SER scored a 40% improvement over the default ray sorting in the previous execution model, underscoring that even high end existing hardware can see substantial uplift from the new approach.

Independent testing from Osvaldo Pinali Doederlein on X showed that the GeForce RTX 5080 ‘Blackwell’ GPU scored about an 80% improvement running this demo, providing external confirmation that SER can dramatically raise ray tracing performance on next generation Nvidia hardware. Microsoft has built the D3D12RaytracingHelloShaderExecutionReordering demo as a minimal implementation of the technology, allowing developers and enthusiasts to test their own hardware and directly measure the performance improvement. These results suggest that as game engines adopt shader execution reordering on DirectX 12, players with compatible Intel and Nvidia GPUs could see substantially higher ray traced frame rates without changing hardware.

68

Impact Score

JEDEC outlines LPDDR6 expansion for data centers

JEDEC has previewed planned updates to LPDDR6 aimed at pushing the memory standard beyond mobile devices and into selected data center and accelerated computing use cases. The roadmap includes higher-capacity packaging options, flexible metadata support, 512 GB densities, and a new SOCAMM2 module standard.

Tsmc debuts A13 process technology

Tsmc has introduced its A13 process at its 2026 North America Technology Symposium as a tighter version of A14 aimed at next-generation Artificial Intelligence, high performance computing, and mobile designs. The company positions the node as a more compact and efficient option with backward-compatible design rules for faster migration.

Google unveils eighth-generation tensor processor units

Google introduced its eighth generation of custom tensor processor units with separate designs for training and inference. The new TPU 8t and TPU 8i are aimed at large-scale model training, serving, and agentic workloads.

Contact Us

Got questions? Use the form to contact us.

Contact Form

Clicking next sends a verification code to your email. After verifying, you can enter your message.