NVIDIA groq 3 LPX targets low-latency Artificial Intelligence inference

NVIDIA positions Groq 3 LPX as an inference accelerator for Vera Rubin built to handle low-latency, large-context workloads for agentic systems. The platform combines Rubin GPUs and LPUs in a co-designed architecture aimed at boosting throughput, token generation, and efficiency at rack scale.

Nvidia sets the stage for GTC 2026 keynote

Nvidia is preparing to outline its next wave of computing, networking, and rendering plans at GTC 2026, with Jensen Huang leading the keynote. The event is expected to focus on next-generation platforms, broader Artificial Intelligence infrastructure, and the company’s expanding partnership with Intel.