Penguin Solutions launches CXL-based KV cache server

Penguin Solutions introduced a production-ready KV cache server built on CXL memory technology for enterprise-scale inference and agentic Artificial Intelligence workloads. The system is positioned to ease memory bottlenecks, improve GPU cluster efficiency, and reduce latency.

Penguin Solutions introduced a production-ready KV cache server built with CXL memory technology to address the memory wall in Artificial Intelligence inference. The Penguin Solutions MemoryAI KV cache server is designed for enterprise-scale inference, including agentic Artificial Intelligence, and is aimed at improving latency, throughput, GPU cluster efficiency, service-level agreement performance, and time-to-first-token.

Inference workloads are described as fundamentally different from model training and tuning because they are continuous, memory-bound, and latency-sensitive. Inference demand is typically 30% compute-driven (GPU) and 70% memory-driven (RAM), which raises the need for memory capacity. When that capacity falls short, the result is performance bottlenecks and idle GPU time.

The system delivers up to 11 TB of memory for memory-dependent Artificial Intelligence processes. Penguin’s MemoryAI KV cache server reaches that capacity by combining 3 TB of DDR5 main memory with up to eight 1 TB CXL Add-in Cards (AICs), so as much as 8 TB of the total is CXL-attached. The company positions the platform as a way to support higher performance inference while improving the utilization of GPU infrastructure.
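To make the capacity figures concrete, the short Python sketch below adds up the configuration described above and estimates how many tokens of KV cache could fit in it. Only the 3 TB, 1 TB, and eight-card figures come from the article. The model shape (80 layers, 8 KV heads, head dimension 128, 2 bytes per value), the use of 10^12 bytes per TB, and the idea that all of the memory is available to the cache are illustrative assumptions, not Penguin specifications.

```python
# Capacity figures from the article. Treating 1 TB as 10**12 bytes is an assumption.
TB = 10**12

DDR5_MAIN_MEMORY_TB = 3   # DDR5 main memory, from the article
CXL_CARD_TB = 1           # capacity per CXL add-in card, from the article
MAX_CXL_CARDS = 8         # maximum number of cards, from the article


def total_memory_tb(cxl_cards: int = MAX_CXL_CARDS) -> int:
    """Total memory in TB for a given number of installed CXL cards."""
    if not 0 <= cxl_cards <= MAX_CXL_CARDS:
        raise ValueError("cxl_cards must be between 0 and 8")
    return DDR5_MAIN_MEMORY_TB + cxl_cards * CXL_CARD_TB


def kv_bytes_per_token(layers: int, kv_heads: int, head_dim: int,
                       bytes_per_value: int = 2) -> int:
    """KV cache bytes per token: one key and one value vector
    per layer and per KV head."""
    return 2 * layers * kv_heads * head_dim * bytes_per_value


def tokens_that_fit(capacity_tb: int, per_token_bytes: int) -> int:
    """Upper bound on cached tokens, assuming the whole capacity is usable."""
    return capacity_tb * TB // per_token_bytes


if __name__ == "__main__":
    capacity = total_memory_tb()  # 3 TB DDR5 + 8 x 1 TB CXL
    # Hypothetical model shape, chosen only for illustration.
    per_token = kv_bytes_per_token(layers=80, kv_heads=8, head_dim=128)
    print(f"total memory: {capacity} TB")
    print(f"KV cache per token: {per_token} bytes")
    print(f"tokens that fit (upper bound): {tokens_that_fit(capacity, per_token)}")
```

The token count is an upper bound. A real deployment would reserve part of the memory for the operating system and serving software, and per-token cost changes with model architecture and numeric precision.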

Micron samples 256 GB DDR5 9200 MT/s RDIMM server modules

Micron has begun sampling 256 GB DDR5 RDIMM server modules built on its 1-gamma technology to key ecosystem partners. The company positions the new modules as a higher-speed, more power-efficient option for scaling next-generation Artificial Intelligence and HPC infrastructure.

Microsoft emails show early doubts about OpenAI

Court emails show Microsoft executives were unconvinced by OpenAI’s early Artificial Intelligence progress in 2018 while also worrying that rejecting the lab could push it toward Amazon. The messages reveal internal tension between skepticism over technical claims and concern about competitive and public relations fallout.

Apple explores Intel chip manufacturing alliance

Apple has reached a preliminary agreement with Intel to manufacture some chips for its devices, reflecting mounting pressure on semiconductor supply chains as Artificial Intelligence demand absorbs advanced capacity. The move also aligns with Washington’s push to expand domestic chip production and revive Intel’s foundry business.
