Stock prices of DRAM manufacturers dipped by as much as 19% over the past 5 days, over the March 24 announcement of Google TurboQuant, a new technology that Google claims will reduce the memory footprint of Artificial Intelligence models by a factor of 6, and improve inference speeds by a factor of 8. As of this writing, Micron Technology (NASDAQ: MU) dipped 19.5% over the last 5 days. Over in Korea, SK Hynix saw its stock price drop by 6%, while Samsung Electronics saw a dip by 5%.
TurboQuant is an advanced quantization algorithm developed by Google that delivers massive data compression for LLMs and vector search engines. It targets memory bottlenecks in the key-value cache and accelerates similarity lookups without sacrificing model accuracy. The system is presented as a way to improve efficiency in workloads where memory use and retrieval speed are critical constraints.
TurboQuant achieves this efficiency by combining two novel techniques: PolarQuant, which simplifies data geometry using polar coordinates to eliminate traditional memory overhead, and Quantized Johnson-Lindenstrauss (QJL), a 1-bit mathematical error-checker. Capable of compressing the key-value cache to just 3 bits without requiring fine-tuning, TurboQuant enables up to 8x faster runtimes on GPUs, establishing a new standard for Artificial Intelligence efficiency.
