Google compression algorithm targets data center energy use

Google has unveiled TurboQuant, a compression algorithm designed to reduce the memory footprint of large language models and make them more efficient to run. The approach points to a future in which artificial intelligence models need less data center capacity and could run on smaller devices.