TurboQuant significantly increases capacity and speeds up key-value cache (KV cache) in AI inference. KV-cache is a type of memory that enables an AI algorithm to retain previous context without ...