PQCache: Product Quantization-based KVCache for Long Context LLM InferencePublished in Proceedings of the ACM on Management of Data (SIGMOD), 2025Hailin Zhang, Xiaodong Ji, Yilin Chen, Fangcheng Fu, Xupeng Miao, Xiaonan Nie, Weipeng Chen, Bin CuiShare on Twitter Facebook LinkedIn Previous Next