SALE: Low-bit Estimation for Efficient Sparse Attention in Long-context LLM Prefilling
Published in arXiv preprint, 2025
Xiaodong Ji, Hailin Zhang, Fangcheng Fu, Bin Cui