SpQR: A Sparse-Quantized Representation for Near-Lossless LLM Weight Compression
Recent advances in large language model (LLM) pretraining have led to high-quality LLMs with impressive abilities. By compressing such LLMs via quantization to 3-4 bits per parameter, they can fit into memory-limited devices such as laptops and mobile phones, enabling personalized use.
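To make the "3-4 bits per parameter" claim concrete, here is a rough back-of-the-envelope sketch of weight storage at different bit widths; the model sizes, the fp16 baseline, and the helper name `weight_memory_gb` are illustrative assumptions, not from the paper.

```python
# Back-of-the-envelope weight storage for LLMs at various bit widths.
# Only the ~3-4 bit range comes from the abstract; model sizes and the
# fp16 baseline are illustrative assumptions.

def weight_memory_gb(num_params: float, bits_per_param: float) -> float:
    """Return approximate weight storage in gigabytes (decimal GB)."""
    return num_params * bits_per_param / 8 / 1e9

for name, params in [("7B", 7e9), ("13B", 13e9), ("65B", 65e9)]:
    fp16 = weight_memory_gb(params, 16)
    q4 = weight_memory_gb(params, 4)
    q3 = weight_memory_gb(params, 3)
    print(f"{name}: fp16 {fp16:.1f} GB -> 4-bit {q4:.1f} GB, 3-bit {q3:.1f} GB")
```

For example, a 7B-parameter model drops from roughly 14 GB in fp16 to about 3.5 GB at 4 bits, which is what puts it within reach of laptop- and phone-class memory budgets.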
https://arxiv.org/abs/2306.03078