Texonom
Texonom
/
Engineering
Engineering
/Data Engineering/Artificial Intelligence/AI Development/AI Optimization/Model Quantization/Model Quantization Algorithm/
SpQR
Search

SpQR

Creator
Creator
Seonglae Cho
Created
Created
2023 Jun 25 7:20
Editor
Editor
Seonglae Cho
Edited
Edited
2023 Jul 5 8:2
Refs
Refs
  1. Quantized weights
  1. first, second level quantized quantization statistics
  1. CSR outlier indices and values
 
 
 
 
SpQR: A Sparse-Quantized Representation for Near-Lossless LLM...
Recent advances in large language model (LLM) pretraining have led to high-quality LLMs with impressive abilities. By compressing such LLMs via quantization to 3-4 bits per parameter, they can fit...
SpQR: A Sparse-Quantized Representation for Near-Lossless LLM...
https://arxiv.org/abs/2306.03078
SpQR: A Sparse-Quantized Representation for Near-Lossless LLM...
 
 
 

Recommendations

Texonom
Texonom
/
Engineering
Engineering
/Data Engineering/Artificial Intelligence/AI Development/AI Optimization/Model Quantization/Model Quantization Algorithm/
SpQR
Copyright Seonglae Cho