
ExLLaMa

Creator: Seonglae Cho
Created: 2023 Jul 9 6:46
Editor: Seonglae Cho
Edited: 2024 Aug 10 7:37
Refs
WebUI is good
exllama by turboderp (updated 2024 Aug 8 8:24)

ExLlamaV2

ML Blog - ExLlamaV2: The Fastest Library to Run LLMs
Quantize and run EXL2 models
https://mlabonne.github.io/blog/posts/ExLlamaV2_The_Fastest_Library_to_Run%C2%A0LLMs.html

Backlinks

TGI Quantization

Copyright Seonglae Cho