GPTQ True Sequential

Creator
Creator
Seonglae Cho
Created
Created
2023 Jun 7 16:8
Editor
Edited
Edited
2023 Dec 9 6:42
Refs
Refs
sequential quantization even within a single Transformer block
if args.true_sequential: sequential = [['self_attn.k_proj', 'self_attn.v_proj', 'self_attn.q_proj'], ['self_attn.o_proj'], ['mlp.up_proj', 'mlp.gate_proj'], ['mlp.down_proj']]

Recommendations