Weight quantization is a technique that reduces the number of bits used to represent model weights. It works by converting weights from high-precision floating-point representations to low-precision floating-point or integer representations.
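As a concrete illustration, the sketch below shows one common scheme, symmetric per-tensor int8 quantization, where float32 weights are scaled onto the integer range [-127, 127] and later dequantized back to approximate floats. The function names and the NumPy-based setup are illustrative assumptions, not a specific library's API.

```python
import numpy as np

def quantize_weights_int8(weights: np.ndarray):
    """Symmetric per-tensor quantization of float32 weights to int8."""
    # The scale maps the largest absolute weight onto the int8 range [-127, 127].
    scale = np.max(np.abs(weights)) / 127.0
    q = np.clip(np.round(weights / scale), -127, 127).astype(np.int8)
    return q, scale

def dequantize_weights(q: np.ndarray, scale: float) -> np.ndarray:
    """Recover approximate float32 weights from the int8 representation."""
    return q.astype(np.float32) * scale

# Example: quantize a small random weight matrix and measure the round-trip error.
w = np.random.randn(4, 4).astype(np.float32)
q, scale = quantize_weights_int8(w)
w_hat = dequantize_weights(q, scale)
print("max abs error:", np.max(np.abs(w - w_hat)))
```

Storing `q` (1 byte per weight) plus a single scale factor in place of float32 weights (4 bytes per weight) gives roughly a 4x reduction in memory, at the cost of a small rounding error per weight.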