PTQ, Static QuantizationInvolves quantizing the weights and activations of the model파라미터 size 큰 대형 모델에 대해서는 정확도 하락의 폭이 작지만 작으면 하락폭 크다PTQ NotionQuantization Module FusionMixed-precision decomposition Post-training quantization | TensorFlow Model Optimizationhttps://www.tensorflow.org/model_optimization/guide/quantization/post_training딥러닝의 Quantization (양자화)와 Quantization Aware Traininggaussian37's bloghttps://gaussian37.github.io/dl-concept-quantization/