32bit floating point to 8bit linear quantization
ONNX Quantization Notion
ONNX Quantization Usages
Quantize ONNX models
ONNX Runtime: cross-platform, high performance ML inferencing and training accelerator
https://onnxruntime.ai/docs/performance/model-optimizations/quantization.html

Seonglae Cho