HAWQ: Hessian AWare Quantization of Neural Networks with Mixed-PrecisionModel size and inference speed/power have become a major challenge in the deployment of Neural Networks for many applications. A promising approach to address these problems is quantization....https://arxiv.org/abs/1905.03696