This tutorial demostrates how to perform INT8 quantization with an image classification model using the Post-Training Optimization Tool Simplified Mode (part of OpenVINO). We use ResNet20 model and Cifar10 dataset.
The code in this tutorial is designed to extend to custom models and datasets. It consists of the following steps:
- Download and prepare the ResNet20 model and calibration dataset
- Prepare the model for quantization
- Compress the model using the simplified mode
- Compare performance of the original and quantized models
- Demonstrate the results of the optimized model