IntelĀ® Neural Compressor (formerly known as IntelĀ® Low Precision Optimization Tool), targeting to provide unified APIs for network compression technologies, such as low precision quantization, sparsity, pruning, knowledge distillation, across different deep learning frameworks to pursue optimal inference performance. -
View it on GitHub