Linear 8-Bit Quantization
This section describes optimizations using training-time and post-training quantization, and considerations on model size and performance: