This section describes the following:
- Palettization Overview: An overview of palettization (aka weight clustering) and considerations on model size and performance.
- Post-Training Palettization: Weight clustering compression of Core ML models using
- Training-Time Palettization : Weight clustering compression on PyTorch models with data and fine-tuning, using
Updated 4 months ago