Release Notes
MindSpore Golden Stick 1.0.0 Release Notes
Major Features and Improvements
The Post-Training Quantization algorithm
PTQsupports theGPTQquantization algorithm, which can quantize weight to 8 bit or 4 bit.GPTQhas been added to the precision recovery algorithm set and can be selected throughprecision_recoveryinPTQConfig. Currently, only theGPTQalgorithm is available for precision recovery.The Post-Training Quantization algorithm
PTQsupports theAWQquantization algorithm. It enablesAWQby adding a new outlier suppression method to quantize the weights to 4 bits. The outlier suppression method can be selected throughoutliers_suppressioninPTQConfig, and currently, two methods are available:smoothandawq.The Post-Training Quantization algorithm
PTQsupports per-token dynamic quantization for activations, enabling online quantization of activations. This can be configured throughPTQConfigwithact_quant_granularity=QuantGranularity.PER_TOKEN.
API Change
The quantization methods
RoundToNearestandSmoothQuantare deprecated. Please replace them withPTQ.
Contributors
Thanks goes to these wonderful people:
huangzhuo, zhangminli, ccsszz, yyyyrf, hangangqiang
Contributions of any kind are welcome!