Question about the size of the quantized model

After int4 quantization by basic_quant_mix.py, the qwen2-32b model changed from 62G to 52G, which is too large.
Understood that the quantitative model has saved EETQ.
How can we save only the quantization model without saving EETQ.