Exploration of quantization bit-widths in Ditto
Investigate alternative quantization bit-widths beyond the 3-bit default for the clustering-based weight quantization component within Ditto, the proposed framework for compiling Code LLMs into lightweight executables.
References
We leave the exploration of different bitwidths to future work.
— Compiling Code LLMs into Lightweight Executables
(2603.29813 - Shi et al., 31 Mar 2026) in Section 4.1, Experimental Setup (Comparisons)