High-dimensional behavior of CFG weighting toward private regions
Establish whether, for any dimension D ≥ 2 in masked discrete diffusion models with classifier-free guidance (CFG), the sampled distribution q_T^{z_1,w} under Assumption 1 (the full data distribution p is a mixture of class-conditional distributions {p(·|z_k)} with weights {a_k}) consistently places larger weights on more private regions of the target class z_1, where privacy is defined via non-overlap of supports and marginal supports with other classes, thereby leveraging the geometric information of the full data distribution.
References
We conjecture that the above fact is also true in high dimension: For any D≥2, discrete diffusion with CFG leverages the geometric information from the full data distribution. More specifically, under Assumption \ref{assup:full distribution}, the sampled distribution q_T{z_1,w} adapts the class distribution p(\cdot|z_1) by putting larger weights on more private regions of class z_1, where those regions with different privacy are defined based on the support sets and their marginals.