Universality of two-step neural embedding kernels for ACMMD on probability measures of sequences
Determine whether the kernel on the space of probability measures over sequences k_{P(𝒴)}(q, q') = exp(-(1/(2σ^2)) MMD^2(q, q')) constructed using the two-step sequence kernel k_𝒴 that averages per-residue embeddings from the ESM-2 sequence model (and the analogous structural embeddings from GearNet) satisfies the conditions of Proposition 4 (vanishing at infinity and discrete mass property for k_𝒴) so that k_{P(𝒴)} is C0-universal on 𝒫(𝒴).
References
Whether \cref{prop:universal_kernel_on_PYS} holds for these kernels is an open question, but we find that they perform well in our experiments.
— Kernel-Based Evaluation of Conditional Biological Sequence Models
(2510.15601 - Glaser et al., 17 Oct 2025) in Section 6.2 (Choice of kernel)