An Interpretable Evaluation of Entropy-based Novelty of Generative Models (2402.17287v2)
Abstract: The rapid development of generative model frameworks calls for principled methods to evaluate a model's novelty relative to a reference dataset. While the literature has extensively studied the evaluation of the quality, diversity, and generalizability of generative models, the assessment of a model's novelty compared to a reference model has not been adequately explored in the machine learning community. In this work, we focus on novelty assessment for multi-modal distributions and address the following differential clustering task: given samples of a generative model $P_\mathcal{G}$ and a reference model $P_\mathrm{ref}$, how can we discover the sample types that $P_\mathcal{G}$ expresses more frequently than $P_\mathrm{ref}$? We introduce a spectral approach to the differential clustering task and propose the Kernel-based Entropic Novelty (KEN) score to quantify the mode-based novelty of $P_\mathcal{G}$ with respect to $P_\mathrm{ref}$. We analyze the KEN score for mixture distributions with well-separable components and develop a kernel-based method to compute the KEN score from empirical data. We support the KEN framework with numerical results on synthetic and real image datasets, which indicate the framework's effectiveness in detecting novel modes and comparing generative models. The paper's code is available at: www.github.com/buyeah1109/KEN
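To make the spectral idea in the abstract concrete, the following is a minimal sketch and not the authors' implementation (see the repository linked above for that). It assumes a Gaussian kernel, a frequency ratio `eta`, and one plausible reading of an "entropic" score: take the positive eigenvalues $\lambda_i$ of the difference of kernel covariance operators $C_X - \eta C_Y$ and compute $\sum_i \lambda_i \log(S/\lambda_i)$ with $S = \sum_i \lambda_i$. The function names, the bandwidth `sigma`, the tolerance `tol`, and the exact normalization are assumptions introduced for illustration; the paper's definition may differ.

```python
import numpy as np

def gaussian_kernel(A, B, sigma):
    """Gaussian (RBF) kernel matrix between the rows of A and the rows of B."""
    sq = (A**2).sum(1)[:, None] + (B**2).sum(1)[None, :] - 2.0 * A @ B.T
    return np.exp(-np.maximum(sq, 0.0) / (2.0 * sigma**2))

def positive_spectrum(X, Y, eta=1.0, sigma=1.0, tol=1e-8):
    """Positive eigenvalues of C_X - eta * C_Y, computed from kernel matrices.

    X: samples of the generative model, Y: samples of the reference model.
    (Hypothetical helper; the names and the tolerance are assumptions.)
    """
    n, m = len(X), len(Y)
    Z = np.vstack([X, Y])
    # Weights so that K_tilde is the Gram matrix of the scaled feature vectors
    # phi(x_i)/sqrt(n) and sqrt(eta/m) * phi(y_j).
    w = np.concatenate([np.full(n, 1.0 / np.sqrt(n)),
                        np.full(m, np.sqrt(eta / m))])
    K_tilde = gaussian_kernel(Z, Z, sigma) * np.outer(w, w)
    # Factor K_tilde = L @ L.T using its eigendecomposition.
    s, U = np.linalg.eigh(K_tilde)
    L = U * np.sqrt(np.clip(s, 0.0, None))
    # Signs: +1 for generated samples, -1 for reference samples.
    d = np.concatenate([np.ones(n), -np.ones(m)])
    # L.T @ D @ L is symmetric and shares its nonzero eigenvalues
    # with C_X - eta * C_Y.
    M = L.T @ (d[:, None] * L)
    lam = np.linalg.eigvalsh((M + M.T) / 2.0)
    return lam[lam > tol]

def ken_score_sketch(X, Y, eta=1.0, sigma=1.0):
    """Entropy-style score over the positive eigenvalues (assumed form)."""
    lam = positive_spectrum(X, Y, eta, sigma)
    if lam.size == 0:
        return 0.0
    S = lam.sum()
    return float(np.sum(lam * np.log(S / lam)))

# Toy data: the "generator" covers a mode at 0 and a novel mode at 10,
# while the reference only covers the mode at 0.
rng = np.random.default_rng(0)
X = np.vstack([rng.normal(0.0, 0.1, size=(50, 2)),
               rng.normal(10.0, 0.1, size=(50, 2))])
Y = rng.normal(0.0, 0.1, size=(100, 2))

lam = positive_spectrum(X, Y, eta=1.0, sigma=1.0)
score = ken_score_sketch(X, Y, eta=1.0, sigma=1.0)
print("positive eigenvalue mass:", lam.sum(), "score:", score)
```

In this toy setting, the positive eigenvalue mass is concentrated on the mode near 10, which the reference samples do not cover; the corresponding eigenvectors are what a spectral differential-clustering method would inspect to identify the novel sample types.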