Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
126 tokens/sec
GPT-4o
47 tokens/sec
Gemini 2.5 Pro Pro
43 tokens/sec
o3 Pro
4 tokens/sec
GPT-4.1 Pro
47 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

Measuring Diversity in Co-creative Image Generation (2403.13826v1)

Published 6 Mar 2024 in cs.CV, cs.CL, and cs.LG

Abstract: Quality and diversity have been proposed as reasonable heuristics for assessing content generated by co-creative systems, but to date there has been little agreement around what constitutes the latter or how to measure it. Proposed approaches for assessing generative models in terms of diversity have limitations in that they compare the model's outputs to a ground truth that in the era of large pre-trained generative models might not be available, or entail an impractical number of computations. We propose an alternative based on entropy of neural network encodings for comparing diversity between sets of images that does not require ground-truth knowledge and is easy to compute. We also compare two pre-trained networks and show how the choice relates to the notion of diversity that we want to evaluate. We conclude with a discussion of the potential applications of these measures for ideation in interactive systems, model evaluation, and more broadly within computational creativity.

Definition Search Book Streamline Icon: https://streamlinehq.com
References (28)
  1. Gans trained by a two time-scale update rule converge to a local nash equilibrium. Advances in neural information processing systems, 30, 2017.
  2. Analyzing the expressive range of a level generator. In Proceedings of the 2010 workshop on procedural content generation in games, pages 1–7, 2010.
  3. Searching for good and diverse game levels. In 2014 IEEE Conference on Computational Intelligence and Games, pages 1–8. IEEE, 2014.
  4. Affect-conditioned image generation. arXiv preprint arXiv:2302.09742, 2023.
  5. Donald A Schön. Designing as reflective conversation with the materials of a design situation. Knowledge-based systems, 5(1):3–14, 1992.
  6. Kees Dorst. Frame innovation: Create new thinking by design. MIT press, 2015.
  7. Surprise-triggered reformulation of design goals. In Proceedings of the AAAI Conference on Artificial Intelligence, volume 30, 2016.
  8. Jonas Oppenlaender. The creativity of text-to-image generation. In Proceedings of the 25th International Academic Mindtrek Conference, pages 192–202, 2022.
  9. Diversity in recommender systems. In Proceedings: The Fourth International Conference on Advances in Human-oriented and Personalized Mechanisms, Technologies, and Services. CENTRIC, pages 23–29, 2011.
  10. Diversity in recommender systems–a survey. Knowledge-based systems, 123:154–162, 2017.
  11. Quality diversity: A new frontier for evolutionary computation. Frontiers in Robotics and AI, 3:40, 2016.
  12. Seeding diversity into ai art. arXiv preprint arXiv:2205.00804, 2022.
  13. Creative discovery using quality-diversity search. In Proceedings of the Companion Conference on Genetic and Evolutionary Computation, pages 747–750, 2023.
  14. Transformational creativity through the lens of quality-diversity. In Proceedings of ICCC’23, 2023.
  15. Reliable fidelity and diversity metrics for generative models. In International Conference on Machine Learning, pages 7176–7185. PMLR, 2020.
  16. Transfer learning techniques for medical image analysis: A review. Biocybernetics and Biomedical Engineering, 42(1):79–107, 2022.
  17. Rethinking the inception architecture for computer vision. In Proceedings of the IEEE conference on computer vision and pattern recognition, pages 2818–2826, 2016.
  18. Margaret A Boden. The creative mind: Myths and mechanisms. Psychology Press, 2004.
  19. Lou Jost. Entropy and diversity. Oikos, 113(2):363–375, 2006.
  20. A collaborative, interactive and context-aware drawing agent for co-creative design. arXiv preprint arXiv:2209.12588, 2022.
  21. Learning transferable visual models from natural language supervision. In International Conference on Machine Learning, pages 8748–8763. PMLR, 2021.
  22. Claude Elwood Shannon. A mathematical theory of communication. ACM SIGMOBILE mobile computing and communications review, 5(1):3–55, 2001.
  23. Imagenet: A large-scale hierarchical image database. In 2009 IEEE conference on computer vision and pattern recognition, pages 248–255. Ieee, 2009.
  24. Text-based measures of document diversity. In Proceedings of the 19th ACM SIGKDD international conference on Knowledge discovery and data mining, pages 23–31, 2013.
  25. Language models are few-shot learners. Advances in neural information processing systems, 33:1877–1901, 2020.
  26. High-resolution image synthesis with latent diffusion models. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pages 10684–10695, 2022.
  27. Andy Stirling. A general framework for analysing diversity in science, technology and society. Journal of the Royal Society interface, 4(15):707–719, 2007.
  28. Maximilian Seitzer. pytorch-fid: FID Score for PyTorch. https://github.com/mseitzer/pytorch-fid, August 2020. Version 0.3.0.
Citations (1)

Summary

We haven't generated a summary for this paper yet.