Simplicity in Complexity: Explaining Visual Complexity using Deep Segmentation Models (2403.03134v3)
Abstract: The complexity of visual stimuli plays an important role in many cognitive phenomena, including attention, engagement, memorability, time perception, and aesthetic evaluation. Despite its importance, complexity is poorly understood and, ironically, previous models of image complexity have been quite complex. There have been many attempts to find handcrafted features that explain complexity, but these features are usually dataset-specific and hence fail to generalise. More recent work has employed deep neural networks to predict complexity, but these models remain difficult to interpret and do not guide a theoretical understanding of the problem. Here we propose to model complexity using segment-based representations of images. We use two state-of-the-art segmentation models: SAM to quantify the number of segments at multiple granularities, and FC-CLIP to quantify the number of classes in an image. We find that complexity is well explained by a simple linear model with these two features across six diverse image sets of naturalistic scenes and art images. This suggests that the complexity of images can be surprisingly simple.
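The two-feature linear model described in the abstract can be illustrated with a minimal sketch. It assumes the segment counts (from SAM) and class counts (from FC-CLIP) have already been computed for each image. The function names, the plain least-squares fit, and all numbers are hypothetical placeholders, not the authors' implementation or data, and any feature transforms the paper may apply are omitted.

```python
import numpy as np

def fit_complexity_model(num_segments, num_classes, ratings):
    """Least-squares fit of: rating ~ a * num_segments + b * num_classes + c.

    Hypothetical helper for illustration; returns the array [a, b, c].
    """
    # Design matrix: one column per feature plus a column of ones for the intercept.
    X = np.column_stack([num_segments, num_classes, np.ones(len(ratings))])
    coef, *_ = np.linalg.lstsq(X, np.asarray(ratings, dtype=float), rcond=None)
    return coef

def predict_complexity(coef, num_segments, num_classes):
    """Apply the fitted linear model to new feature values."""
    a, b, c = coef
    return (a * np.asarray(num_segments, dtype=float)
            + b * np.asarray(num_classes, dtype=float)
            + c)

# Placeholder feature values (NOT from the paper): one entry per image.
segments = np.array([12, 40, 85, 150, 230, 310])  # assumed SAM segment counts
classes = np.array([2, 5, 4, 9, 12, 7])           # assumed FC-CLIP class counts

# Synthetic "ratings" built from a known linear rule, so the fit can be checked.
ratings = 0.2 * segments + 3.0 * classes + 5.0

coef = fit_complexity_model(segments, classes, ratings)
predicted = predict_complexity(coef, segments, classes)
```

Because the ratings here are generated from an exact linear rule, the fit recovers the generating coefficients. With real human ratings, the quality of fit would instead be assessed with a correlation or held-out error measure.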