Benchmarking and Analysis of Unsupervised Object Segmentation from Real-world Single Images (2312.04947v1)
Abstract: In this paper, we study the problem of unsupervised object segmentation from single images. We do not introduce a new algorithm, but systematically investigate the effectiveness of existing unsupervised models on challenging real-world images. We first introduce seven complexity factors to quantitatively measure the distributions of background and foreground object biases in appearance and geometry for datasets with human annotations. With the aid of these factors, we empirically find that, not surprisingly, existing unsupervised models fail to segment generic objects in real-world images, although they can easily achieve excellent performance on numerous simple synthetic datasets, due to the vast gap in objectness biases between synthetic and real images. By conducting extensive experiments on multiple groups of ablated real-world datasets, we ultimately find that the key factors underlying the failure of existing unsupervised models on real-world images are the challenging distributions of background and foreground object biases in appearance and geometry. Because of this, the inductive biases introduced in existing unsupervised models can hardly capture the diverse object distributions. Our research results suggest that future work should exploit more explicit objectness biases in the network design.
- Arandjelovic R, Zisserman A (2019) Object Discovery with a Copy-Pasting GAN. arXiv:190511369
- Bielski A, Favaro P (2019) Emergence of object segmentation in perturbed generative models. NeurIPS
- Crawford E, Pineau J (2019) Spatially Invariant Unsupervised Object Detection with Convolutional Neural Networks. AAAI
- Eddins S (2011) Binary image convex hull. https://blogsmathworkscom/steve/2011/10/04/binary-image-convex-hull-algorithm-notes/
- Huang J, Murphy K (2016) Efficient inference in occlusion-aware generative models of images. ICLR Workshops
- Jiang J, Ahn S (2020) Generative neurosymbolic machines. NeurIPS
- Kingma DP, Welling M (2014) Auto-Encoding Variational Bayes. ICLR
- Piper J, Granum E (1987) Computing distance transformations in convex and non-convex domains. Pattern recognition 20(6):599–615
- Polsby DD, Popper RD (1991) The third criterion: Compactness as a procedural safeguard against partisan gerrymandering. Yale L & Pol’y Rev 9
- Rand WM (1971) Objective criteria for the evaluation of clustering methods. Journal of the American Statistical Association 66(336):846–850
- Rezende DJ, Viola F (2018) Taming vaes. arXiv:181000597
- Shannon CE (1948) A mathematical theory of communication. The Bell system technical journal 27(3):379–423
- Sobel I, Feldman G (1973) A 3x3 Isotropic Gradient Operator for Image Processing. Pattern Classification and Scene Analysis pp 271–272
- Song Z, Yang B (2022) OGC: Unsupervised 3D Object Segmentation from Rigid Dynamics of Point Clouds. NeurIPS
- Wertheimer M (1923) Untersuchungen zur Lehre yon der Gestalt. Psychologische Forschung
- Yang Y, Yang B (2022) Promising or Elusive? Unsupervised Object Segmentation from Real-world Single Images. NeurIPS