Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
139 tokens/sec
GPT-4o
7 tokens/sec
Gemini 2.5 Pro Pro
46 tokens/sec
o3 Pro
4 tokens/sec
GPT-4.1 Pro
38 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

Open-World Semantic Segmentation Including Class Similarity (2403.07532v1)

Published 12 Mar 2024 in cs.CV

Abstract: Interpreting camera data is key for autonomously acting systems, such as autonomous vehicles. Vision systems that operate in real-world environments must be able to understand their surroundings and need the ability to deal with novel situations. This paper tackles open-world semantic segmentation, i.e., the variant of interpreting image data in which objects occur that have not been seen during training. We propose a novel approach that performs accurate closed-world semantic segmentation and, at the same time, can identify new categories without requiring any additional training data. Our approach additionally provides a similarity measure for every newly discovered class in an image to a known category, which can be useful information in downstream tasks such as planning or mapping. Through extensive experiments, we show that our model achieves state-of-the-art results on classes known from training data as well as for anomaly segmentation and can distinguish between different unknown classes.

Definition Search Book Streamline Icon: https://streamlinehq.com
References (77)
  1. Maskomaly: Zero-shot mask anomaly segmentation. In Proc. of British Machine Vision Conference (BMVC), 2023.
  2. Towards open set deep networks. In Proc. of the IEEE/CVF Conf. on Computer Vision and Pattern Recognition (CVPR), 2016.
  3. Uninformed Students: Student-Teacher Anomaly Detection With Discriminative Latent Embeddings. In Proc. of the IEEE/CVF Conf. on Computer Vision and Pattern Recognition (CVPR), 2020.
  4. Triggering Failures: Out-Of-Distribution detection by learning from local adversarial attacks in Semantic Segmentation. In Proc. of the IEEE/CVF Intl. Conf. on Computer Vision (ICCV), 2021.
  5. Simultaneous semantic segmentation and outlier detection in presence of domain shift. Pattern Recognition, 11824:33–47, 2019.
  6. The Fishyscapes Benchmark: Measuring blind spots in semantic segmentation. Intl. Journal of Computer Vision (IJCV), 129:3119–3135, 2021.
  7. Inverseform: A loss function for structured boundary-aware segmentation. In Proc. of the IEEE/CVF Conf. on Computer Vision and Pattern Recognition (CVPR), 2021.
  8. Weightless neural networks for open set recognition. Machine Learning, 106(9-10):1547–1567, 2017.
  9. SegmentMeIfYouCan: A Benchmark for Anomaly Segmentation. In Proc. of the Conf. on Neural Information Processing Systems (NeurIPS), 2021a.
  10. Entropy maximization and meta classification for out-of-distribution detection in semantic segmentation. In Proc. of the IEEE/CVF Conf. on Computer Vision and Pattern Recognition (CVPR), 2021b.
  11. A Simple Framework for Contrastive Learning of Visual Representations. In Proc. of the Intl. Conf. on Machine Learning (ICML), 2020.
  12. Weakly and semi-supervised detection, segmentation and tracking of table grapes with limited and noisy data. Computers and Electronics in Agriculture, 205:107624, 2023.
  13. The Cityscapes dataset for semantic urban scene understanding. In Proc. of the IEEE Conf. on Computer Vision and Pattern Recognition (CVPR), 2016.
  14. Learning confidence for out-of-distribution detection in neural networks. arXiv preprint, arXiv:1802.04865, 2018.
  15. Reducing network agnostophobia. In Proc. of the Conf. on Neural Information Processing Systems (NeurIPS), 2018.
  16. Dropout as a Bayesian Approximation: Representing model uncertainty in deep learning. In Proc. of the Intl. Conf. on Machine Learning (ICML), 2016.
  17. Ross Girshick. Fast R-CNN. In Proc. of the IEEE/CVF Intl. Conf. on Computer Vision (ICCV), 2015.
  18. Rich feature hierarchies for accurate object detection and semantic segmentation. In Proc. of the IEEE Conf. on Computer Vision and Pattern Recognition (CVPR), 2014.
  19. Densehybrid: Hybrid anomaly detection for dense open-set recognition. In Proc. of the Europ. Conf. on Computer Vision (ECCV), 2022.
  20. Disadvantages of using the area under the receiver operating characteristic curve to assess imaging tests: a discussion and proposal for an alternative approach. European Radiology, 25:932–939, 2015.
  21. Learning to discover novel visual categories via deep transfer clustering. In Proc. of the IEEE/CVF Conf. on Computer Vision and Pattern Recognition (CVPR), 2019.
  22. Deep residual learning for image recognition. In Proc. of the IEEE Conf. on Computer Vision and Pattern Recognition (CVPR), 2016.
  23. A baseline for detecting misclassified and out-of-distribution examples in neural networks. In Proc. of the Intl. Conf. on Learning Representations (ICLR), 2017.
  24. Scaling out-of-distribution detection for real-world settings. Proc. of the Intl. Conf. on Machine Learning (ICML), 2022.
  25. Bidirectional projection network for cross dimension scene understanding. In Proc. of the IEEE/CVF Conf. on Computer Vision and Pattern Recognition (CVPR), 2021.
  26. Beyond auroc & co. for evaluating out-of-distribution detection performance. In Proc. of the IEEE/CVF Conf. on Computer Vision and Pattern Recognition (CVPR), 2023.
  27. WinCLIP: Zero-/Few-Shot Anomaly Classification and Segmentation. In Proc. of the IEEE/CVF Conf. on Computer Vision and Pattern Recognition (CVPR), 2023.
  28. Adam: A Method for Stochastic Optimization. In Proc. of the Intl. Conf. on Learning Representations (ICLR), 2015.
  29. Panoptic Feature Pyramid Networks. In Proc. of the IEEE/CVF Conf. on Computer Vision and Pattern Recognition (CVPR), 2019a.
  30. Panoptic Segmentation. In Proc. of the IEEE/CVF Conf. on Computer Vision and Pattern Recognition (CVPR), 2019b.
  31. Opengan: Open-set recognition via open data generation. In Proc. of the IEEE/CVF Conf. on Computer Vision and Pattern Recognition (CVPR), 2021.
  32. Virtual multi-view fusion for 3d semantic segmentation. In Proc. of the Europ. Conf. on Computer Vision (ECCV), 2020.
  33. Simple and scalable predictive uncertainty estimation using deep ensembles. Proc. of the Conf. on Neural Information Processing Systems (NeurIPS), 2017.
  34. Enhancing the reliability of out-of-distribution image detection in neural networks. In Proc. of the Intl. Conf. on Learning Representations (ICLR), 2018.
  35. Microsoft COCO: Common objects in context. In Proc. of the Europ. Conf. on Computer Vision (ECCV), 2014.
  36. Detecting road obstacles by erasing them. IEEE Trans. on Pattern Analysis and Machine Intelligence (TPAMI), 2024.
  37. Future Frame Prediction for Anomaly Detection – A New Baseline. In Proc. of the IEEE/CVF Conf. on Computer Vision and Pattern Recognition (CVPR), 2018.
  38. Energy-based out-of-distribution detection. In Proc. of the Conf. on Neural Information Processing Systems (NeurIPS), 2020.
  39. Diversity-Measurable Anomaly Detection. In Proc. of the IEEE/CVF Conf. on Computer Vision and Pattern Recognition (CVPR), 2023.
  40. Fully Convolutional Networks for Semantic Segmentation. In Proc. of the IEEE Conf. on Computer Vision and Pattern Recognition (CVPR), 2015.
  41. Pixel-wise gradient uncertainty for convolutional neural networks applied to out-of-distribution segmentation. arXiv preprint, arXiv:2303.06920, 2023.
  42. High precision leaf instance segmentation for phenotyping in point clouds obtained under real field conditions. IEEE Robotics and Automation Letters (RA-L), 2023.
  43. Bonnet: An Open-Source Training and Deployment Framework for Semantic Segmentation in Robotics using CNNs. In Proc. of the IEEE Intl. Conf. on Robotics & Automation (ICRA), 2019.
  44. Real-time Semantic Segmentation of Crop and Weed for Precision Agriculture Robots Leveraging Background Knowledge in CNNs. In Proc. of the IEEE Intl. Conf. on Robotics & Automation (ICRA), 2018.
  45. Fast Instance and Semantic Segmentation Exploiting Local Connectivity, Metric Learning, and One-Shot Detection for Robotics. In Proc. of the IEEE Intl. Conf. on Robotics & Automation (ICRA), 2019.
  46. Confidence prediction for lexicon-free ocr. In Proc. of the IEEE Winter Conf. on Applications of Computer Vision (WACV), 2018.
  47. Segmenting unknown regions rejected by all. In Proc. of the IEEE/CVF Intl. Conf. on Computer Vision (ICCV), 2023.
  48. Deep neural networks are easily fooled: High confidence predictions for unrecognizable images. In Proc. of the IEEE/CVF Conf. on Computer Vision and Pattern Recognition (CVPR), 2015.
  49. In Defense of Pre-Trained ImageNet Architectures for Real-Time Semantic Segmentation of Road-Driving Images. In Proc. of the IEEE/CVF Conf. on Computer Vision and Pattern Recognition (CVPR), 2019.
  50. Learning transferable visual models from natural language supervision. In Proc. of the Intl. Conf. on Machine Learning (ICML), 2021.
  51. DenseCLIP: Language-Guided Dense Prediction With Context-Aware Prompting. In Proc. of the IEEE/CVF Conf. on Computer Vision and Pattern Recognition (CVPR), 2022.
  52. Faster R-CNN: Towards real-time object detection with region proposal networks. In Proc. of the Advances in Neural Information Processing Systems (NIPS), 2015.
  53. Hierarchical Approach for Joint Semantic, Plant Instance, and Leaf Instance Segmentation in the Agricultural Domain. Proc. of the IEEE Intl. Conf. on Robotics & Automation (ICRA), 2023.
  54. Erfnet: Efficient residual factorized convnet for real-time semantic segmentation. IEEE Trans. on Intelligent Transportation Systems (T-ITS), 19(1):263–272, 2018.
  55. Towards Total Recall in Industrial Anomaly Detection. In Proc. of the IEEE/CVF Conf. on Computer Vision and Pattern Recognition (CVPR), 2022.
  56. Stuart J. Russell. Artificial intelligence a modern approach. Pearson Education, Inc., 2010.
  57. Bayesian Nonparametric Submodular Video Partition for Robust Anomaly Detection. In Proc. of the IEEE/CVF Conf. on Computer Vision and Pattern Recognition (CVPR), 2022.
  58. Super-convergence: Very fast training of neural networks using large learning rates. Artificial Intelligence and Machine Learning for Multi-Domain Operations Applications, 11006:369–386, 2019.
  59. Robust Double-Encoder Network for RGB-D Panoptic Segmentation. Proc. of the IEEE Intl. Conf. on Robotics & Automation (ICRA), 2023.
  60. Hierarchical Semantic Contrast for Scene-Aware Video Anomaly Detection. In Proc. of the IEEE/CVF Conf. on Computer Vision and Pattern Recognition (CVPR), 2023.
  61. A deeper look at dataset bias. Domain Adaptation in Computer Vision Applications, pages 37–55, 2017.
  62. Multi-scale patch-based representation learning for image anomaly detection and segmentation. In Proc. of the IEEE Winter Conf. on Applications of Computer Vision (WACV), 2022.
  63. Open-set recognition: A good closed-set classifier is all you need. In Proc. of the Intl. Conf. on Learning Representations (ICLR), 2021.
  64. Out-of-distribution detection using an ensemble of self supervised leave-out classifiers. In Proc. of the Europ. Conf. on Computer Vision (ECCV), 2018.
  65. Internimage: Exploring large-scale vision foundation models with deformable convolutions. In Proc. of the IEEE/CVF Conf. on Computer Vision and Pattern Recognition (CVPR), 2023.
  66. Openauc: Towards auc-oriented open-set recognition. Proc. of the Conf. on Neural Information Processing Systems (NeurIPS), 2022.
  67. PhenoBench–A Large Dataset and Benchmarks for Semantic Image Interpretation in the Agricultural Domain. arXiv preprint, arXiv:2306.04557, 2023.
  68. Segformer: Simple and efficient design for semantic segmentation with transformers. In Proc. of the Conf. on Neural Information Processing Systems (NeurIPS), 2021.
  69. Video Event Restoration Based on Keyframes for Video Anomaly Detection. In Proc. of the IEEE/CVF Conf. on Computer Vision and Pattern Recognition (CVPR), 2023.
  70. Explicit Boundary Guided Semi-Push-Pull Contrastive Learning for Supervised Anomaly Detection. In Proc. of the IEEE/CVF Conf. on Computer Vision and Pattern Recognition (CVPR), 2023.
  71. BDD100K: A diverse driving dataset for heterogeneous multitask learning. In Proc. of the IEEE/CVF Conf. on Computer Vision and Pattern Recognition (CVPR), 2020.
  72. Draem-a discriminatively trained reconstruction embedding for surface anomaly detection. In Proc. of the IEEE/CVF Intl. Conf. on Computer Vision (ICCV), 2021.
  73. Exploiting Completeness and Uncertainty of Pseudo Labels for Weakly Supervised Video Anomaly Detection. In Proc. of the IEEE/CVF Conf. on Computer Vision and Pattern Recognition (CVPR), 2023a.
  74. DeSTSeg: Segmentation Guided Denoising Student-Teacher for Anomaly Detection. In Proc. of the IEEE/CVF Conf. on Computer Vision and Pattern Recognition (CVPR), 2023b.
  75. Pyramid Scene Parsing Network. In Proc. of the IEEE/CVF Conf. on Computer Vision and Pattern Recognition (CVPR), 2017.
  76. Ying Zhao. OmniAL: A Unified CNN Framework for Unsupervised Anomaly Localization. In Proc. of the IEEE/CVF Conf. on Computer Vision and Pattern Recognition (CVPR), 2023.
  77. RegionCLIP: Region-Based Language-Image Pretraining. In Proc. of the IEEE/CVF Conf. on Computer Vision and Pattern Recognition (CVPR), 2022.
Citations (3)

Summary

We haven't generated a summary for this paper yet.