Open-World Semantic Segmentation Including Class Similarity (2403.07532v1)
Abstract: Interpreting camera data is key for autonomously acting systems, such as autonomous vehicles. Vision systems that operate in real-world environments must be able to understand their surroundings and need the ability to deal with novel situations. This paper tackles open-world semantic segmentation, i.e., the variant of interpreting image data in which objects occur that have not been seen during training. We propose a novel approach that performs accurate closed-world semantic segmentation and, at the same time, can identify new categories without requiring any additional training data. Our approach additionally provides a similarity measure for every newly discovered class in an image to a known category, which can be useful information in downstream tasks such as planning or mapping. Through extensive experiments, we show that our model achieves state-of-the-art results on classes known from training data as well as for anomaly segmentation and can distinguish between different unknown classes.
- Maskomaly: Zero-shot mask anomaly segmentation. In Proc. of British Machine Vision Conference (BMVC), 2023.
- Towards open set deep networks. In Proc. of the IEEE/CVF Conf. on Computer Vision and Pattern Recognition (CVPR), 2016.
- Uninformed Students: Student-Teacher Anomaly Detection With Discriminative Latent Embeddings. In Proc. of the IEEE/CVF Conf. on Computer Vision and Pattern Recognition (CVPR), 2020.
- Triggering Failures: Out-Of-Distribution detection by learning from local adversarial attacks in Semantic Segmentation. In Proc. of the IEEE/CVF Intl. Conf. on Computer Vision (ICCV), 2021.
- Simultaneous semantic segmentation and outlier detection in presence of domain shift. Pattern Recognition, 11824:33–47, 2019.
- The Fishyscapes Benchmark: Measuring blind spots in semantic segmentation. Intl. Journal of Computer Vision (IJCV), 129:3119–3135, 2021.
- Inverseform: A loss function for structured boundary-aware segmentation. In Proc. of the IEEE/CVF Conf. on Computer Vision and Pattern Recognition (CVPR), 2021.
- Weightless neural networks for open set recognition. Machine Learning, 106(9-10):1547–1567, 2017.
- SegmentMeIfYouCan: A Benchmark for Anomaly Segmentation. In Proc. of the Conf. on Neural Information Processing Systems (NeurIPS), 2021a.
- Entropy maximization and meta classification for out-of-distribution detection in semantic segmentation. In Proc. of the IEEE/CVF Conf. on Computer Vision and Pattern Recognition (CVPR), 2021b.
- A Simple Framework for Contrastive Learning of Visual Representations. In Proc. of the Intl. Conf. on Machine Learning (ICML), 2020.
- Weakly and semi-supervised detection, segmentation and tracking of table grapes with limited and noisy data. Computers and Electronics in Agriculture, 205:107624, 2023.
- The Cityscapes dataset for semantic urban scene understanding. In Proc. of the IEEE Conf. on Computer Vision and Pattern Recognition (CVPR), 2016.
- Learning confidence for out-of-distribution detection in neural networks. arXiv preprint, arXiv:1802.04865, 2018.
- Reducing network agnostophobia. In Proc. of the Conf. on Neural Information Processing Systems (NeurIPS), 2018.
- Dropout as a Bayesian Approximation: Representing model uncertainty in deep learning. In Proc. of the Intl. Conf. on Machine Learning (ICML), 2016.
- Ross Girshick. Fast R-CNN. In Proc. of the IEEE/CVF Intl. Conf. on Computer Vision (ICCV), 2015.
- Rich feature hierarchies for accurate object detection and semantic segmentation. In Proc. of the IEEE Conf. on Computer Vision and Pattern Recognition (CVPR), 2014.
- Densehybrid: Hybrid anomaly detection for dense open-set recognition. In Proc. of the Europ. Conf. on Computer Vision (ECCV), 2022.
- Disadvantages of using the area under the receiver operating characteristic curve to assess imaging tests: a discussion and proposal for an alternative approach. European Radiology, 25:932–939, 2015.
- Learning to discover novel visual categories via deep transfer clustering. In Proc. of the IEEE/CVF Conf. on Computer Vision and Pattern Recognition (CVPR), 2019.
- Deep residual learning for image recognition. In Proc. of the IEEE Conf. on Computer Vision and Pattern Recognition (CVPR), 2016.
- A baseline for detecting misclassified and out-of-distribution examples in neural networks. In Proc. of the Intl. Conf. on Learning Representations (ICLR), 2017.
- Scaling out-of-distribution detection for real-world settings. Proc. of the Intl. Conf. on Machine Learning (ICML), 2022.
- Bidirectional projection network for cross dimension scene understanding. In Proc. of the IEEE/CVF Conf. on Computer Vision and Pattern Recognition (CVPR), 2021.
- Beyond auroc & co. for evaluating out-of-distribution detection performance. In Proc. of the IEEE/CVF Conf. on Computer Vision and Pattern Recognition (CVPR), 2023.
- WinCLIP: Zero-/Few-Shot Anomaly Classification and Segmentation. In Proc. of the IEEE/CVF Conf. on Computer Vision and Pattern Recognition (CVPR), 2023.
- Adam: A Method for Stochastic Optimization. In Proc. of the Intl. Conf. on Learning Representations (ICLR), 2015.
- Panoptic Feature Pyramid Networks. In Proc. of the IEEE/CVF Conf. on Computer Vision and Pattern Recognition (CVPR), 2019a.
- Panoptic Segmentation. In Proc. of the IEEE/CVF Conf. on Computer Vision and Pattern Recognition (CVPR), 2019b.
- Opengan: Open-set recognition via open data generation. In Proc. of the IEEE/CVF Conf. on Computer Vision and Pattern Recognition (CVPR), 2021.
- Virtual multi-view fusion for 3d semantic segmentation. In Proc. of the Europ. Conf. on Computer Vision (ECCV), 2020.
- Simple and scalable predictive uncertainty estimation using deep ensembles. Proc. of the Conf. on Neural Information Processing Systems (NeurIPS), 2017.
- Enhancing the reliability of out-of-distribution image detection in neural networks. In Proc. of the Intl. Conf. on Learning Representations (ICLR), 2018.
- Microsoft COCO: Common objects in context. In Proc. of the Europ. Conf. on Computer Vision (ECCV), 2014.
- Detecting road obstacles by erasing them. IEEE Trans. on Pattern Analysis and Machine Intelligence (TPAMI), 2024.
- Future Frame Prediction for Anomaly Detection – A New Baseline. In Proc. of the IEEE/CVF Conf. on Computer Vision and Pattern Recognition (CVPR), 2018.
- Energy-based out-of-distribution detection. In Proc. of the Conf. on Neural Information Processing Systems (NeurIPS), 2020.
- Diversity-Measurable Anomaly Detection. In Proc. of the IEEE/CVF Conf. on Computer Vision and Pattern Recognition (CVPR), 2023.
- Fully Convolutional Networks for Semantic Segmentation. In Proc. of the IEEE Conf. on Computer Vision and Pattern Recognition (CVPR), 2015.
- Pixel-wise gradient uncertainty for convolutional neural networks applied to out-of-distribution segmentation. arXiv preprint, arXiv:2303.06920, 2023.
- High precision leaf instance segmentation for phenotyping in point clouds obtained under real field conditions. IEEE Robotics and Automation Letters (RA-L), 2023.
- Bonnet: An Open-Source Training and Deployment Framework for Semantic Segmentation in Robotics using CNNs. In Proc. of the IEEE Intl. Conf. on Robotics & Automation (ICRA), 2019.
- Real-time Semantic Segmentation of Crop and Weed for Precision Agriculture Robots Leveraging Background Knowledge in CNNs. In Proc. of the IEEE Intl. Conf. on Robotics & Automation (ICRA), 2018.
- Fast Instance and Semantic Segmentation Exploiting Local Connectivity, Metric Learning, and One-Shot Detection for Robotics. In Proc. of the IEEE Intl. Conf. on Robotics & Automation (ICRA), 2019.
- Confidence prediction for lexicon-free ocr. In Proc. of the IEEE Winter Conf. on Applications of Computer Vision (WACV), 2018.
- Segmenting unknown regions rejected by all. In Proc. of the IEEE/CVF Intl. Conf. on Computer Vision (ICCV), 2023.
- Deep neural networks are easily fooled: High confidence predictions for unrecognizable images. In Proc. of the IEEE/CVF Conf. on Computer Vision and Pattern Recognition (CVPR), 2015.
- In Defense of Pre-Trained ImageNet Architectures for Real-Time Semantic Segmentation of Road-Driving Images. In Proc. of the IEEE/CVF Conf. on Computer Vision and Pattern Recognition (CVPR), 2019.
- Learning transferable visual models from natural language supervision. In Proc. of the Intl. Conf. on Machine Learning (ICML), 2021.
- DenseCLIP: Language-Guided Dense Prediction With Context-Aware Prompting. In Proc. of the IEEE/CVF Conf. on Computer Vision and Pattern Recognition (CVPR), 2022.
- Faster R-CNN: Towards real-time object detection with region proposal networks. In Proc. of the Advances in Neural Information Processing Systems (NIPS), 2015.
- Hierarchical Approach for Joint Semantic, Plant Instance, and Leaf Instance Segmentation in the Agricultural Domain. Proc. of the IEEE Intl. Conf. on Robotics & Automation (ICRA), 2023.
- Erfnet: Efficient residual factorized convnet for real-time semantic segmentation. IEEE Trans. on Intelligent Transportation Systems (T-ITS), 19(1):263–272, 2018.
- Towards Total Recall in Industrial Anomaly Detection. In Proc. of the IEEE/CVF Conf. on Computer Vision and Pattern Recognition (CVPR), 2022.
- Stuart J. Russell. Artificial intelligence a modern approach. Pearson Education, Inc., 2010.
- Bayesian Nonparametric Submodular Video Partition for Robust Anomaly Detection. In Proc. of the IEEE/CVF Conf. on Computer Vision and Pattern Recognition (CVPR), 2022.
- Super-convergence: Very fast training of neural networks using large learning rates. Artificial Intelligence and Machine Learning for Multi-Domain Operations Applications, 11006:369–386, 2019.
- Robust Double-Encoder Network for RGB-D Panoptic Segmentation. Proc. of the IEEE Intl. Conf. on Robotics & Automation (ICRA), 2023.
- Hierarchical Semantic Contrast for Scene-Aware Video Anomaly Detection. In Proc. of the IEEE/CVF Conf. on Computer Vision and Pattern Recognition (CVPR), 2023.
- A deeper look at dataset bias. Domain Adaptation in Computer Vision Applications, pages 37–55, 2017.
- Multi-scale patch-based representation learning for image anomaly detection and segmentation. In Proc. of the IEEE Winter Conf. on Applications of Computer Vision (WACV), 2022.
- Open-set recognition: A good closed-set classifier is all you need. In Proc. of the Intl. Conf. on Learning Representations (ICLR), 2021.
- Out-of-distribution detection using an ensemble of self supervised leave-out classifiers. In Proc. of the Europ. Conf. on Computer Vision (ECCV), 2018.
- Internimage: Exploring large-scale vision foundation models with deformable convolutions. In Proc. of the IEEE/CVF Conf. on Computer Vision and Pattern Recognition (CVPR), 2023.
- Openauc: Towards auc-oriented open-set recognition. Proc. of the Conf. on Neural Information Processing Systems (NeurIPS), 2022.
- PhenoBench–A Large Dataset and Benchmarks for Semantic Image Interpretation in the Agricultural Domain. arXiv preprint, arXiv:2306.04557, 2023.
- Segformer: Simple and efficient design for semantic segmentation with transformers. In Proc. of the Conf. on Neural Information Processing Systems (NeurIPS), 2021.
- Video Event Restoration Based on Keyframes for Video Anomaly Detection. In Proc. of the IEEE/CVF Conf. on Computer Vision and Pattern Recognition (CVPR), 2023.
- Explicit Boundary Guided Semi-Push-Pull Contrastive Learning for Supervised Anomaly Detection. In Proc. of the IEEE/CVF Conf. on Computer Vision and Pattern Recognition (CVPR), 2023.
- BDD100K: A diverse driving dataset for heterogeneous multitask learning. In Proc. of the IEEE/CVF Conf. on Computer Vision and Pattern Recognition (CVPR), 2020.
- Draem-a discriminatively trained reconstruction embedding for surface anomaly detection. In Proc. of the IEEE/CVF Intl. Conf. on Computer Vision (ICCV), 2021.
- Exploiting Completeness and Uncertainty of Pseudo Labels for Weakly Supervised Video Anomaly Detection. In Proc. of the IEEE/CVF Conf. on Computer Vision and Pattern Recognition (CVPR), 2023a.
- DeSTSeg: Segmentation Guided Denoising Student-Teacher for Anomaly Detection. In Proc. of the IEEE/CVF Conf. on Computer Vision and Pattern Recognition (CVPR), 2023b.
- Pyramid Scene Parsing Network. In Proc. of the IEEE/CVF Conf. on Computer Vision and Pattern Recognition (CVPR), 2017.
- Ying Zhao. OmniAL: A Unified CNN Framework for Unsupervised Anomaly Localization. In Proc. of the IEEE/CVF Conf. on Computer Vision and Pattern Recognition (CVPR), 2023.
- RegionCLIP: Region-Based Language-Image Pretraining. In Proc. of the IEEE/CVF Conf. on Computer Vision and Pattern Recognition (CVPR), 2022.