On the Estimation of Image-matching Uncertainty in Visual Place Recognition (2404.00546v1)
Abstract: In Visual Place Recognition (VPR) the pose of a query image is estimated by comparing the image to a map of reference images with known reference poses. As is typical for image retrieval problems, a feature extractor maps the query and reference images to a feature space, where a nearest neighbor search is then performed. However, till recently little attention has been given to quantifying the confidence that a retrieved reference image is a correct match. Highly certain but incorrect retrieval can lead to catastrophic failure of VPR-based localization pipelines. This work compares for the first time the main approaches for estimating the image-matching uncertainty, including the traditional retrieval-based uncertainty estimation, more recent data-driven aleatoric uncertainty estimation, and the compute-intensive geometric verification. We further formulate a simple baseline method, ``SUE'', which unlike the other methods considers the freely-available poses of the reference images in the map. Our experiments reveal that a simple L2-distance between the query and reference descriptors is already a better estimate of image-matching uncertainty than current data-driven approaches. SUE outperforms the other efficient uncertainty estimation methods, and its uncertainty estimates complement the computationally expensive geometric verification approach. Future works for uncertainty estimation in VPR should consider the baselines discussed in this work.
- Building rome in a day. Communications of the ACM, 54(10):105–112, 2011.
- Gsv-cities: Toward appropriate supervised visual place recognition. Neurocomputing, 513:194–203, 2022.
- Mixvpr: Feature mixing for visual place recognition. In Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision, pages 2998–3007, 2023.
- NetVLAD: CNN architecture for weakly supervised place recognition. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pages 5297–5307, 2016.
- Viewpoint invariant dense matching for visual geolocalization. In Proceedings of the IEEE/CVF International Conference on Computer Vision, pages 12169–12178, 2021.
- Rethinking visual geo-localization for large-scale applications. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pages 4878–4888, 2022a.
- Deep visual geo-localization benchmark. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pages 5396–5407, 2022b.
- Past, present, and future of simultaneous localization and mapping: Toward the robust-perception age. IEEE Transactions on Robotics, 32(6):1309–1332, 2016.
- STUN: Self-teaching uncertainty estimation for place recognition. In Proceedings of the IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS), pages 6614–6621. IEEE, 2022.
- City-scale landmark identification on mobile devices. In Proceedings of the IEEE International Conference on Computer Vision and Pattern Recognition, pages 737–744. IEEE, 2011.
- Mark Cummins. Highly scalable appearance-only slam-fab-map 2.0. In Proceedings of the Robotics: Sciences and Systems (RSS) Conference, 2009.
- Fab-map: Probabilistic localization and mapping in the space of appearance. The International Journal of Robotics Research, 27(6):647–665, 2008.
- Histograms of oriented gradients for human detection. In Proceedings of the IEEE International Conference on Computer Vision and Pattern Recognition, pages 886–893. IEEE, 2005.
- Using the condensation algorithm for robust, vision-based mobile robot localization. In Proceedings of the IEEE International Conference on Computer Vision and Pattern Recognition, pages 588–594. IEEE, 1999.
- Superpoint: Self-supervised interest point detection and description. In IEEE International Conference on Computer Vision and Pattern Recognition Workshops, pages 224–236, 2018.
- Where is your place, visual place recognition? In Proceedings of the Thirtieth International Joint Conference on Artificial Intelligence, 2021.
- Learning and calibrating per-location classifiers for visual place recognition. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pages 907–914, 2013.
- Unsupervised complementary-aware multi-process fusion for visual place recognition. arXiv preprint arXiv:2112.04701, 2021a.
- Patch-CNN: Multi-scale fusion of locally-global descriptors for place recognition. In Proceedings of the IEEE/CVF International Conference on Computer Vision and Pattern Recognition, pages 14141–14152, 2021b.
- A baseline for detecting misclassified and out-of-distribution examples in neural networks. In Proceedings of the International Conference on Learning Representations, 2016.
- Detecting loop closure with scene sequences. International Journal of Computer Vision, 74(3):261–286, 2007.
- Modelling uncertainty in deep learning for camera relocalization. In IEEE International Conference on Robotics and Automation (ICRA), pages 4762–4769. IEEE, 2016.
- What uncertainties do we need in Bayesian deep learning for computer vision? Advances in Neural Information Processing Systems, 30, 2017.
- Avoiding confusing features in place recognition. In Proceedings of the European Conference on Computer Vision, pages 748–761. Springer, 2010.
- Camera relocalization by computing pairwise relative poses using convolutional neural network. In IEEE International Conference on Computer Vision Workshops, pages 929–938, 2017.
- Generalized contrastive optimization of siamese networks for place recognition. arXiv preprint arXiv:2103.06638, 2021.
- David G Lowe. Distinctive image features from scale-invariant keypoints. International Journal of Computer Vision, 60(2):91–110, 2004.
- Visual place recognition: A survey. IEEE Transactions on Robotics, 32(1):1–19, 2015.
- Predictive uncertainty estimation via prior networks. Advances in Neural Information Processing Systems, 31, 2018.
- A survey on deep visual place recognition. IEEE Access, 9:19516–19547, 2021.
- Mapping a suburb with a single camera using a biologically inspired slam system. IEEE Transactions on Robotics, 24(5):1038–1053, 2008.
- Coordinet: uncertainty-aware pose regressor for reliable vehicle localization. In Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision, pages 2229–2238, 2022.
- Large-scale image retrieval with attentive deep local features. In Proceedings of the IEEE International Conference on Computer Vision, pages 3456–3465, 2017.
- Probabilistic regression of rotations using quaternion averaging and a deep multi-headed network. arXiv preprint arXiv:1904.03182, 2019.
- A survey on visual-based localization: On the benefit of heterogeneous data. Pattern Recognition, 74:90–109, 2018.
- Benchmarking image retrieval for visual localization. In International Conference on 3D Vision (3DV), pages 483–494. IEEE, 2020.
- Fine-tuning CNN image retrieval with no human annotation. IEEE Transactions on Pattern Analysis and Machine Intelligence, 41(7):1655–1668, 2018.
- Revisiting oxford and paris: Large-scale image retrieval benchmarking. In Proceedings of the IEEE International Conference on Computer Vision and Pattern Recognition, 2018.
- Learning with average precision: Training image retrieval with a listwise loss. In Proceedings of the IEEE International Conference on Computer Vision, pages 5107–5116, 2019.
- Large-scale location recognition and the geometric burstiness problem. In Proceedings of the IEEE International Conference on Computer Vision and Pattern Recognition, pages 1582–1590, 2016.
- Benchmarking 6dof outdoor visual localization in changing conditions. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pages 8601–8610, 2018.
- Mobile robot localization and mapping with uncertainty using scale-invariant visual landmarks. International Journal of Robotics Research, 21(8):735–758, 2002.
- Sindre Skrede. Nordland dataset. https://bit.ly/2QVBOym, 2013.
- Probabilistic place recognition with covisibility maps. In Proceedings of the IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS), pages 4158–4163. IEEE, 2013.
- Geometrically mappable image features. IEEE Robotics and Automation Letters, 5(2):2062–2069, 2020.
- Image search with selective match kernels: aggregation across single and multiple images. International Journal of Computer Vision, 116(3):247–261, 2016.
- Are large-scale 3d models really necessary for accurate visual localization? IEEE Transactions on Pattern Analysis and Machine Intelligence, 2019.
- TransVPR: Transformer-based place recognition with multi-level attention aggregation. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pages 13648–13657, 2022.
- Mapillary street-level sequences: A dataset for lifelong place recognition. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pages 2626–2635, 2020.
- Bayesian Triplet Loss: Uncertainty quantification in image retrieval. In Proceedings of the IEEE/CVF International Conference on Computer Vision, pages 12158–12168, 2021.
- Spatial pyramid-enhanced CNN with weighted triplet loss for place recognition. IEEE Transactions on Neural Networks and Learning Systems, 31(2):661–674, 2019.
- VPR-Bench: An open-source visual place recognition evaluation framework with quantifiable viewpoint and appearance change. International Journal of Computer Vision, 129(7):2136–2174, 2021.
- CoPR: Toward accurate visual localization with continuous place-descriptor regression. IEEE Transactions on Robotics, 2023.
- Accurate image localization based on google maps street view. In Proceedings of the European Conference on Computer Vision, pages 255–268. Springer, 2010.
- Camera pose voting for large-scale image-based localization. In Proceedings of the IEEE International Conference on Computer Vision, pages 2704–2712, 2015.
- Vector of locally and adaptively aggregated descriptors for image feature representation. Pattern Recognition, 116:107952, 2021.
- Visual place recognition in long-term and large-scale environment based on CNN feature. In IEEE Intelligent Vehicles Symposium (IV), pages 1679–1685. IEEE, 2018.