Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
158 tokens/sec
GPT-4o
7 tokens/sec
Gemini 2.5 Pro Pro
45 tokens/sec
o3 Pro
4 tokens/sec
GPT-4.1 Pro
38 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

AdapNet: Adaptive Noise-Based Network for Low-Quality Image Retrieval (2405.17718v1)

Published 28 May 2024 in cs.CV and cs.LG

Abstract: Image retrieval aims to identify visually similar images within a database using a given query image. Traditional methods typically employ both global and local features extracted from images for matching, and may also apply re-ranking techniques to enhance accuracy. However, these methods often fail to account for the noise present in query images, which can stem from natural or human-induced factors, thereby negatively impacting retrieval performance. To mitigate this issue, we introduce a novel setting for low-quality image retrieval, and propose an Adaptive Noise-Based Network (AdapNet) to learn robust abstract representations. Specifically, we devise a quality compensation block trained to compensate for various low-quality factors in input images. Besides, we introduce an innovative adaptive noise-based loss function, which dynamically adjusts its focus on the gradient in accordance with image quality, thereby augmenting the learning of unknown noisy samples during training and enhancing intra-class compactness. To assess the performance, we construct two datasets with low-quality queries, which is built by applying various types of noise on clean query images on the standard Revisited Oxford and Revisited Paris datasets. Comprehensive experimental results illustrate that AdapNet surpasses state-of-the-art methods on the Noise Revisited Oxford and Noise Revisited Paris benchmarks, while maintaining competitive performance on high-quality datasets. The code and constructed datasets will be made available.

Definition Search Book Streamline Icon: https://streamlinehq.com
References (38)
  1. Netvlad: Cnn architecture for weakly supervised place recognition. In Proceedings of the IEEE conference on computer vision and pattern recognition, pages 5297–5307, 2016.
  2. Unifying deep local and global features for image search. In Computer Vision–ECCV 2020: 16th European Conference, Glasgow, UK, August 23–28, 2020, Proceedings, Part XX 16, pages 726–743. Springer, 2020.
  3. Data uncertainty learning in face recognition. In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pages 5710–5719, 2020.
  4. D2-net: A trainable cnn for joint detection and description of local features. arXiv preprint arXiv:1905.03561, 2019.
  5. Dfm: A performance baseline for deep feature matching. In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pages 4284–4293, 2021.
  6. Random sample consensus: a paradigm for model fitting with applications to image analysis and automated cartography. Communications of the ACM, 24(6):381–395, 1981.
  7. Deep residual learning for image recognition. In Proceedings of the IEEE conference on computer vision and pattern recognition, pages 770–778, 2016.
  8. Local descriptors optimized for average precision. In Proceedings of the IEEE conference on computer vision and pattern recognition, pages 596–605, 2018.
  9. Improving face recognition from hard samples via distribution distillation loss. In Computer Vision–ECCV 2020: 16th European Conference, Glasgow, UK, August 23–28, 2020, Proceedings, Part XXX 16, pages 138–154. Springer, 2020.
  10. Aggregating local descriptors into a compact image representation. In 2010 IEEE computer society conference on computer vision and pattern recognition, pages 3304–3311. IEEE, 2010.
  11. Aggregating local image descriptors into compact codes. IEEE transactions on pattern analysis and machine intelligence, 34(9):1704–1716, 2011.
  12. Adaface: Quality adaptive margin for face recognition. In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pages 18750–18759, 2022.
  13. Revisiting self-similarity: Structural embedding for image retrieval. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pages 23412–23421, 2023.
  14. Correlation verification for image retrieval. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pages 5374–5384, 2022.
  15. Unitsface: Unified threshold integrated sample-to-sample loss for face recognition. arXiv preprint arXiv:2311.02523, 2023.
  16. Spherical confidence learning for face recognition. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pages 15629–15637, 2021.
  17. Perceptual visual quality metrics: A survey. Journal of visual communication and image representation, 22(4):297–312, 2011.
  18. Sphereface: Deep hypersphere embedding for face recognition. In Proceedings of the IEEE conference on computer vision and pattern recognition, pages 212–220, 2017.
  19. David G Lowe. Distinctive image features from scale-invariant keypoints. International journal of computer vision, 60:91–110, 2004.
  20. Magface: A universal representation for face recognition and quality assessment. In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pages 14225–14234, 2021.
  21. A regularization by denoising super-resolution method based on genetic algorithms. Signal Processing: Image Communication, 99:116505, 2021.
  22. Large-scale image retrieval with attentive deep local features. In Proceedings of the IEEE international conference on computer vision, pages 3456–3465, 2017.
  23. Representation learning with contrastive predictive coding. arXiv preprint arXiv:1807.03748, 2018.
  24. Revisiting oxford and paris: Large-scale image retrieval benchmarking. In Proceedings of the IEEE conference on computer vision and pattern recognition, pages 5706–5715, 2018.
  25. Probabilistic face embeddings. In Proceedings of the IEEE/CVF International Conference on Computer Vision, pages 6902–6911, 2019.
  26. Local features and visual words emerge in activations. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pages 11651–11660, 2019.
  27. Image search with selective match kernels: aggregation across single and multiple images. International Journal of Computer Vision, 116:247–261, 2016.
  28. Learning and aggregating deep local descriptors for instance-level recognition. In Computer Vision–ECCV 2020: 16th European Conference, Glasgow, UK, August 23–28, 2020, Proceedings, Part I 16, pages 460–477. Springer, 2020.
  29. Learning super-features for image retrieval. arXiv preprint arXiv:2201.13182, 2022.
  30. Google landmarks dataset v2-a large-scale benchmark for instance-level recognition and retrieval. In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pages 2575–2584, 2020.
  31. Learning token-based representation for image retrieval. In Proceedings of the AAAI Conference on Artificial Intelligence, volume 36, pages 2703–2711, 2022.
  32. Dolg: Single-stage image retrieval with deep orthogonal fusion of local and global features. In Proceedings of the IEEE/CVF International conference on Computer Vision, pages 11772–11781, 2021.
  33. Two-stage discriminative re-ranking for large-scale landmark retrieval. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops, pages 1012–1013, 2020.
  34. Learning spatial-context-aware global visual feature representation for instance image retrieval. In Proceedings of the IEEE/CVF International Conference on Computer Vision, pages 11250–11259, 2023.
  35. Uncertainty modeling of contextual-connections between tracklets for unconstrained video-based face recognition. In Proceedings of the IEEE/CVF International Conference on Computer Vision, pages 703–712, 2019.
  36. Sift meets cnn: A decade survey of instance retrieval. IEEE transactions on pattern analysis and machine intelligence, 40(5):1224–1244, 2017.
  37. Tour the world: building a web-scale landmark recognition engine. In 2009 IEEE Conference on Computer Vision and Pattern Recognition, pages 1085–1092. IEEE, 2009.
  38. Coarse-to-fine: Learning compact discriminative representation for single-stage image retrieval. In Proceedings of the IEEE/CVF International Conference on Computer Vision, pages 11260–11269, 2023.

Summary

We haven't generated a summary for this paper yet.