Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
102 tokens/sec
GPT-4o
59 tokens/sec
Gemini 2.5 Pro Pro
43 tokens/sec
o3 Pro
6 tokens/sec
GPT-4.1 Pro
50 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

Deep Hashing via Householder Quantization (2311.04207v3)

Published 7 Nov 2023 in cs.CV and cs.IR

Abstract: Hashing is at the heart of large-scale image similarity search, and recent methods have been substantially improved through deep learning techniques. Such algorithms typically learn continuous embeddings of the data. To avoid a subsequent costly binarization step, a common solution is to employ loss functions that combine a similarity learning term (to ensure similar images are grouped to nearby embeddings) and a quantization penalty term (to ensure that the embedding entries are close to binarized entries, e.g., -1 or 1). Still, the interaction between these two terms can make learning harder and the embeddings worse. We propose an alternative quantization strategy that decomposes the learning problem in two stages: first, perform similarity learning over the embedding space with no quantization; second, find an optimal orthogonal transformation of the embeddings so each coordinate of the embedding is close to its sign, and then quantize the transformed embedding through the sign function. In the second step, we parametrize orthogonal transformations using Householder matrices to efficiently leverage stochastic gradient descent. Since similarity measures are usually invariant under orthogonal transformations, this quantization strategy comes at no cost in terms of performance. The resulting algorithm is unsupervised, fast, hyperparameter-free and can be run on top of any existing deep hashing or metric learning algorithm. We provide extensive experimental results showing that this approach leads to state-of-the-art performance on widely used image datasets, and, unlike other quantization strategies, brings consistent improvements in performance to existing deep hashing algorithms.

Definition Search Book Streamline Icon: https://streamlinehq.com
References (58)
  1. P.A. Absil, R. Mahony and R. Sepulchre “Optimization Algorithms on Matrix Manifolds” Princeton University Press, 2009 URL: https://books.google.com.br/books?id=NSQGQeLN3NcC
  2. “Beyond locality-sensitive hashing” In Proceedings of the twenty-fifth annual ACM-SIAM symposium on Discrete algorithms, 2014, pp. 1018–1028 SIAM
  3. “Ensemble deep manifold similarity learning using hard proxies” In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2019, pp. 7299–7307
  4. “Signature verification using a" siamese" time delay neural network” In Advances in neural information processing systems 6, 1993
  5. “An updated table of minimum-distance bounds for binary linear codes” In IEEE Transactions on Information Theory 39.2, 1993, pp. 662–677 DOI: 10.1109/18.212301
  6. “Deep cauchy hashing for hamming space retrieval” In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2018, pp. 1229–1237
  7. “Hashnet: Deep learning to hash by continuation” In Proceedings of the IEEE international conference on computer vision, 2017, pp. 5608–5617
  8. “Deep priority hashing” In Proceedings of the 26th ACM international conference on Multimedia, 2018, pp. 1653–1661
  9. “Deep discrete hashing with pairwise correlation learning” In Neurocomputing 385 Elsevier, 2020, pp. 111–121
  10. Sumit Chopra, Raia Hadsell and Yann LeCun “Learning a similarity metric discriminatively, with application to face verification” In 2005 IEEE computer society conference on computer vision and pattern recognition (CVPR’05) 1, 2005, pp. 539–546 IEEE
  11. “Nus-wide: a real-world web image database from national university of singapore” In Proceedings of the ACM international conference on image and video retrieval, 2009, pp. 1–9
  12. “ImageNet: A large-scale hierarchical image database” In 2009 IEEE Conference on Computer Vision and Pattern Recognition, 2009, pp. 248–255 DOI: 10.1109/CVPR.2009.5206848
  13. “Imagenet: A large-scale hierarchical image database” In 2009 IEEE conference on computer vision and pattern recognition, 2009, pp. 248–255 Ieee
  14. Khoa D Doan, Peng Yang and Ping Li “One loss for quantization: Deep hashing with discrete wasserstein distributional matching” In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022, pp. 9447–9457
  15. “Deep Polarized Network for Supervised Learning of Accurate Binary Hashing Codes.” In IJCAI, 2020, pp. 825–831
  16. Aristides Gionis, Piotr Indyk and Rajeev Motwani “Similarity search in high dimensions via hashing” In Vldb 99.6, 1999, pp. 518–529
  17. G.H. Golub and C.F. Van Loan “Matrix Computations”, Johns Hopkins Studies in the Mathematical Sciences Johns Hopkins University Press, 1996 URL: https://books.google.com.br/books?id=mlOa7wPX6OYC
  18. “Iterative quantization: A procrustean approach to learning binary codes for large-scale image retrieval” In IEEE transactions on pattern analysis and machine intelligence 35.12 IEEE, 2012, pp. 2916–2929
  19. “One loss for all: Deep hashing with a single cosine similarity based learning objective” In Advances in Neural Information Processing Systems 34, 2021, pp. 24286–24298
  20. “A survey on locality sensitive hashing algorithms and their applications” In arXiv preprint arXiv:2102.08942, 2021
  21. Herve Jegou, Matthijs Douze and Cordelia Schmid “Hamming embedding and weak geometric consistency for large scale image search” In Computer Vision–ECCV 2008: 10th European Conference on Computer Vision, Marseille, France, October 12-18, 2008, Proceedings, Part I 10, 2008, pp. 304–317 Springer
  22. “Aggregating local descriptors into a compact image representation” In 2010 IEEE computer society conference on computer vision and pattern recognition, 2010, pp. 3304–3311 IEEE
  23. “Asymmetric deep supervised hashing” In Proceedings of the AAAI conference on artificial intelligence 32.1, 2018
  24. “Maximum-margin hamming hashing” In Proceedings of the IEEE/CVF international conference on computer vision, 2019, pp. 8252–8261
  25. “Proxy anchor loss for deep metric learning” In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, 2020, pp. 3238–3247
  26. Diederik P Kingma and Jimmy Ba “Adam: A method for stochastic optimization” In arXiv preprint arXiv:1412.6980, 2014
  27. “Learning multiple layers of features from tiny images” Toronto, ON, Canada, 2009
  28. Alex Krizhevsky, Vinod Nair and Geoffrey Hinton “The CIFAR-10 dataset” In online: http://www. cs. toronto. edu/kriz/cifar. html 55.5, 2014
  29. Alex Krizhevsky, Ilya Sutskever and Geoffrey E Hinton “Imagenet classification with deep convolutional neural networks” In Advances in neural information processing systems 25, 2012
  30. Brian Kulis “Metric learning: A survey” In Foundations and Trends® in Machine Learning 5.4 Now Publishers, Inc., 2013, pp. 287–364
  31. “Simultaneous feature learning and hash coding with deep neural networks” In Proceedings of the IEEE conference on computer vision and pattern recognition, 2015, pp. 3270–3278
  32. Mario Lezcano Casado “Trivializations for gradient-based optimization on manifolds” In Advances in Neural Information Processing Systems 32, 2019
  33. Jun Li, Li Fuxin and Sinisa Todorovic “Efficient riemannian optimization on the stiefel manifold via the cayley transform” In arXiv preprint arXiv:2002.01113, 2020
  34. Wu-Jun Li, Sheng Wang and Wang-Cheng Kang “Feature learning based deep supervised hashing with pairwise labels” In arXiv preprint arXiv:1511.03855, 2015
  35. “Deep supervised discrete hashing” In Advances in neural information processing systems 30, 2017
  36. “Microsoft coco: Common objects in context” In Computer Vision–ECCV 2014: 13th European Conference, Zurich, Switzerland, September 6-12, 2014, Proceedings, Part V 13, 2014, pp. 740–755 Springer
  37. “Deep supervised hashing for fast image retrieval” In Proceedings of the IEEE conference on computer vision and pattern recognition, 2016, pp. 2064–2072
  38. “Discrete graph hashing” In Advances in neural information processing systems 27, 2014
  39. “Supervised hashing with kernels” In 2012 IEEE conference on computer vision and pattern recognition, 2012, pp. 2074–2081 IEEE
  40. “Hashing with graphs”, 2011
  41. “A Survey on Deep Hashing Methods” In ACM Trans. Knowl. Discov. Data 17.1 New York, NY, USA: Association for Computing Machinery, 2023 DOI: 10.1145/3532624
  42. “What If Neural Networks had SVDs?” In NeurIPS, 2020
  43. “Supervised discrete hashing” In Proceedings of the IEEE conference on computer vision and pattern recognition, 2015, pp. 37–45
  44. “Very deep convolutional networks for large-scale image recognition” In arXiv preprint arXiv:1409.1556, 2014
  45. “Learning to hash: A comprehensive survey of deep learning-based hashing methods” In Knowledge and Information Systems 64.10 Springer, 2022, pp. 2565–2597
  46. “Weighted gaussian loss based hamming hashing” In Proceedings of the 29th ACM International Conference on Multimedia, 2021, pp. 3409–3417
  47. Frank Uhlig “Constructive ways for generating (generalized) real orthogonal matrices as products of (generalized) symmetries” In Linear Algebra and its Applications 332 Elsevier, 2001, pp. 459–467
  48. “A survey on learning to hash” In IEEE transactions on pattern analysis and machine intelligence 40.4 IEEE, 2017, pp. 769–790
  49. Jun Wang, Sanjiv Kumar and Shih-Fu Chang “Semi-supervised hashing for scalable image retrieval” In 2010 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, 2010, pp. 3424–3431 IEEE
  50. Qifan Wang, Luo Si and Bin Shen “Learning to hash on structured data” In Proceedings of the AAAI Conference on Artificial Intelligence 29.1, 2015
  51. Yair Weiss, Antonio Torralba and Rob Fergus “Spectral hashing” In Advances in neural information processing systems 21, 2008
  52. “Supervised hashing for image retrieval via image representation learning” In Proceedings of the AAAI conference on artificial intelligence 28.1, 2014
  53. “HyP2 Loss: Beyond Hypersphere Metric Space for Multi-Label Image Retrieval” In Proceedings of the 30th ACM International Conference on Multimedia, MM ’22 Lisboa, Portugal: Association for Computing Machinery, 2022, pp. 3173–3184 DOI: 10.1145/3503161.3548032
  54. “Hyp2 loss: Beyond hypersphere metric space for multi-label image retrieval” In Proceedings of the 30th ACM International Conference on Multimedia, 2022, pp. 3173–3184
  55. “Central similarity quantization for efficient image and video retrieval” In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, 2020, pp. 3083–3092
  56. “Discrete image hashing using large weakly annotated photo collections” In Proceedings of the AAAI Conference on Artificial Intelligence 30.1, 2016
  57. “Improved Deep Hashing With Soft Pairwise Similarity for Multi-Label Image Retrieval” In IEEE Transactions on Multimedia 22.2, 2020, pp. 540–553 DOI: 10.1109/TMM.2019.2929957
  58. “Deep hashing network for efficient similarity retrieval” In Proceedings of the AAAI conference on Artificial Intelligence 30.1, 2016
User Edit Pencil Streamline Icon: https://streamlinehq.com
Authors (4)
  1. Lucas R. Schwengber (1 paper)
  2. Lucas Resende (4 papers)
  3. Paulo Orenstein (10 papers)
  4. Roberto I. Oliveira (29 papers)
Citations (1)