Deep Hashing via Householder Quantization (2311.04207v3)
Abstract: Hashing is at the heart of large-scale image similarity search, and recent methods have been substantially improved through deep learning techniques. Such algorithms typically learn continuous embeddings of the data. To avoid a subsequent costly binarization step, a common solution is to employ loss functions that combine a similarity learning term (to ensure similar images are grouped to nearby embeddings) and a quantization penalty term (to ensure that the embedding entries are close to binarized entries, e.g., -1 or 1). Still, the interaction between these two terms can make learning harder and the embeddings worse. We propose an alternative quantization strategy that decomposes the learning problem in two stages: first, perform similarity learning over the embedding space with no quantization; second, find an optimal orthogonal transformation of the embeddings so each coordinate of the embedding is close to its sign, and then quantize the transformed embedding through the sign function. In the second step, we parametrize orthogonal transformations using Householder matrices to efficiently leverage stochastic gradient descent. Since similarity measures are usually invariant under orthogonal transformations, this quantization strategy comes at no cost in terms of performance. The resulting algorithm is unsupervised, fast, hyperparameter-free and can be run on top of any existing deep hashing or metric learning algorithm. We provide extensive experimental results showing that this approach leads to state-of-the-art performance on widely used image datasets, and, unlike other quantization strategies, brings consistent improvements in performance to existing deep hashing algorithms.
- P.A. Absil, R. Mahony and R. Sepulchre “Optimization Algorithms on Matrix Manifolds” Princeton University Press, 2009 URL: https://books.google.com.br/books?id=NSQGQeLN3NcC
- “Beyond locality-sensitive hashing” In Proceedings of the twenty-fifth annual ACM-SIAM symposium on Discrete algorithms, 2014, pp. 1018–1028 SIAM
- “Ensemble deep manifold similarity learning using hard proxies” In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2019, pp. 7299–7307
- “Signature verification using a" siamese" time delay neural network” In Advances in neural information processing systems 6, 1993
- “An updated table of minimum-distance bounds for binary linear codes” In IEEE Transactions on Information Theory 39.2, 1993, pp. 662–677 DOI: 10.1109/18.212301
- “Deep cauchy hashing for hamming space retrieval” In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2018, pp. 1229–1237
- “Hashnet: Deep learning to hash by continuation” In Proceedings of the IEEE international conference on computer vision, 2017, pp. 5608–5617
- “Deep priority hashing” In Proceedings of the 26th ACM international conference on Multimedia, 2018, pp. 1653–1661
- “Deep discrete hashing with pairwise correlation learning” In Neurocomputing 385 Elsevier, 2020, pp. 111–121
- Sumit Chopra, Raia Hadsell and Yann LeCun “Learning a similarity metric discriminatively, with application to face verification” In 2005 IEEE computer society conference on computer vision and pattern recognition (CVPR’05) 1, 2005, pp. 539–546 IEEE
- “Nus-wide: a real-world web image database from national university of singapore” In Proceedings of the ACM international conference on image and video retrieval, 2009, pp. 1–9
- “ImageNet: A large-scale hierarchical image database” In 2009 IEEE Conference on Computer Vision and Pattern Recognition, 2009, pp. 248–255 DOI: 10.1109/CVPR.2009.5206848
- “Imagenet: A large-scale hierarchical image database” In 2009 IEEE conference on computer vision and pattern recognition, 2009, pp. 248–255 Ieee
- Khoa D Doan, Peng Yang and Ping Li “One loss for quantization: Deep hashing with discrete wasserstein distributional matching” In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022, pp. 9447–9457
- “Deep Polarized Network for Supervised Learning of Accurate Binary Hashing Codes.” In IJCAI, 2020, pp. 825–831
- Aristides Gionis, Piotr Indyk and Rajeev Motwani “Similarity search in high dimensions via hashing” In Vldb 99.6, 1999, pp. 518–529
- G.H. Golub and C.F. Van Loan “Matrix Computations”, Johns Hopkins Studies in the Mathematical Sciences Johns Hopkins University Press, 1996 URL: https://books.google.com.br/books?id=mlOa7wPX6OYC
- “Iterative quantization: A procrustean approach to learning binary codes for large-scale image retrieval” In IEEE transactions on pattern analysis and machine intelligence 35.12 IEEE, 2012, pp. 2916–2929
- “One loss for all: Deep hashing with a single cosine similarity based learning objective” In Advances in Neural Information Processing Systems 34, 2021, pp. 24286–24298
- “A survey on locality sensitive hashing algorithms and their applications” In arXiv preprint arXiv:2102.08942, 2021
- Herve Jegou, Matthijs Douze and Cordelia Schmid “Hamming embedding and weak geometric consistency for large scale image search” In Computer Vision–ECCV 2008: 10th European Conference on Computer Vision, Marseille, France, October 12-18, 2008, Proceedings, Part I 10, 2008, pp. 304–317 Springer
- “Aggregating local descriptors into a compact image representation” In 2010 IEEE computer society conference on computer vision and pattern recognition, 2010, pp. 3304–3311 IEEE
- “Asymmetric deep supervised hashing” In Proceedings of the AAAI conference on artificial intelligence 32.1, 2018
- “Maximum-margin hamming hashing” In Proceedings of the IEEE/CVF international conference on computer vision, 2019, pp. 8252–8261
- “Proxy anchor loss for deep metric learning” In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, 2020, pp. 3238–3247
- Diederik P Kingma and Jimmy Ba “Adam: A method for stochastic optimization” In arXiv preprint arXiv:1412.6980, 2014
- “Learning multiple layers of features from tiny images” Toronto, ON, Canada, 2009
- Alex Krizhevsky, Vinod Nair and Geoffrey Hinton “The CIFAR-10 dataset” In online: http://www. cs. toronto. edu/kriz/cifar. html 55.5, 2014
- Alex Krizhevsky, Ilya Sutskever and Geoffrey E Hinton “Imagenet classification with deep convolutional neural networks” In Advances in neural information processing systems 25, 2012
- Brian Kulis “Metric learning: A survey” In Foundations and Trends® in Machine Learning 5.4 Now Publishers, Inc., 2013, pp. 287–364
- “Simultaneous feature learning and hash coding with deep neural networks” In Proceedings of the IEEE conference on computer vision and pattern recognition, 2015, pp. 3270–3278
- Mario Lezcano Casado “Trivializations for gradient-based optimization on manifolds” In Advances in Neural Information Processing Systems 32, 2019
- Jun Li, Li Fuxin and Sinisa Todorovic “Efficient riemannian optimization on the stiefel manifold via the cayley transform” In arXiv preprint arXiv:2002.01113, 2020
- Wu-Jun Li, Sheng Wang and Wang-Cheng Kang “Feature learning based deep supervised hashing with pairwise labels” In arXiv preprint arXiv:1511.03855, 2015
- “Deep supervised discrete hashing” In Advances in neural information processing systems 30, 2017
- “Microsoft coco: Common objects in context” In Computer Vision–ECCV 2014: 13th European Conference, Zurich, Switzerland, September 6-12, 2014, Proceedings, Part V 13, 2014, pp. 740–755 Springer
- “Deep supervised hashing for fast image retrieval” In Proceedings of the IEEE conference on computer vision and pattern recognition, 2016, pp. 2064–2072
- “Discrete graph hashing” In Advances in neural information processing systems 27, 2014
- “Supervised hashing with kernels” In 2012 IEEE conference on computer vision and pattern recognition, 2012, pp. 2074–2081 IEEE
- “Hashing with graphs”, 2011
- “A Survey on Deep Hashing Methods” In ACM Trans. Knowl. Discov. Data 17.1 New York, NY, USA: Association for Computing Machinery, 2023 DOI: 10.1145/3532624
- “What If Neural Networks had SVDs?” In NeurIPS, 2020
- “Supervised discrete hashing” In Proceedings of the IEEE conference on computer vision and pattern recognition, 2015, pp. 37–45
- “Very deep convolutional networks for large-scale image recognition” In arXiv preprint arXiv:1409.1556, 2014
- “Learning to hash: A comprehensive survey of deep learning-based hashing methods” In Knowledge and Information Systems 64.10 Springer, 2022, pp. 2565–2597
- “Weighted gaussian loss based hamming hashing” In Proceedings of the 29th ACM International Conference on Multimedia, 2021, pp. 3409–3417
- Frank Uhlig “Constructive ways for generating (generalized) real orthogonal matrices as products of (generalized) symmetries” In Linear Algebra and its Applications 332 Elsevier, 2001, pp. 459–467
- “A survey on learning to hash” In IEEE transactions on pattern analysis and machine intelligence 40.4 IEEE, 2017, pp. 769–790
- Jun Wang, Sanjiv Kumar and Shih-Fu Chang “Semi-supervised hashing for scalable image retrieval” In 2010 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, 2010, pp. 3424–3431 IEEE
- Qifan Wang, Luo Si and Bin Shen “Learning to hash on structured data” In Proceedings of the AAAI Conference on Artificial Intelligence 29.1, 2015
- Yair Weiss, Antonio Torralba and Rob Fergus “Spectral hashing” In Advances in neural information processing systems 21, 2008
- “Supervised hashing for image retrieval via image representation learning” In Proceedings of the AAAI conference on artificial intelligence 28.1, 2014
- “HyP2 Loss: Beyond Hypersphere Metric Space for Multi-Label Image Retrieval” In Proceedings of the 30th ACM International Conference on Multimedia, MM ’22 Lisboa, Portugal: Association for Computing Machinery, 2022, pp. 3173–3184 DOI: 10.1145/3503161.3548032
- “Hyp2 loss: Beyond hypersphere metric space for multi-label image retrieval” In Proceedings of the 30th ACM International Conference on Multimedia, 2022, pp. 3173–3184
- “Central similarity quantization for efficient image and video retrieval” In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, 2020, pp. 3083–3092
- “Discrete image hashing using large weakly annotated photo collections” In Proceedings of the AAAI Conference on Artificial Intelligence 30.1, 2016
- “Improved Deep Hashing With Soft Pairwise Similarity for Multi-Label Image Retrieval” In IEEE Transactions on Multimedia 22.2, 2020, pp. 540–553 DOI: 10.1109/TMM.2019.2929957
- “Deep hashing network for efficient similarity retrieval” In Proceedings of the AAAI conference on Artificial Intelligence 30.1, 2016
- Lucas R. Schwengber (1 paper)
- Lucas Resende (4 papers)
- Paulo Orenstein (10 papers)
- Roberto I. Oliveira (29 papers)