Generalized Relevance Learning Grassmann Quantization (2403.09183v1)
Abstract: Due to advancements in digital cameras, it is easy to gather multiple images (or videos) from an object under different conditions. Therefore, image-set classification has attracted more attention, and different solutions were proposed to model them. A popular way to model image sets is subspaces, which form a manifold called the Grassmann manifold. In this contribution, we extend the application of Generalized Relevance Learning Vector Quantization to deal with Grassmann manifold. The proposed model returns a set of prototype subspaces and a relevance vector. While prototypes model typical behaviours within classes, the relevance factors specify the most discriminative principal vectors (or images) for the classification task. They both provide insights into the model's decisions by highlighting influential images and pixels for predictions. Moreover, due to learning prototypes, the model complexity of the new method during inference is independent of dataset size, unlike previous works. We applied it to several recognition tasks including handwritten digit recognition, face recognition, activity recognition, and object recognition. Experiments demonstrate that it outperforms previous works with lower complexity and can successfully model the variation, such as handwritten style or lighting conditions. Moreover, the presence of relevances makes the model robust to the selection of subspaces' dimensionality.
- Z. Huang, R. Wang, S. Shan, and X. Chen, “Projection metric learning on grassmann manifold with application to video based face recognition,” in Proceedings of the IEEE conference on computer vision and pattern recognition, pp. 140–149, 2015.
- J. Zhang, G. Zhu, R. W. Heath Jr, and K. Huang, “Grassmannian learning: Embedding geometry awareness in shallow and deep learning,” arXiv preprint arXiv:1808.02229, 2018.
- O. Yamaguchi, K. Fukui, and K.-i. Maeda, “Face recognition using temporal image sequence,” in Proceedings Third IEEE International Conference on Automatic Face and Gesture Recognition, pp. 318–323. IEEE, 1998.
- K. Fukui and O. Yamaguchi, “Face recognition using multi-viewpoint patterns for robot vision,” in Robotics Research. The Eleventh International Symposium, pp. 192–201. Springer, 2005.
- T.-K. Kim, O. Arandjelović, and R. Cipolla, “Boosted manifold principal angles for image set-based recognition,” Pattern Recognition, vol. 40, no. 9, pp. 2475–2484, 2007.
- T.-K. Kim, J. Kittler, and R. Cipolla, “Discriminative learning and recognition of image set classes using canonical correlations,” IEEE Transactions on Pattern Analysis and Machine Intelligence, vol. 29, no. 6, pp. 1005–1018, 2007.
- R. Wang, S. Shan, X. Chen, and W. Gao, “Manifold-manifold distance with application to face recognition based on image set,” in 2008 IEEE Conference on Computer Vision and Pattern Recognition, pp. 1–8. IEEE, 2008.
- R. Wang and X. Chen, “Manifold discriminant analysis,” in 2009 IEEE Conference on Computer Vision and Pattern Recognition, pp. 429–436. IEEE, 2009.
- J. Hamm and D. D. Lee, “Grassmann discriminant analysis: a unifying view on subspace-based learning,” in Proceedings of the 25th international conference on Machine learning, pp. 376–383, 2008.
- M. T. Harandi, C. Sanderson, S. Shirazi, and B. C. Lovell, “Graph embedding discriminant analysis on grassmannian manifolds for improved image set matching,” in CVPR 2011, pp. 2705–2712. IEEE, 2011.
- D. Wei, X. Shen, Q. Sun, X. Gao, and W. Yan, “Prototype learning and collaborative representation using grassmann manifolds for image set classification,” Pattern Recognition, vol. 100, p. 107123, 2020.
- N. Sogi, L. S. Souza, B. B. Gatto, and K. Fukui, “Metric learning with a-based scalar product for image-set recognition,” in Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops, pp. 850–851, 2020.
- M. Mohammadi, J. Mutatiina, T. Saifollahi, and K. Bunte, “Detection of extragalactic ultra-compact dwarfs and globular clusters using explainable ai techniques,” Astronomy and Computing, vol. 39, p. 100555, 2022.
- B. Hammer and T. Villmann, “Generalized relevance learning vector quantization,” Neural Networks, vol. 15, no. 8-9, pp. 1059–1068, 2002.
- P. Schneider, M. Biehl, and B. Hammer, “Adaptive relevance matrices in learning vector quantization,” Neural computation, vol. 21, no. 12, pp. 3532–3561, 2009.
- K. Bunte, P. Schneider, B. Hammer, F.-M. Schleif, T. Villmann, and M. Biehl, “Limited rank matrix learning, discriminative dimension reduction and visualization,” Neural Networks, vol. 26, pp. 159–173, 2012.
- S. Saralajew and T. Villmann, “Adaptive tangent distances in generalized learning vector quantization for transformation and distortion invariant classification learning,” in 2016 International Joint Conference on Neural Networks (IJCNN), pp. 2672–2679. IEEE, 2016.
- S. Ghosh, P. Tino, and K. Bunte, “Visualisation and knowledge discovery from interpretable models,” in 2020 International Joint Conference on Neural Networks (IJCNN), pp. 1–8. IEEE, 2020.
- F. Tang, P. Tiňo, and H. Yu, “Generalized learning vector quantization with log-euclidean metric learning on symmetric positive-definite manifold,” IEEE Transactions on Cybernetics, 2022.
- M. Harandi, C. Sanderson, C. Shen, and B. C. Lovell, “Dictionary learning and sparse coding on grassmann manifolds: An extrinsic solution,” in Proceedings of the IEEE international conference on computer vision, pp. 3120–3127, 2013.
- A. Björck and G. H. Golub, “Numerical methods for computing angles between linear subspaces,” Mathematics of computation, vol. 27, no. 123, pp. 579–594, 1973.
- A. Sato and K. Yamada, “Generalized learning vector quantization,” Advances in neural information processing systems, vol. 8, 1995.
- L. H. Gilpin, D. Bau, B. Z. Yuan, A. Bajwa, M. Specter, and L. Kagal, “Explaining explanations: An overview of interpretability of machine learning,” in 2018 IEEE 5th International Conference on data science and advanced analytics (DSAA), pp. 80–89. IEEE, 2018.
- A. Edelman, T. A. Arias, and S. T. Smith, “The geometry of algorithms with orthogonality constraints,” SIAM journal on Matrix Analysis and Applications, vol. 20, no. 2, pp. 303–353, 1998.
- S. Saralajew, “New prototype concepts in classification learning,” Ph.D. dissertation, Universität Bielefeld, 2020.
- C. Deng, “Four face databases in matlab format,” 2008.
- K. Q. Weinberger and L. K. Saul, “Distance metric learning for large margin nearest neighbor classification.” Journal of machine learning research, vol. 10, no. 2, 2009.
- E. Fix and J. L. Hodges, “Discriminatory analysis. nonparametric discrimination: Consistency properties,” International Statistical Review/Revue Internationale de Statistique, vol. 57, no. 3, pp. 238–247, 1989.
- B. Leibe and B. Schiele, “Analyzing appearance and contour based methods for object categorization,” in 2003 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, 2003. Proceedings., vol. 2, pp. II–409. IEEE, 2003.
- L. Van der Maaten and G. Hinton, “Visualizing data using t-sne.” Journal of machine learning research, vol. 9, no. 11, 2008.
- M. D. Rodriguez, J. Ahmed, and M. Shah, “Action mach a spatio-temporal maximum average correlation height filter for action recognition,” in 2008 IEEE conference on computer vision and pattern recognition, pp. 1–8. IEEE, 2008.