Multimodal Graph Neural Network for Recommendation with Dynamic De-redundancy and Modality-Guided Feature De-noisy (2411.01561v1)
Abstract: Graph neural networks (GNNs) have become crucial in multimodal recommendation tasks because of their powerful ability to capture complex relationships between neighboring nodes. However, increasing the number of propagation layers in GNNs can lead to feature redundancy, which may negatively impact the overall recommendation performance. In addition, the existing recommendation task method directly maps the preprocessed multimodal features to the low-dimensional space, which will bring the noise unrelated to user preference, thus affecting the representation ability of the model. To tackle the aforementioned challenges, we propose Multimodal Graph Neural Network for Recommendation (MGNM) with Dynamic De-redundancy and Modality-Guided Feature De-noisy, which is divided into local and global interaction. Initially, in the local interaction process,we integrate a dynamic de-redundancy (DDR) loss function which is achieved by utilizing the product of the feature coefficient matrix and the feature matrix as a penalization factor. It reduces the feature redundancy effects of multimodal and behavioral features caused by the stacking of multiple GNN layers. Subsequently, in the global interaction process, we developed modality-guided global feature purifiers for each modality to alleviate the impact of modality noise. It is a two-fold guiding mechanism eliminating modality features that are irrelevant to user preferences and captures complex relationships within the modality. Experimental results demonstrate that MGNM achieves superior performance on multimodal information denoising and removal of redundant information compared to the state-of-the-art methods.
- N. Yang, K. Zeng, Q. Wu, and J. Yan, “Molerec: Combinatorial drug recommendation with substructure-aware molecular representation learning,” in Proceedings of the ACM Web Conference 2023, 2023, pp. 4075–4085.
- C. Yang, C. Xiao, F. Ma, L. Glass, and J. Sun, “Safedrug: Dual molecular graph encoders for recommending effective and safe drug combinations,” in Proceedings of the Thirtieth International Joint Conference on Artificial Intelligence (IJCAI), 2021, pp. 3735–3741.
- Y. Zheng, J. Qin, P. Wei, Z. Chen, and L. Lin, “Cipl: Counterfactual interactive policy learning to eliminate popularity bias for online recommendation,” IEEE Transactions on Neural Networks and Learning Systems, pp. 1–14, 2023.
- D. Wang, X. Zhang, D. Yu, G. Xu, and S. Deng, “Came: Content- and context-aware music embedding for recommendation,” IEEE Transactions on Neural Networks and Learning Systems, vol. 32, no. 3, pp. 1375–1388, 2021.
- V. L. Gatta, V. Moscato, M. Pennone, M. Postiglione, and G. Sperlí, “Music recommendation via hypergraph embedding,” IEEE Transactions on Neural Networks and Learning Systems, vol. 34, no. 10, pp. 7887–7899, 2023.
- Z. Ou, Z. Han, P. Liu, S. Teng, and M. Song, “Siir: Symmetrical information interaction modeling for news recommendation,” IEEE Transactions on Neural Networks and Learning Systems, pp. 1–12, 2023.
- H.-S. Sheu, Z. Chu, D. Qi, and S. Li, “Knowledge-guided article embedding refinement for session-based news recommendation,” IEEE Transactions on Neural Networks and Learning Systems, vol. 33, no. 12, pp. 7921–7927, 2022.
- S. Deng, L. Huang, G. Xu, X. Wu, and Z. Wu, “On deep learning for trust-aware recommendations in social networks,” IEEE Transactions on Neural Networks and Learning Systems, vol. 28, no. 5, pp. 1164–1177, 2017.
- X. Sha, Z. Sun, J. Zhang, and Y.-S. Ong, “Who wants to shop with you: Joint product–participant recommendation for group-buying service,” IEEE Transactions on Neural Networks and Learning Systems, vol. 35, no. 2, pp. 2353–2363, 2024.
- M. Liu, H. Gao, and S. Ji, “Towards deeper graph neural networks,” in Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining (KDD), 2020, pp. 338–348.
- T. K. Rusch, M. M. Bronstein, and S. Mishra, “A survey on oversmoothing in graph neural networks,” ArXiv, vol. abs/2303.10993, 2023.
- D. Chen, Y. Lin, W. Li, P. Li, J. Zhou, and X. Sun, “Measuring and relieving the over-smoothing problem for graph neural networks from the topological view,” Proceedings of the AAAI Conference on Artificial Intelligence (AAAI), vol. 34, no. 04, pp. 3438–3445, 2020.
- Y. Yan, M. Hashemi, K. Swersky, Y. Yang, and D. Koutra, “Two sides of the same coin: Heterophily and oversmoothing in graph convolutional neural networks,” in 2022 IEEE International Conference on Data Mining (ICDM), 2022, pp. 1287–1292.
- W. Wu, C. Wang, D. Shen, C. Qin, L. Chen, and H. Xiong, “Afdgcf: Adaptive feature de-correlation graph collaborative filtering for recommendations,” 2024.
- P. Yu, Z. Tan, and G. Lu, “Multi-view graph convolutional network for multimedia recommendation,” in Proceedings of the ACM International Conference on Multimedia (MM), 2023, pp. 6576–6585.
- X. Zhou, H. Zhou, Y. Liu, Z. Zeng, C. Miao, P. Wang, Y. You, and F. Jiang, “Bootstrap latent representations for multi-modal recommendation,” pp. 845–854, 2023.
- Y. Wei, X. Wang, L. Nie, X. He, R. Hong, and T.-S. Chua, “Mmgcn: Multi-modal graph convolution network for personalized recommendation of micro-video,” in Proceedings of the ACM International Conference on Multimedia (MM), 2019, pp. 1437–1445.
- Y. Koren, R. Bell, and C. Volinsky, “Matrix factorization techniques for recommender systems,” Computer, vol. 42, no. 8, pp. 30–37, 2009.
- R. Salakhutdinov and A. Mnih, “Probabilistic matrix factorization,” in Proceedings of the 20th International Conference on Neural Information Processing Systems (NIPS), 2007, pp. 1257–1264.
- C. Su, M. Chen, and X. Xie, “Graph convolutional matrix completion via relation reconstruction,” in Proceedings of the 2021 10th International Conference on Software and Computer Applications (ICSCA), 2021, pp. 51–56.
- R. van den Berg, T. Kipf, and M. Welling, “Graph convolutional matrix completion,” ArXiv, vol. abs/1706.02263, 2017.
- R. Ying, R. He, K. Chen, P. Eksombatchai, W. L. Hamilton, and J. Leskovec, “Graph convolutional neural networks for web-scale recommender systems,” in Proceedings of the 24th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining (KDD), 2018, pp. 974–983.
- B. Wu, X. He, Q. Zhang, M. Wang, and Y. Ye, “Gcrec: Graph-augmented capsule network for next-item recommendation,” IEEE Transactions on Neural Networks and Learning Systems, vol. 34, no. 12, pp. 10 164–10 177, 2023.
- K. Liu, F. Xue, D. Guo, P. Sun, S. Qian, and R. Hong, “Multimodal graph contrastive learning for multimedia-based recommendation,” IEEE Transactions on Multimedia, vol. 25, pp. 9343–9355, 2023.
- K. Mao, J. Zhu, X. Xiao, B. Lu, Z. Wang, and X. He, “Ultragcn: Ultra simplification of graph convolutional networks for recommendation,” in Proceedings of the 30th ACM International Conference on Information & Knowledge Management (CIKM), 2021, pp. 1253–1262.
- L. Chen, L. Wu, R. Hong, K. Zhang, and M. Wang, “Revisiting graph based collaborative filtering: A linear residual graph convolutional network approach,” Proceedings of the AAAI Conference on Artificial Intelligence (AAAI), vol. 34, no. 01, pp. 27–34, Apr. 2020.
- X. He, K. Deng, X. Wang, Y. Li, Y. Zhang, and M. Wang, “Lightgcn: Simplifying and powering graph convolution network for recommendation,” in Proceedings of the International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR), 2020, pp. 639–648.
- Y. Wang, Y. Zhao, Y. Zhang, and T. Derr, “Collaboration-aware graph convolutional network for recommender systems,” in Proceedings of the ACM Web Conference 2023 (WWW), 2023, pp. 91–101.
- Z. Wu, X. Dai, X. Wang, Y. Xiong, S. Gao, and D. Liu, “A multi-label recommendation algorithm based on graph attention and sentiment correction,” in 2023 4th International Seminar on Artificial Intelligence, Networking and Information Technology (AINIT), 2023, pp. 396–401.
- W. Song, Z. Xiao, Y. Wang, L. Charlin, M. Zhang, and J. Tang, “Session-based social recommendation via dynamic graph attention networks,” in Proceedings of the Twelfth ACM International Conference on Web Search and Data Mining (WSDM), 2019, pp. 555–563.
- R. He and J. McAuley, “Vbpr: Visual bayesian personalized ranking from implicit feedback,” in Proceedings of the AAAI Conference on Artificial Intelligence (AAAI), 2016, pp. 144–150.
- W.-C. Kang, C. Fang, Z. Wang, and J. McAuley, “Visually-aware fashion recommendation and design with generative image models,” 2017 IEEE International Conference on Data Mining (ICDM), pp. 207–216, 2017.
- C. Lei, D. Liu, W. Li, Z. Zha, and H. Li, “Comparative deep learning of hybrid representations for image recommendations,” 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pp. 2545–2553, 2016.
- L. Zheng, V. Noroozi, and P. S. Yu, “Joint deep modeling of users and items using reviews for recommendation,” in Proceedings of the Tenth ACM International Conference on Web Search and Data Mining (WSDM), 2017, pp. 425–434.
- H. Liu, F. Wu, W. Wang, X. Wang, P. Jiao, C. Wu, and X. Xie, “Nrpa: Neural recommendation with personalized attention,” in Proceedings of the 42nd International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR), 2019, pp. 1233–1236.
- D. Liu, J. Li, B. Du, J. Chang, and R. Gao, “Daml: Dual attention mutual learning between ratings and reviews for item recommendation,” in Proceedings of the 25th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining (KDD), 2019, pp. 344–352.
- K. He, X. Zhang, S. Ren, and J. Sun, “Deep residual learning for image recognition,” in IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2016, pp. 770–778.
- S. Nitish, R. Darsini, G. S. Shashank, V. Tejas, and A. Arya, “Bidirectional encoder representation from transformers (bert) variants for procedural long-form answer extraction,” in International Conference on Cloud Computing, Data Science & Engineering (Confluence) (NAACL), 2022, pp. 71–76.
- A. Dosovitskiy, L. Beyer, A. Kolesnikov, D. Weissenborn, X. Zhai, T. Unterthiner, M. Dehghani, M. Minderer, G. Heigold, S. Gelly, J. Uszkoreit, and N. Houlsby, “An image is worth 16x16 words: Transformers for image recognition at scale,” ArXiv, vol. abs/2010.11929, 2020.
- Z. Guo, J. Li, G. Li, C. Wang, S. Shi, and B. Ruan, “Lgmrec: Local and global graph learning for multimodal recommendation,” Proceedings of the AAAI Conference on Artificial Intelligence (AAAI), vol. 38, no. 8, pp. 8454–8462, 2024.
- S. Rendle, C. Freudenthaler, Z. Gantner, and L. Schmidt-Thieme, “Bpr: Bayesian personalized ranking from implicit feedback,” in Proceedings of the Twenty-Fifth Conference on Uncertainty in Artificial Intelligence (UAI), 2009, pp. 452–461.
- X. Zhou and Z. Shen, “A tale of two graphs: Freezing and denoising graph structures for multimodal recommendation,” in Proceedings of the 31st ACM International Conference on Multimedia (MM), 2023, pp. 935–943.
- J. Ni, J. Li, and J. McAuley, “Justifying recommendations using distantly-labeled reviews and fine-grained aspects,” in Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing (EMNLP-IJCNLP), 2019, pp. 188–197.
- N. Reimers and I. Gurevych, “Sentence-BERT: Sentence embeddings using Siamese BERT-networks,” in Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing (EMNLP-IJCNLP), Nov. 2019, pp. 3982–3992.
- J. Zhang, Y. Zhu, Q. Liu, S. Wu, S. Wang, and L. Wang, “Mining latent structures for multimedia recommendation,” in Proceedings of the 29th ACM International Conference on Multimedia (MM), 2021, pp. 3872–3880.
- X. Glorot and Y. Bengio, “Understanding the difficulty of training deep feedforward neural networks,” Journal of Machine Learning Research - Proceedings Track, vol. 9, pp. 249–256, 01 2010.
- D. P. Kingma and J. Ba, “Adam: A method for stochastic optimization,” vol. abs/1412.6980, 2014.
- Q. Wang, Y. Wei, J. Yin, J. Wu, X. Song, and L. Nie, “Dualgnn: Dual graph neural network for multimedia recommendation,” IEEE Transactions on Multimedia, pp. 1074–1084, 2023.