Papers
Topics
Authors
Recent
Search
2000 character limit reached

A Quasi-Wasserstein Loss for Learning Graph Neural Networks

Published 18 Oct 2023 in cs.LG | (2310.11762v4)

Abstract: When learning graph neural networks (GNNs) in node-level prediction tasks, most existing loss functions are applied for each node independently, even if node embeddings and their labels are non-i.i.d. because of their graph structures. To eliminate such inconsistency, in this study we propose a novel Quasi-Wasserstein (QW) loss with the help of the optimal transport defined on graphs, leading to new learning and prediction paradigms of GNNs. In particular, we design a ``Quasi-Wasserstein'' distance between the observed multi-dimensional node labels and their estimations, optimizing the label transport defined on graph edges. The estimations are parameterized by a GNN in which the optimal label transport may determine the graph edge weights optionally. By reformulating the strict constraint of the label transport to a Bregman divergence-based regularizer, we obtain the proposed Quasi-Wasserstein loss associated with two efficient solvers learning the GNN together with optimal label transport. When predicting node labels, our model combines the output of the GNN with the residual component provided by the optimal label transport, leading to a new transductive prediction paradigm. Experiments show that the proposed QW loss applies to various GNNs and helps to improve their performance in node-level classification and regression tasks. The code of this work can be found at \url{https://github.com/SDS-Lab/QW_Loss}.

Authors (2)
Definition Search Book Streamline Icon: https://streamlinehq.com
References (61)
  1. Wasserstein generative adversarial networks. In International conference on machine learning. PMLR, 214–223.
  2. Analyzing the expressive power of graph neural networks in a spectral perspective. In International Conference on Learning Representations.
  3. Iterative Bregman projections for regularized transportation problems. SIAM Journal on Scientific Computing 37, 2 (2015), A1111–A1138.
  4. Graph neural networks with convolutional arma filters. IEEE transactions on pattern analysis and machine intelligence 44, 7 (2021), 3496–3507.
  5. Optimal transport graph neural networks. arXiv preprint arXiv:2006.04804 (2020).
  6. Adaptive Universal Generalized PageRank Graph Neural Network. In International Conference on Learning Representations.
  7. Marco Cuturi. 2013. Sinkhorn distances: lightspeed computation of optimal transport. In Proceedings of the 26th International Conference on Neural Information Processing Systems-Volume 2. 2292–2300.
  8. Convolutional neural networks on graphs with fast localized spectral filtering. In Proceedings of the 30th International Conference on Neural Information Processing Systems. 3844–3852.
  9. On the equivalence of decoupled graph convolution network and label propagation. In Proceedings of the Web Conference 2021. 3651–3662.
  10. Yihe Dong and Will Sawin. 2020. Copt: Coordinated optimal transport on graphs. Advances in Neural Information Processing Systems 33 (2020), 19327–19338.
  11. Gbk-gnn: Gated bi-kernel graph neural networks for modeling both homophily and heterophily. In Proceedings of the ACM Web Conference 2022. 1550–1558.
  12. Montacer Essid and Justin Solomon. 2018. Quadratically regularized optimal transport on graphs. SIAM Journal on Scientific Computing 40, 4 (2018), A1961–A1986.
  13. Enrico Facca and Michele Benzi. 2021. Fast iterative solution of the optimal transport problem on graphs. SIAM Journal on Scientific Computing 43, 3 (2021), A2295–A2319.
  14. Graph neural networks for social recommendation. In The world wide web conference. 417–426.
  15. Learning with a Wasserstein loss. In Proceedings of the 28th International Conference on Neural Information Processing Systems-Volume 2. 2053–2061.
  16. Magnn: Metapath aggregated graph neural network for heterogeneous graph embedding. In Proceedings of The Web Conference 2020. 2331–2341.
  17. Predict then Propagate: Graph Neural Networks meet Personalized PageRank. In International Conference on Learning Representations.
  18. Unsupervised alignment of embeddings with wasserstein procrustes. In The 22nd International Conference on Artificial Intelligence and Statistics. PMLR, 1880–1890.
  19. Graph Neural Networks with Diverse Spectral Filtering. In Proceedings of the ACM Web Conference 2023. 306–316.
  20. Inductive representation learning on large graphs. In Proceedings of the 31st International Conference on Neural Information Processing Systems. 1025–1035.
  21. Convolutional neural networks on graphs with chebyshev approximation, revisited. Advances in Neural Information Processing Systems 35 (2022), 7264–7276.
  22. Bernnet: Learning arbitrary graph spectral filters via bernstein approximation. Advances in Neural Information Processing Systems 34 (2021), 14239–14251.
  23. Combining Label Propagation and Simple Models out-performs Graph Neural Networks. In International Conference on Learning Representations.
  24. Could graph neural networks learn better molecular representation for drug discovery? A comparison study of descriptor-based and graph-based models. Journal of cheminformatics 13, 1 (2021), 1–23.
  25. Diederik P Kingma and Jimmy Ba. 2014. Adam: A method for stochastic optimization. arXiv preprint arXiv:1412.6980 (2014).
  26. Thomas N Kipf and Max Welling. 2017. Semi-Supervised Classification with Graph Convolutional Networks. In International Conference on Learning Representations.
  27. Diffusion improves graph learning. In Proceedings of the 33rd International Conference on Neural Information Processing Systems. 13366–13378.
  28. Hierarchical optimal transport for multimodal distribution alignment. In Proceedings of the 33rd International Conference on Neural Information Processing Systems. 13475–13485.
  29. Mengzhang Li and Zhanxing Zhu. 2021. Spatial-temporal fusion graph neural networks for traffic flow forecasting. In Proceedings of the AAAI conference on artificial intelligence, Vol. 35. 4189–4196.
  30. Large Scale Learning on Non-Homophilous Graphs: New Benchmarks and Strong Simple Methods. In Advances in Neural Information Processing Systems, M. Ranzato, A. Beygelzimer, Y. Dauphin, P.S. Liang, and J. Wortman Vaughan (Eds.), Vol. 34. Curran Associates, Inc., 20887–20902. https://proceedings.neurips.cc/paper_files/paper/2021/file/ae816a80e4c1c56caa2eb4e1819cbb2f-Paper.pdf
  31. Image-based recommendations on styles and substitutes. In Proceedings of the 38th international ACM SIGIR conference on research and development in information retrieval. 43–52.
  32. Learning convolutional neural networks for graphs. In International conference on machine learning. PMLR, 2014–2023.
  33. Geom-GCN: Geometric Graph Convolutional Networks. In International Conference on Learning Representations.
  34. Gabriel Peyré and Marco Cuturi. 2019. Computational Optimal Transport. Foundations and Trends ® in Machine Learning 11, 5-6 (2019), 355–607.
  35. Deepinf: Social influence prediction with deep learning. In Proceedings of the 24th ACM SIGKDD international conference on knowledge discovery & data mining. 2110–2119.
  36. Gmnn: Graph markov neural networks. In International conference on machine learning. PMLR, 5241–5250.
  37. Multi-scale attributed node embedding. Journal of Complex Networks 9, 2 (2021), cnab014.
  38. Filippo Santambrogio. 2015. Optimal Transport for Applied Mathematicians: Calculus of Variations, PDEs, and Modeling. Vol. 87. Birkhäuser.
  39. E (n) equivariant graph neural networks. In International conference on machine learning. PMLR, 9323–9332.
  40. Collective classification in network data. AI magazine 29, 3 (2008), 93–93.
  41. Optimal transport for structured data with application on graphs. In International Conference on Machine Learning. PMLR, 6275–6284.
  42. Wasserstein Auto-Encoders. In International Conference on Learning Representations.
  43. Graph Attention Networks. In International Conference on Learning Representations.
  44. Cédric Villani. 2008. Optimal transport: old and new. Vol. 338. Springer Science & Business Media.
  45. Online graph dictionary learning. In International Conference on Machine Learning. PMLR, 10564–10574.
  46. Huahua Wang and Arindam Banerjee. 2014. Bregman alternating direction method of multipliers. In Proceedings of the 27th International Conference on Neural Information Processing Systems-Volume 2. 2816–2824.
  47. Hongwei Wang and Jure Leskovec. 2021. Combining graph convolutional neural networks and label propagation. ACM Transactions on Information Systems (TOIS) 40, 4 (2021), 1–27.
  48. Heterogeneous graph attention network. In The world wide web conference. 2022–2032.
  49. Traffic flow prediction via spatial temporal graph neural network. In Proceedings of the web conference 2020. 1082–1092.
  50. Molecular contrastive learning of representations via graph neural networks. Nature Machine Intelligence 4, 3 (2022), 279–287.
  51. A fast proximal point method for computing exact wasserstein distance. In Uncertainty in artificial intelligence. PMLR, 433–453.
  52. Graph convolutional networks using heat kernel for semi-supervised learning. In Proceedings of the 28th International Joint Conference on Artificial Intelligence. 1928–1934.
  53. Hongteng Xu. 2020. Gromov-Wasserstein factorization models for graph clustering. In Proceedings of the AAAI conference on artificial intelligence, Vol. 34. 6478–6485.
  54. Representing graphs via Gromov-Wasserstein factorization. IEEE Transactions on Pattern Analysis and Machine Intelligence 45, 1 (2022), 999–1016.
  55. Gromov-wasserstein learning for graph matching and node embedding. In International conference on machine learning. PMLR, 6932–6941.
  56. How Powerful are Graph Neural Networks?. In International Conference on Learning Representations.
  57. Revisiting semi-supervised learning with graph embeddings. In International conference on machine learning. PMLR, 40–48.
  58. RSGNN: A Model-agnostic Approach for Enhancing the Robustness of Signed Graph Neural Networks. In Proceedings of the ACM Web Conference 2023. 60–70.
  59. Interpreting and unifying graph neural networks with an optimization framework. In Proceedings of the Web Conference 2021. 1215–1226.
  60. Xiaojin Zhu and Andrew B Goldberg. 2022. Introduction to Semi-Supervised Learning. Springer Nature.
  61. Chenyi Zhuang and Qiang Ma. 2018. Dual graph convolutional networks for graph-based semi-supervised classification. In Proceedings of the 2018 world wide web conference. 499–508.

Summary

No one has generated a summary of this paper yet.

Paper to Video (Beta)

No one has generated a video about this paper yet.

Whiteboard

No one has generated a whiteboard explanation for this paper yet.

Open Problems

We haven't generated a list of open problems mentioned in this paper yet.

Continue Learning

We haven't generated follow-up questions for this paper yet.

Collections

Sign up for free to add this paper to one or more collections.