Enhance Hyperbolic Representation Learning via Second-order Pooling
Abstract: Hyperbolic representation learning is well known for its ability to capture hierarchical information. However, samples from different levels of a class hierarchy may need to be separated by a large distance. We show that the hyperbolic discriminant objective forces the backbone to capture this hierarchical information, which may inevitably increase the backbone's Lipschitz constant and, in turn, hinder full use of its generalization ability. To address this issue, we introduce second-order pooling into hyperbolic representation learning, as it naturally increases the distance between samples without compromising the generalization ability of the input features. The backbone's Lipschitz constant therefore does not necessarily need to be large. However, current off-the-shelf low-dimensional bilinear pooling methods cannot be used directly in hyperbolic representation learning because they inevitably reduce this distance expansion capability. To solve this problem, we propose a kernel approximation regularization that enables low-dimensional bilinear features to approximate the kernel function well. Finally, we conduct extensive experiments on graph-structured datasets to demonstrate the effectiveness of the proposed method.
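To make the two ingredients named in the abstract more concrete, here is a minimal NumPy sketch. It relies on the standard identity that the inner product of two flattened outer products equals the degree-2 polynomial kernel, (x_i · x_j)^2. The low-dimensional feature uses a Hadamard product of two random ±1 projections (a random-Maclaurin-style map), and the regularizer is a mean-squared gap between Gram matrices. Both of these choices, and all function names, are assumptions made for illustration and may differ from the paper's actual formulation.

```python
import numpy as np

def second_order_pool(x):
    """Full second-order feature: flattened outer product x x^T (d*d dims)."""
    return np.einsum("ni,nj->nij", x, x).reshape(x.shape[0], -1)

def low_dim_bilinear(x, w1, w2):
    """Low-dimensional bilinear feature (hypothetical form):
    Hadamard product of two linear projections, scaled by 1/sqrt(k)."""
    k = w1.shape[1]
    return (x @ w1) * (x @ w2) / np.sqrt(k)

def kernel_approx_loss(x, z):
    """Mean squared gap between the Gram matrix of z and the
    degree-2 polynomial kernel (x_i . x_j)^2 of the inputs."""
    target = (x @ x.T) ** 2
    approx = z @ z.T
    return float(np.mean((approx - target) ** 2))

rng = np.random.default_rng(0)
d, k, n = 16, 64, 8                      # input dim, reduced dim, samples
x = rng.normal(size=(n, d))
x /= np.linalg.norm(x, axis=1, keepdims=True)   # unit-norm features

# Full second-order pooling reproduces the kernel exactly (up to rounding).
full = second_order_pool(x)
loss_full = kernel_approx_loss(x, full)

# Random +/-1 projections give an unbiased but noisy kernel estimate,
# so the loss is positive; a regularizer of this kind could be minimized
# over w1 and w2 during training.
w1 = rng.choice([-1.0, 1.0], size=(d, k))
w2 = rng.choice([-1.0, 1.0], size=(d, k))
low = low_dim_bilinear(x, w1, w2)
loss_low = kernel_approx_loss(x, low)

print(full.shape, low.shape)
print(loss_full, loss_low)
```

In this sketch the full feature has d² = 256 dimensions while the bilinear feature has only k = 64, which is why an explicit penalty on the kernel gap is a plausible way to keep the compressed feature close to the second-order one.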