Robust Training of Temporal GNNs using Nearest Neighbours based Hard Negatives (2402.09239v1)
Abstract: Temporal graph neural networks Tgnn have exhibited state-of-art performance in future-link prediction tasks. Training of these TGNNs is enumerated by uniform random sampling based unsupervised loss. During training, in the context of a positive example, the loss is computed over uninformative negatives, which introduces redundancy and sub-optimal performance. In this paper, we propose modified unsupervised learning of Tgnn, by replacing the uniform negative sampling with importance-based negative sampling. We theoretically motivate and define the dynamically computed distribution for a sampling of negative examples. Finally, using empirical evaluations over three real-world datasets, we show that Tgnn trained using loss based on proposed negative sampling provides consistent superior performance.
- Privacy-Aware Recommender Systems Challenge on Twitter’s Home Timeline. https://doi.org/10.48550/ARXIV.2004.13715
- On the properties of neural machine translation: Encoder-decoder approaches. arXiv preprint arXiv:1409.1259 (2014).
- CuCo: Graph Representation with Curriculum Contrastive Learning. In International Joint Conference on Artificial Intelligence.
- Debiased contrastive learning. Advances in Neural Information Processing Systems 33 (2020).
- Provably expressive temporal graph networks. ArXiv abs/2209.15059 (2022).
- BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding. CoRR abs/1810.04805 (2018). arXiv:1810.04805 http://arxiv.org/abs/1810.04805
- DynGEM: Deep Embedding Method for Dynamic Graphs. ArXiv abs/1805.11273 (2018).
- Alex Graves. 2012. Long short-term memory. Supervised sequence labelling with recurrent neural networks (2012), 37–45.
- Shubham Gupta and Srikanta Bedathur. 2022. A Survey on Temporal Graph Representation Learning and Generative Modeling. arXiv:2208.12126 [cs.LG]
- TIGGER: Scalable Generative Modelling for Temporal Interaction Graphs. https://doi.org/10.48550/ARXIV.2203.03564
- GRAFENNE: Learning on Graphs with Heterogeneous and Dynamic Feature Sets. In Proceedings of the 40th International Conference on Machine Learning (Proceedings of Machine Learning Research, Vol. 202), Andreas Krause, Emma Brunskill, Kyunghyun Cho, Barbara Engelhardt, Sivan Sabato, and Jonathan Scarlett (Eds.). PMLR, 12165–12181. https://proceedings.mlr.press/v202/gupta23b.html
- Inductive Representation Learning on Large Graphs. In Proceedings of the 31st International Conference on Neural Information Processing Systems (Long Beach, California, USA) (NIPS’17). Curran Associates Inc., Red Hook, NY, USA, 1025–1035.
- Chih-Hui Ho and Nuno Nvasconcelos. 2020. Contrastive learning with adversarial examples. Advances in Neural Information Processing Systems 33 (2020), 17081–17093.
- Efficiently Teaching an Effective Dense Retriever with Balanced Topic Aware Sampling. https://doi.org/10.48550/ARXIV.2104.06967
- Yifei Hu and Ya Zhang. 2021. Graph Contrastive Learning with Local and Global Mutual Information Maximization. In Proceedings of the 2020 8th International Conference on Information Technology: IoT and Smart City (Xi’an, China) (ICIT ’20). Association for Computing Machinery, New York, NY, USA, 74–78. https://doi.org/10.1145/3446999.3447013
- A survey on contrastive self-supervised learning. Technologies 9, 1 (2020), 2.
- Glen Jeh and Jennifer Widom. 2003. Scaling Personalized Web Search. In Proceedings of the 12th International Conference on World Wide Web (Budapest, Hungary) (WWW ’03). Association for Computing Machinery, New York, NY, USA, 271–279. https://doi.org/10.1145/775152.775191
- Hard Negative Mixing for Contrastive Learning. ArXiv abs/2010.01028 (2020).
- Dense Passage Retrieval for Open-Domain Question Answering. In Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing (EMNLP). Association for Computational Linguistics, Online, 6769–6781. https://doi.org/10.18653/v1/2020.emnlp-main.550
- Angelos Katharopoulos and François Fleuret. 2018. Not All Samples Are Created Equal: Deep Learning with Importance Sampling. In ICML.
- Time2Vec: Learning a Vector Representation of Time. https://doi.org/10.48550/ARXIV.1907.05321
- Community interaction and conflict on the web. In Proceedings of the 2018 World Wide Web Conference on World Wide Web. International World Wide Web Conferences Steering Committee, 933–943.
- Edge weight prediction in weighted signed networks. In Data Mining (ICDM), 2016 IEEE 16th International Conference on. IEEE, 221–230.
- Predicting dynamic embedding trajectory in temporal interaction networks. In Proceedings of the 25th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining. ACM, 1269–1278.
- In-Batch Negatives for Knowledge Distillation with Tightly-Coupled Teachers for Dense Retrieval. In Proceedings of the 6th Workshop on Representation Learning for NLP (RepL4NLP-2021). Association for Computational Linguistics, Online, 163–173. https://doi.org/10.18653/v1/2021.repl4nlp-1.17
- Towards Fine-Grained Temporal Network Representation via Time-Reinforced Random Walk. In AAAI.
- Temporal Network Embedding with Micro- and Macro-Dynamics. In Proceedings of the 28th ACM International Conference on Information and Knowledge Management (Beijing, China) (CIKM ’19). Association for Computing Machinery, New York, NY, USA, 469–478. https://doi.org/10.1145/3357384.3357943
- Image-Based Recommendations on Styles and Substitutes. In Proceedings of the 38th International ACM SIGIR Conference on Research and Development in Information Retrieval (Santiago, Chile) (SIGIR ’15). Association for Computing Machinery, New York, NY, USA, 43–52. https://doi.org/10.1145/2766462.2767755
- EvolveGCN: Evolving Graph Convolutional Networks for Dynamic Graphs. In Proceedings of the Thirty-Fourth AAAI Conference on Artificial Intelligence.
- Graph Representation Learning via Graphical Mutual Information Maximization. In Proceedings of The Web Conference 2020 (Taipei, Taiwan) (WWW ’20). Association for Computing Machinery, New York, NY, USA, 259–270. https://doi.org/10.1145/3366423.3380112
- RocketQA: An Optimized Training Approach to Dense Passage Retrieval for Open-Domain Question Answering. In Proceedings of the 2021 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies. Association for Computational Linguistics, Online, 5835–5847. https://doi.org/10.18653/v1/2021.naacl-main.466
- RocketQAv2: A Joint Training Method for Dense Passage Retrieval and Passage Re-ranking. In Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing. Association for Computational Linguistics, Online and Punta Cana, Dominican Republic, 2825–2835. https://doi.org/10.18653/v1/2021.emnlp-main.224
- Contrastive learning with hard negative samples. arXiv preprint arXiv:2010.04592 (2020).
- Temporal Graph Networks for Deep Learning on Dynamic Graphs. In ICML 2020 Workshop on Graph Representation Learning.
- Aravind Sankar and Hao Yang. 2020. DySAT: Deep Neural Representation Learning on Dynamic Graphs via Self-Attention Networks. In Proceedings of the 13th International Conference on Web Search and Data Mining (Houston, TX, USA) (WSDM ’20). Association for Computing Machinery, New York, NY, USA, 519–527. https://doi.org/10.1145/3336191.3371845
- Node Embedding over Temporal Graphs. In IJCAI.
- DyRep: Learning Representations over Dynamic Graphs. In 7th International Conference on Learning Representations, ICLR 2019, New Orleans, LA, USA, May 6-9, 2019. OpenReview.net. https://openreview.net/forum?id=HyePrhR5KX
- Laurens van der Maaten and Geoffrey E. Hinton. 2008. Visualizing Data using t-SNE. In Journal of Machine Learning Research, Vol. 9. 2579–2605.
- Graph Attention Networks. International Conference on Learning Representations (2018). https://openreview.net/forum?id=rJXMpikCZ
- Deep Graph Infomax. In International Conference on Learning Representations. https://openreview.net/forum?id=rklz9iAcKQ
- Inductive Representation Learning in Temporal Networks via Causal Anonymous Walks. CoRR abs/2101.05974 (2021). arXiv:2101.05974 https://arxiv.org/abs/2101.05974
- ProGCL: Rethinking Hard Negative Mining in Graph Contrastive Learning. In International conference on machine learning. PMLR.
- Approximate Nearest Neighbor Negative Contrastive Learning for Dense Text Retrieval. In 9th International Conference on Learning Representations, ICLR 2021, Virtual Event, Austria, May 3-7, 2021. OpenReview.net. https://openreview.net/forum?id=zeFrfgyZln
- Inductive representation learning on temporal graphs. arXiv preprint arXiv:2002.07962 (2020).
- Self-supervised Graph-level Representation Learning with Local and Global Structure. In International Conference on Machine Learning.
- Understanding Negative Sampling in Graph Representation Learning. In Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining (Virtual Event, CA, USA) (KDD ’20). Association for Computing Machinery, New York, NY, USA, 1666–1676. https://doi.org/10.1145/3394486.3403218
- Graph Convolutional Neural Networks for Web-Scale Recommender Systems. In Proceedings of the 24th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining (London, United Kingdom) (KDD ’18). Association for Computing Machinery, New York, NY, USA, 974–983. https://doi.org/10.1145/3219819.3219890
- Optimizing Dense Retrieval Model Training with Hard Negatives. https://doi.org/10.48550/ARXIV.2104.08051
- Learning Node Embeddings in Interaction Graphs. In Proceedings of the 2017 ACM on Conference on Information and Knowledge Management (Singapore, Singapore) (CIKM ’17). Association for Computing Machinery, New York, NY, USA, 397–406. https://doi.org/10.1145/3132847.3132918
- Zhen Zhang and Can Wang. 2020. Learning Temporal Interaction Graph Embedding via Coupled Memory Networks. In Proceedings of The Web Conference 2020 (Taipei, Taiwan) (WWW ’20). Association for Computing Machinery, New York, NY, USA, 3049–3055. https://doi.org/10.1145/3366423.3380076
- Graph Debiased Contrastive Learning with Joint Representation Clustering. In International Joint Conference on Artificial Intelligence.
- Structure-Aware Hard Negative Mining for Heterogeneous Graph Contrastive Learning. ArXiv abs/2108.13886 (2021).
- Deep Graph Contrastive Representation Learning. In ICML Workshop on Graph Representation Learning and Beyond. http://arxiv.org/abs/2006.04131
- Link and Triadic Closure Delay: Temporal Metrics for Social Network Dynamics. Proceedings of the International AAAI Conference on Web and Social Media 8, 1 (May 2014), 564–573. https://ojs.aaai.org/index.php/ICWSM/article/view/14507
- Yuan Zuo and Junjie Wu. 2018. Embedding Temporal Network via Neighborhood Formation. In Proceedings of the 24th ACM SIGKDD International Conference on Knowledge Discovery; Data Mining (London, United Kingdom) (KDD ’18). Association for Computing Machinery, New York, NY, USA, 2857–2866. https://doi.org/10.1145/3219819.3220054