Federated Neural Graph Databases (2402.14609v3)
Abstract: The increasing demand for large language models (LLMs) has highlighted the importance of efficient data retrieval mechanisms. Neural graph databases (NGDBs) have emerged as a promising approach to storing and querying graph-structured data in neural space, enabling the retrieval of relevant information for LLMs. However, existing NGDBs are typically designed to operate on a single graph, which limits their ability to reason across multiple graphs. Their lack of support for multi-source graph data also hinders their ability to capture the complexity and diversity of real-world data. In many applications, data is distributed across multiple sources, and the ability to reason across these sources is crucial for making informed decisions. This limitation is particularly problematic for sensitive graph data, because directly sharing and aggregating such data poses significant privacy risks. As a result, many applications that rely on NGDBs are forced to choose between compromising data privacy and sacrificing the ability to reason across multiple graphs. To address these limitations, we propose Federated Neural Graph Database (FedNGDB), a novel framework that enables reasoning over multi-source graph-based data while preserving privacy. FedNGDB leverages federated learning to collaboratively learn graph representations across multiple sources, enriching relationships between entities and improving the overall quality of the graph data. Unlike existing methods, FedNGDB can handle complex graph structures and relationships, making it suitable for various downstream tasks.
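The abstract does not say how FedNGDB aggregates representations, so the following is only a minimal sketch of the general idea behind federated learning of graph representations, not the paper's protocol. It assumes a plain FedAvg-style weighted average of entity embeddings on a server, with each client sending embeddings rather than raw triples. The function name `federated_average`, the two clients, the entity names, and the weights are all hypothetical, and the sketch omits any privacy mechanism (such as secret sharing or differential privacy) that a real system would need.

```python
from typing import Dict, List

Embedding = List[float]


def federated_average(
    client_embeddings: List[Dict[str, Embedding]],
    client_weights: List[float],
) -> Dict[str, Embedding]:
    """Weighted average of each entity's embedding over the clients that hold it."""
    sums: Dict[str, Embedding] = {}
    totals: Dict[str, float] = {}
    for embeddings, weight in zip(client_embeddings, client_weights):
        for entity, vector in embeddings.items():
            if entity not in sums:
                sums[entity] = [0.0] * len(vector)
                totals[entity] = 0.0
            # Accumulate the weighted vector and the total weight per entity.
            sums[entity] = [s + weight * v for s, v in zip(sums[entity], vector)]
            totals[entity] += weight
    return {e: [s / totals[e] for s in sums[e]] for e in sums}


# Two hypothetical clients that both know the entity "aspirin".
hospital = {"aspirin": [1.0, 0.0], "patient_7": [0.5, 0.5]}
insurer = {"aspirin": [0.0, 1.0], "claim_3": [0.2, 0.8]}

# Weights stand in for local training-set sizes (an assumption of this sketch).
merged = federated_average([hospital, insurer], [3.0, 1.0])
print(merged["aspirin"])  # [0.75, 0.25]
```

Entities held by only one client keep their local embedding, while shared entities are pulled toward the client with the larger weight. This is the sense in which multi-source training can enrich relationships between entities without pooling the underlying graphs.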
- Qi Hu
- Weifeng Jiang
- Haoran Li
- Zihao Wang
- Jiaxin Bai
- Qianren Mao
- Yangqiu Song
- Lixin Fan
- Jianxin Li