Noise-powered Multi-modal Knowledge Graph Representation Framework (2403.06832v3)

Published 11 Mar 2024 in cs.CL and cs.AI

Abstract: The rise of Multi-modal Pre-training highlights the necessity for a unified Multi-Modal Knowledge Graph (MMKG) representation learning framework. Such a framework is essential for embedding structured knowledge into multi-modal LLMs effectively, alleviating issues like knowledge misconceptions and multi-modal hallucinations. In this work, we explore the efficacy of models in accurately embedding entities within MMKGs through two pivotal tasks: Multi-modal Knowledge Graph Completion (MKGC) and Multi-modal Entity Alignment (MMEA). Building on this foundation, we propose a novel SNAG method that utilizes a Transformer-based architecture equipped with modality-level noise masking to robustly integrate multi-modal entity features in KGs. By incorporating specific training objectives for both MKGC and MMEA, our approach achieves SOTA performance across a total of ten datasets, demonstrating its versatility. Moreover, SNAG can not only function as a standalone model but also enhance other existing methods, providing stable performance improvements. Code and data are available at https://github.com/zjukg/SNAG.
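The modality-level noise masking mentioned in the abstract can be illustrated with a minimal sketch. This is a hypothetical reconstruction, not the authors' implementation (see the linked repository for that): it assumes each entity carries one feature vector per modality, and during training a whole modality is occasionally replaced with Gaussian noise so the fusion model learns to tolerate corrupted or missing modalities.

```python
import numpy as np

def noise_mask_modalities(features, p=0.3, rng=None):
    """Randomly replace entire modality feature vectors with Gaussian noise.

    features: dict mapping modality name -> 1-D feature vector
    p: probability that a given modality is masked with noise
    Returns a new dict; unmasked modalities pass through unchanged.
    """
    rng = rng or np.random.default_rng()
    out = {}
    for modality, vec in features.items():
        if rng.random() < p:
            # Swap the whole modality embedding for random noise, forcing the
            # downstream fusion module to be robust to this modality.
            out[modality] = rng.standard_normal(vec.shape).astype(vec.dtype)
        else:
            out[modality] = vec
    return out

# Hypothetical entity with three modalities of dimension 4
entity = {
    "structure": np.ones(4, dtype=np.float32),
    "image": np.full(4, 2.0, dtype=np.float32),
    "text": np.full(4, 3.0, dtype=np.float32),
}
masked = noise_mask_modalities(entity, p=0.5, rng=np.random.default_rng(0))
```

In the actual method, the (possibly noised) modality features would then be fed to a Transformer encoder that fuses them into a single entity representation for the MKGC and MMEA objectives.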

Authors (8)
  1. Zhuo Chen (319 papers)
  2. Yin Fang (32 papers)
  3. Yichi Zhang (184 papers)
  4. Lingbing Guo (27 papers)
  5. Huajun Chen (198 papers)
  6. Wen Zhang (170 papers)
  7. Jiaoyan Chen (1 paper)
  8. Jeff Z. Pan (78 papers)
Citations (1)
