
Augmenting Textual Generation via Topology Aware Retrieval (2405.17602v1)

Published 27 May 2024 in cs.IR

Abstract: Despite the impressive advancements of LLMs in generating text, they are often limited by the knowledge contained in the input and prone to producing inaccurate or hallucinated content. To tackle these issues, Retrieval-augmented Generation (RAG) is employed as an effective strategy to enhance the available knowledge base and anchor the responses in reality by pulling additional texts from external databases. In real-world applications, texts are often linked through entities within a graph, such as citations in academic papers or comments in social networks. This paper exploits these topological relationships to guide the retrieval process in RAG. Specifically, we explore two kinds of topological connections: proximity-based, focusing on closely connected nodes, and role-based, which looks at nodes sharing similar subgraph structures. Our empirical research confirms their relevance to text relationships, leading us to develop a Topology-aware Retrieval-augmented Generation framework. This framework includes a retrieval module that selects texts based on their topological relationships and an aggregation module that integrates these texts into prompts to stimulate LLMs for text generation. We have curated established text-attributed networks and conducted comprehensive experiments to validate the effectiveness of this framework, demonstrating its potential to enhance RAG with topological awareness.
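
The two kinds of topological connection described in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: the function names (`proximity_retrieve`, `role_retrieve`, `build_prompt`), the toy graph, and the prompt template are all hypothetical. Proximity is approximated here by hop distance from a breadth-first search, and role similarity by the difference in node degree, which is only a crude stand-in for comparing subgraph structures.

```python
from collections import deque


def proximity_retrieve(adj, source, max_hops=2, top_k=3):
    """Return up to top_k nodes nearest to `source` by hop distance (BFS)."""
    dist = {source: 0}
    queue = deque([source])
    while queue:
        node = queue.popleft()
        if dist[node] == max_hops:
            continue  # do not expand beyond the hop limit
        for nbr in adj.get(node, []):
            if nbr not in dist:
                dist[nbr] = dist[node] + 1
                queue.append(nbr)
    candidates = [n for n in dist if n != source]
    # Closest first; ties broken by node id for determinism.
    candidates.sort(key=lambda n: (dist[n], n))
    return candidates[:top_k]


def role_retrieve(adj, source, top_k=3):
    """Rank nodes by degree similarity to `source` (assumed proxy for role)."""
    degree = {n: len(nbrs) for n, nbrs in adj.items()}
    candidates = [n for n in adj if n != source]
    candidates.sort(key=lambda n: (abs(degree[n] - degree[source]), n))
    return candidates[:top_k]


def build_prompt(texts, source, retrieved):
    """Aggregate retrieved texts into a single prompt (hypothetical template)."""
    context = "\n".join(f"- {texts[n]}" for n in retrieved)
    return (
        f"Context from related nodes:\n{context}\n\n"
        f"Target text:\n{texts[source]}\n\n"
        "Continue the target text:"
    )


# Toy text-attributed graph: a path-like citation network.
adj = {"a": ["b", "c"], "b": ["a", "d"], "c": ["a"], "d": ["b", "e"], "e": ["d"]}
texts = {n: f"Text of node {n}" for n in adj}

near = proximity_retrieve(adj, "a", max_hops=2, top_k=3)
similar = role_retrieve(adj, "a", top_k=2)
prompt = build_prompt(texts, "a", near)
```

In this toy graph the proximity-based call returns the nodes within two hops of `a`, while the role-based call can return a node such as `d` that is not adjacent to `a` but has the same degree. That difference is the point of treating the two signals separately.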

Authors (9)
  1. Yu Wang (939 papers)
  2. Nedim Lipka (49 papers)
  3. Ruiyi Zhang (98 papers)
  4. Alexa Siu (13 papers)
  5. Yuying Zhao (25 papers)
  6. Bo Ni (12 papers)
  7. Xin Wang (1306 papers)
  8. Ryan Rossi (67 papers)
  9. Tyler Derr (48 papers)