GraphText: Bridging Graphs with Language Models

Updated 18 September 2025
  • GraphText is a framework that translates graph-structured data into hierarchical natural language prompts, enabling graph reasoning tasks like node classification and link prediction.
  • It employs a deterministic transformation to convert graphs into text, preserving key node features and relational structures in a graph-syntax tree format.
  • GraphText facilitates interactive and explainable analytics by leveraging LLMs for in-context learning, allowing transparent reasoning and human intervention.

GraphText refers to a family of frameworks and methodologies that bridge graph-structured data with natural language representations, enabling graph reasoning and learning via LLMs. Most notably, the "GraphText" framework introduced in (Zhao et al., 2023) systematically “translates” arbitrary graphs into structured natural language prompts, thereby allowing LLMs to execute graph analytic and prediction tasks—such as node classification, link prediction, and even interactive graph reasoning—entirely in text space through prompt-based or in-context learning.

1. Transformation of Graphs to Natural Language

The critical innovation in GraphText is the design of a deterministic transformation $g$ that encodes a graph $G = (\mathcal{V}, \mathcal{E}, \mathbf{X})$ (nodes, edges, and optional node features) into a structured natural language prompt $T_{\text{in}}$. This conversion is not a simple flattening or serialization; instead, the process organizes graph information into a hierarchical, human-interpretable sequence:

  • Graph-syntax tree: This data structure, inspired by linguistic syntax trees, formally encodes both node/edge features and higher-order relations.
    • Leaves: Each leaf corresponds to a discretized or natural-language-rendered node/edge attribute (e.g., quantized feature values assigned natural-language descriptors).
    • Internal nodes: Each encapsulates relational substructures, such as the center-node, immediate (1st-hop) and secondary neighbors (2nd-hop), and, optionally, complex topological orderings (e.g., by personalized PageRank rank).
    • Hierarchical/ordered composition: By traversing this tree in a topological sort, an ordered text sequence is produced describing the graph or local subgraph of interest.

Effectively, this allows an arbitrary graph to be rendered as an ordered, information-rich text sequence $T_{\text{in}} = g(G)$. During inference, $T_{\text{in}}$ is supplied to an LLM, e.g., ChatGPT, which outputs a text result subsequently decoded by a simple mapping $h$ to the desired prediction (such as a node label).
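A minimal sketch of such a mapping is given below. The helper names (`quantize`, `ego_to_text`), the coarse low/medium/high feature descriptors, and the exact prompt layout are illustrative assumptions, not the implementation from (Zhao et al., 2023).

```python
# Illustrative sketch of a deterministic graph-to-text mapping g.
# Descriptor scheme and prompt layout are assumptions, not the paper's code.

def quantize(value, bins=("low", "medium", "high")):
    """Map a feature value in [0, 1] to a coarse natural-language descriptor."""
    idx = min(int(value * len(bins)), len(bins) - 1)
    return bins[idx]

def ego_to_text(center, adj, features, labels=None):
    """Render the ego-graph of `center` as a hierarchical prompt T_in = g(G)."""
    labels = labels or {}
    hop1 = sorted(adj.get(center, []))
    hop2 = sorted({v for u in hop1 for v in adj.get(u, [])
                   if v != center and v not in hop1})

    def describe(node):
        feats = ", ".join(quantize(x) for x in features[node])
        return f"node {node} (feature: {feats}; label: {labels.get(node, 'unknown')})"

    lines = ["center-node:", f"  {describe(center)}", "1st-hop neighbors:"]
    lines += [f"  {describe(n)}" for n in hop1]
    lines += ["2nd-hop neighbors:"] + [f"  {describe(n)}" for n in hop2]
    return "\n".join(lines)

# Toy 4-node graph with 2-dimensional features; node 0 is the prediction target.
adj = {0: [1, 2], 1: [0, 3], 2: [0], 3: [1]}
features = {0: [0.1, 0.9], 1: [0.8, 0.2], 2: [0.5, 0.5], 3: [0.9, 0.1]}
labels = {1: "A", 2: "B"}
print(ego_to_text(0, adj, features, labels))
```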

2. Training-Free and In-Context Learning for Graph Reasoning

Unlike traditional graph machine learning, which relies on parameterized models (GCNs, GATs, etc.) specifically trained for each graph or task, the GraphText methodology enables training-free graph reasoning using LLMs. The key mechanisms:

  • In-Context Learning (ICL): By providing a few annotated demonstration samples (i.e., pairs of input text $T_{\text{in}}$ and labels) within the prompt, the LLM generalizes from these few-shot examples to predict unseen nodes. No parameter update is required for the LLM—reasoning occurs entirely at inference time, leveraging the LLM’s pretraining.
  • Incorporation of Graph Inductive Biases: The text prompt is designed to capture graph-theoretic properties such as local neighborhood aggregation (akin to $A^k X$ in GNN architectures), node rankings (PPR), or even synthetic, domain-specific biases. This allows encoding of model inductive biases directly into the prompt served to the LLM.

Empirically, such LLM-based reasoning can achieve parity with, or even outperform, supervised GNN models on node classification benchmarks, without requiring retraining or fine-tuning on graph-specific data (Zhao et al., 2023).
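A minimal sketch of this few-shot prompting loop follows; the prompt template, the placeholder demonstration texts, and the decoding rule (standing in for the mapping $h$) are illustrative assumptions rather than the exact prompts used in (Zhao et al., 2023).

```python
# Illustrative in-context learning loop: labelled demonstrations plus a query,
# with no parameter updates. Demonstration texts stand in for prompts produced by g.

def build_icl_prompt(demos, query_text, classes):
    """Assemble a few-shot prompt from (text, label) demonstrations and a query node."""
    parts = [f"Classify each node into one of: {', '.join(classes)}."]
    parts += [f"{text}\nAnswer: {label}" for text, label in demos]
    parts.append(f"{query_text}\nAnswer:")
    return "\n\n".join(parts)

def decode(reply, classes):
    """A simple mapping h: return the first class name mentioned in the LLM's reply."""
    return next((c for c in classes if c in reply), None)

demos = [
    ("center-node: node 1 (feature: high, low)\n1st-hop neighbors: ...", "A"),
    ("center-node: node 2 (feature: medium, medium)\n1st-hop neighbors: ...", "B"),
]
query = "center-node: node 0 (feature: low, high)\n1st-hop neighbors: ..."
prompt = build_icl_prompt(demos, query, classes=["A", "B"])
# `prompt` is sent to any chat LLM; its free-text reply is then decoded with
# decode(reply, ["A", "B"]) to obtain the predicted label.
```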

3. The Graph-Syntax Tree: Structure and Generality

The graph-syntax tree is the formal device enabling the expressive translation of structured, non-Euclidean, non-sequential data into a text domain suitable for LLMs. Features include:

  • Multi-level composition: Nodes at different depths correspond to various relational levels—from the ego node (“center-node”), through 1st-hop and multi-hop neighborhoods, to global ranking-based summaries. Each attribute (feature, label, role) is mapped onto unique textual descriptors.
  • Order and information preservation: Unlike naïve serializations, the tree ensures that vital topological configurations (positions in the neighborhood, relative ranks) are not lost during translation. This enables LLMs—otherwise deprived of structure—to effectively “see” the graph.
  • Task generality: The design naturally incorporates various graph tasks. For node classification, tree traversal encodes the ego-graph for the target node; for link prediction, it optionally aggregates features from both endpoints and their collective neighbors; for graph classification, it can describe subgraphs or the full graph in structured text.

A simplified construct is as follows:

  • Root (empty or task-prompt)
    • “feature”: F_Xi
    • “label”: F_Yj
    • “center-node”: attributes of the target node
    • “1st-hop”: attributes and labels of direct neighbors
    • “2nd-hop”: more distant context
    • “PPR”: node orderings induced by personalized PageRank (see the sketch below)
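As an illustration of how the “PPR” branch might be populated, the sketch below computes a personalized PageRank ordering with plain power iteration on a toy graph; the damping factor and iteration count are arbitrary choices, and the actual computation in (Zhao et al., 2023) may differ.

```python
# Illustrative personalized PageRank (power iteration), used only to order nodes
# for the "PPR" leaf of the graph-syntax tree; parameters are arbitrary choices.

def personalized_pagerank(adj, seed, alpha=0.85, iters=50):
    """PPR restarted at `seed`; dangling-node mass is simply dropped in this sketch."""
    ppr = {n: 1.0 if n == seed else 0.0 for n in adj}
    for _ in range(iters):
        nxt = {n: (1.0 - alpha) * (1.0 if n == seed else 0.0) for n in adj}
        for u, neighbors in adj.items():
            deg = len(neighbors) or 1
            for v in neighbors:
                nxt[v] += alpha * ppr[u] / deg
        ppr = nxt
    return ppr

adj = {0: [1, 2], 1: [0, 3], 2: [0], 3: [1]}           # same toy graph as above
scores = personalized_pagerank(adj, seed=0)
ppr_order = sorted(adj, key=scores.get, reverse=True)  # ordering for the "PPR" branch
```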

4. Interactive and Explainable Graph Reasoning

GraphText’s operation in natural language space offers two unique properties:

  • Interactive reasoning: Users can communicate with GraphText-augmented LLMs in natural language, augmenting, overriding, or querying intermediate reasoning steps. The entire reasoning cascade (from aggregation of neighbor information to decision rationales) is visible and modifiable. In experiments, human interventions (such as asking the LLM to increase reliance on PPR neighbors vs. the central node) can measurably improve prediction accuracy for difficult cases; a sketch of such an intervention follows this list.
  • Explainability: Since reasoning occurs as text, the LLM’s “thought process” is transparent by construction. For example, the LLM might write: “Most of the 1st-hop neighbors of this node are in class A, thus I predict class A.” This chain-of-thought naturally supports error diagnosis and rationalization that would be opaque in GNN-based classifiers.
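A minimal sketch of such an intervention, assuming a standard chat-message format, is shown below; the graph prompt placeholder, the assistant reply, and the revision instruction are invented for illustration.

```python
# Illustrative interactive revision turn in a chat-style conversation.
# The prompt placeholder and both messages are invented examples.

messages = [
    {"role": "user", "content": "<graph prompt T_in, built as in the earlier sketches>"},
    {"role": "assistant", "content": "Label B; reason: the center node's own feature."},
    {"role": "user", "content": ("Please rely more on the PPR-ordered neighbors than "
                                 "on the center node, then revise your answer.")},
]
# Re-submitting `messages` to the same chat LLM yields a revised prediction whose
# rationale remains visible (and further editable) as text.
```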

5. Advantages, Limitations, and Research Outlook

The “GraphText” approach yields several key advantages:

  • Model generality and flexibility: A single pre-trained LLM, with the right prompt construction, can process arbitrary graphs across domains without requiring a specialized architecture per graph.
  • Elimination of task-specific retraining: No per-graph training or fine-tuning is typically required, reducing computational and data requirements and enabling efficient application to novel data.
  • Configurable inductive biases: By altering the graph-to-text mapping (attributes $F$, relations $R$), one can imbue the reasoning process with domain-specific knowledge or mimic advanced GNN strategies.
  • Interactivity and transparency: End-to-end reasoning is available in natural language, facilitating human-in-the-loop analytics and iterative improvement.

However, several challenges persist:

  • Expressivity and scaling: The text generated for large graphs can exceed the limited context windows of LLMs, placing a premium on prompt engineering for maximal information density.
  • Continuous and high-dimensional attributes: Discretizing or paraphrasing continuous node or edge features into salient natural language without excessive information loss remains an open question.
  • Optimal text design: Systematic methodologies for mapping complex graph substructures or higher-order motifs to text remain under-explored. The effects of various attribute orderings and prompt templates on LLM reasoning fidelity are not fully understood.

Future research directions include:

  • Extending GraphText to multi-modal and multi-step reasoning tasks, e.g., integrating with chain-of-thought paradigms or with hybrid graph-LLM systems.
  • Exploring principled approaches to prompt compression, such as programmatic summarization of graph regions, to scale to massive graphs.
  • Bridging to multi-modal LLMs for graph data—incorporating visualizations, tabular representations, or raw attribute matrices into composite prompts.

6. Summary Table: Core Design and Evaluation Elements

| Component | Description/Function | Example (from Zhao et al., 2023) |
|---|---|---|
| Graph-syntax tree | Hierarchical text encoding of node/edge features and relations | Center-node, 1st-hop, 2nd-hop, PPR ranks |
| LLM inference | Prediction and rationalization performed in text space | LLM outputs “Label X; reason: neighbor majority.” |
| In-context learning | Few-shot reasoning via example prompts | Prompt includes labeled examples |
| Inductive bias | Encoded via prompt template ($F$, $R$ choices) | PPR ordering, feature aggregation |
| Interactive revision | User-provided language modifies inference | “Please focus on PPR order for verdict.” |

This methodology opens new frontiers in general-purpose, explainable, and interactive graph analytics by leveraging the representational bandwidth and reasoning abilities of advanced LLMs, translating the historically discrete world of graphs into the universal medium of language.

References

Zhao, J., Zhuo, L., Shen, Y., Qu, M., Liu, K., Bronstein, M., Zhu, Z., & Tang, J. (2023). GraphText: Graph Reasoning in Text Space. arXiv:2310.01089.
