MiA-Emb: Mindscape-Guided Retriever
- MiA-Emb is a retrieval module that employs hierarchical summarization to construct a global mindscape for enhanced context-aware query embeddings.
- It utilizes a residual fusion mechanism to integrate local query features with global semantic cues, guiding selective retrieval from large documents.
- Empirical evaluations show MiA-Emb outperforms baselines by up to 20 recall points on long-context benchmarks, demonstrating its effectiveness.
The Mindscape-Guided Retriever (MiA-Emb) is the retrieval module within the Mindscape-Aware Retrieval-Augmented Generation (MiA-RAG) system, engineered for enhanced long-context understanding. MiA-Emb introduces explicit global context conditioning through hierarchical summarization, constructing a document-level “mindscape” that guides the retrieval of semantically relevant content from lengthy input corpora. By fusing this mindscape with user queries through a residual integration mechanism, MiA-Emb generates enriched, context-aware query embeddings that bias the retrieval process toward coherence with the global semantic structure, in contrast to local or unconditioned embedding approaches (Li et al., 19 Dec 2025).
1. System Architecture and Role in MiA-RAG
MiA-Emb serves as the retriever component in the MiA-RAG pipeline, producing global-context–conditioned query embeddings. Its central objective is to leverage document-level semantic structure to (a) direct retrieval toward the topical scope of the input document (“Selective Retrieval”) and (b) enable enriched understanding by fusing explicit query intent with holistic semantic signals. The architecture is built atop Qwen3-Embedding-8B, further adapted with LoRA parameter-efficient adapters (rank=128, ), and incorporates special control tokens for mode signaling. A residual fusion mechanism ensures that global cues are integrated while preserving the discriminative local features of the original query.
2. Mindscape Construction via Hierarchical Summarization
To construct the global semantic scaffold (“mindscape”), MiA-Emb applies a two-step hierarchical summarization procedure, operationalized through a LLM summarizer (, e.g., GPT-4o):
- Chunk-level summarization: For each chunk from a document decomposition , generate chunk summary .
- Global summarization: Generate the mindscape .
The result is a global summary (approximately 500–1,000 tokens), capturing the essential semantics of the entire document. This mindscape serves as contextual input for both subsequent retrieval and generation.
3. Mindscape-Guided Query Embedding and Residual Fusion
MiA-Emb constructs the retriever input by concatenating the user query , the mindscape , and task-specific control tokens:
where 0 is the embedding instruction, 1 is an end-of-query marker, 2 and 3 are control tokens for node vs. chunk retrieval, respectively.
A transformer-based encoder 4 computes contextual representations 5. To derive the mindscape-aware embedding, a residual fusion is applied:
6
where 7 is the hidden state at 8 (capturing query intent), and 9 is the hidden state at the active task token (with full cross-attention to 0). Empirically, 1.
This fusion yields 2, maintaining compatibility with local query signals while integrating global context.
4. Retrieval Process and Scoring
Chunks of the document are indexed via static embeddings 3. For inference, MiA-Emb computes the cosine similarity between the mindscape-enriched query 4 and each chunk embedding:
5
Top-6 chunks are selected based on descending score. No additional reranking is applied, as the global conditioning of the query embedding raises relevant context in the candidate list.
5. Training Objectives and Optimization
MiA-Emb employs a multi-task contrastive (InfoNCE) loss across chunk and node retrieval:
7
For each retrieval task 8 and batch size 9, the loss is formulated as:
0
where 1 is a silver-labeled positive and 2 includes this positive and a mixture of hard and random negatives. Temperature 3 is fixed at 0.01.
6. Generator Integration and Inference Workflow
At inference, MiA-Emb yields the top-4 relevant chunks 5 conditioned on the mindscape. The MiA-Gen generator (Qwen2.5-14B) is supplied with the mindscape, the retrieved chunks, and the original query in concatenated form:
6
The generation process thus leverages a unified global semantic context, ensuring coherence between retrieval and evidence-based reasoning.
7. Empirical Performance and Evaluation
Evaluation uses retrieval recall@7 (for 8) on long-context benchmarks:
| Dataset | MiA-Emb Recall@3/5/10 | Qwen-Emb-8B Recall@3/5/10 | SitEmb-8B Recall@3/5/10 |
|---|---|---|---|
| NarrativeQA | 62.68/75.92/88.09 | 41.81/54.51/71.13 | 59.98/70.70/82.68 |
| DetectiveQA-ZH | 46.75/59.17/72.50 | 28.58/39.08/55.58 | 42.50/54.50/69.30 |
| DetectiveQA-EN | 42.08/54.17/69.75 | 24.17/34.17/49.25 | 36.75/49.25/63.83 |
MiA-Emb consistently outperforms strong baselines (including SitEmb-8B) by +3–5 recall points, and exceeds vanilla local/global-unconditioned retrievers by +15–20 points. This demonstrates the efficacy of explicit global semantic conditioning for selective long-context retrieval.
Summary: MiA-Emb operationalizes mindscape-aware retrieval through (1) hierarchical document summarization for holistic semantic context, (2) contextual query embedding via residual fusion, and (3) multi-task contrastive alignment to silver-annotated evidence. These design features enable more effective and human-like navigation of long document semantics compared to prior RAG retrievers (Li et al., 19 Dec 2025).