Benefit of the reconstruction objective in a full-JEPA variant of LLM2Vec-Gen
Determine whether incorporating the reconstruction objective (next-token prediction conditioned on the learned compression tokens) remains beneficial in a full Joint Embedding Predictive Architecture (JEPA) variant of LLM2Vec-Gen, where the teacher and student are the same frozen large language model and training uses only an alignment objective to match the model’s mean-pooled embedding of its generated response; ascertain whether any benefit is primarily for interpretability rather than for embedding quality.
References
Whether the reconstruction objective remains beneficial in this setting, for interpretability rather than embedding quality, is an open empirical question.
— LLM2Vec-Gen: Generative Embeddings from Large Language Models
(2603.10913 - BehnamGhader et al., 11 Mar 2026) in Section: Open frontiers, Full JEPA mode