Can metadata enhance post-training?
Determine whether incorporating document-level metadata during post-training of large language models (e.g., instruction tuning) improves performance or training efficiency relative to post-training without metadata.
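One way to make this comparison concrete is to build two versions of the same post-training set, identical except for a metadata prefix. The sketch below is illustrative only: the `<meta>` wrapper, the `### Instruction:` / `### Response:` template, and the field names `source` and `topic` are assumptions made for this example and are not taken from the cited paper.

```python
from typing import Optional


def format_example(instruction: str, response: str,
                   metadata: Optional[dict] = None) -> str:
    """Render one instruction-tuning example, optionally prefixed by metadata."""
    parts = []
    if metadata:
        # Hypothetical serialization: one "key: value" line per field,
        # sorted by key so the rendering is deterministic.
        meta_lines = [f"{key}: {value}" for key, value in sorted(metadata.items())]
        parts.append("<meta>\n" + "\n".join(meta_lines) + "\n</meta>")
    parts.append(f"### Instruction:\n{instruction}")
    parts.append(f"### Response:\n{response}")
    return "\n\n".join(parts)


def build_arms(examples: list) -> tuple:
    """Return (baseline, with_metadata) renderings of the same examples."""
    baseline = [
        format_example(e["instruction"], e["response"]) for e in examples
    ]
    with_metadata = [
        format_example(e["instruction"], e["response"], e.get("metadata"))
        for e in examples
    ]
    return baseline, with_metadata


examples = [
    {
        "instruction": "Summarize.",
        "response": "Done.",
        "metadata": {"source": "example.org", "topic": "news"},
    }
]
baseline, with_metadata = build_arms(examples)
```

Keeping the two arms identical apart from the prefix means any difference in downstream performance or in the number of training steps needed can be attributed to the metadata rather than to the data itself.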
References
An open question remains whether metadata can also enhance post-training.
— Beyond URLs: Metadata Diversity and Position for Efficient LLM Pretraining
(2511.21613 - Fan et al., 26 Nov 2025) in Conclusion