Learning to Plan for Language Modeling from Unlabeled Data (2404.00614v2)
Abstract: By training to predict the next token in an unlabeled corpus, LLMs learn to perform many tasks without any labeled data. However, their next-token-prediction objective arguably limits their performance in scenarios that require planning, such as writing a coherent article. In this paper, we train a module for planning the future writing process via a self-supervised learning objective. Given the textual context, this planning module learns to predict future abstract writing actions, which correspond to centroids in a clustered text embedding space. By conditioning on these actions, our model extends the successful LLM formula to more abstract planning in an unsupervised way. Empirically, we demonstrate that our method improves language modeling performance in general, particularly with respect to text structure. Because our framework uses a planner module that is unsupervised and external to the LLM, new planner modules can be trained at large scale and easily be shared with the community.
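The abstract describes the recipe at a high level: sentence embeddings are clustered so that each centroid serves as an abstract "writing action", and a planner is trained to predict the action of the upcoming text from the preceding context. The following is a minimal sketch of that pipeline under stated assumptions, not the authors' implementation: the MPNet-based sentence encoder, the scikit-learn k-means clustering, the logistic-regression planner, and all hyperparameters are illustrative choices.

```python
# Sketch of the planner pipeline suggested by the abstract (illustrative only).
from sentence_transformers import SentenceTransformer
from sklearn.cluster import KMeans
from sklearn.linear_model import LogisticRegression

# Toy corpus in document order; in practice this would be a large unlabeled corpus.
sentences = [
    "The city council met on Monday.",
    "It debated the new housing budget.",
    "The vote was postponed until spring.",
    "The local team won its opening match.",
    "The striker scored twice in the second half.",
    "Fans celebrated in the main square.",
    "A storm is expected over the weekend.",
    "Forecasters warn of heavy rain and wind.",
    "Residents are advised to stay indoors.",
]

# Sentence embeddings (assumed encoder choice).
encoder = SentenceTransformer("all-mpnet-base-v2")
emb = encoder.encode(sentences)  # shape: (n_sentences, embedding_dim)

# 1) Abstract "writing actions" = k-means centroids of the text embedding space.
kmeans = KMeans(n_clusters=3, n_init=10, random_state=0).fit(emb)
actions = kmeans.labels_  # action id assigned to each sentence

# 2) Planner: predict the NEXT sentence's action from the current context
#    (represented here, for simplicity, by the current sentence's embedding).
X, y = emb[:-1], actions[1:]
planner = LogisticRegression(max_iter=1000).fit(X, y)

# 3) At generation time, the predicted action (a centroid) would condition the
#    language model, e.g. as an extra input it attends to.
next_action = planner.predict(emb[-1:])[0]
action_vector = kmeans.cluster_centers_[next_action]
print("planned action id:", next_action, "centroid shape:", action_vector.shape)
```

Because the planner in this framing is external to the language model and trained without labels, the conditioning step (3) is the only point of contact with the LLM, which is what allows planner modules to be trained and shared independently.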