
Same Pre-training Loss, Better Downstream: Implicit Bias Matters for Language Models (2210.14199v1)

Published 25 Oct 2022 in cs.LG

Abstract: Language modeling on large-scale datasets leads to impressive performance gains on various downstream language tasks. The validation pre-training loss (or perplexity in autoregressive language modeling) is often used as the evaluation metric when developing language models since the pre-training loss tends to be well-correlated with downstream performance (which is itself difficult to evaluate comprehensively). Contrary to this conventional wisdom, this paper shows that 1) pre-training loss cannot fully explain downstream performance and 2) flatness of the model is well-correlated with downstream performance where pre-training loss is not. On simplified datasets, we identify three ways to produce models with the same (statistically optimal) pre-training loss but different downstream performance: continuing pre-training after convergence, increasing the model size, and changing the training algorithm. These experiments demonstrate the existence of implicit bias of pre-training algorithms/optimizers -- among models with the same minimal pre-training loss, they implicitly prefer more transferable ones. Toward understanding this implicit bias, we prove that SGD with standard mini-batch noise implicitly prefers flatter minima in language models, and empirically observe a strong correlation between flatness and downstream performance among models with the same minimal pre-training loss. We also prove in a synthetic language setting that among the models with the minimal pre-training loss, the flattest model transfers to downstream tasks.
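A common proxy for the flatness the abstract refers to is the curvature of the pre-training loss, often summarized by the trace of its Hessian. The snippet below is a minimal sketch of a Hutchinson-style trace-of-Hessian estimate in PyTorch; the `loss_fn` and `params` arguments are hypothetical placeholders supplied by the caller, and this is an illustrative approximation rather than the paper's exact measurement protocol.

```python
import torch

def hutchinson_hessian_trace(loss_fn, params, n_samples=10):
    """Estimate tr(H) of the loss Hessian w.r.t. `params` (Hutchinson estimator).

    loss_fn: zero-argument callable returning a scalar pre-training loss on a
             fixed batch (hypothetical placeholder, not from the paper).
    params:  iterable of parameter tensors with requires_grad=True.
    """
    params = list(params)
    loss = loss_fn()
    # First-order gradients with a graph so we can differentiate them again.
    grads = torch.autograd.grad(loss, params, create_graph=True)

    trace = 0.0
    for _ in range(n_samples):
        # Rademacher probe vectors v with entries in {-1, +1}.
        vs = [(torch.rand_like(p) < 0.5).to(p.dtype) * 2 - 1 for p in params]
        # Hessian-vector product Hv via a second backward pass on (grad . v).
        grad_dot_v = sum((g * v).sum() for g, v in zip(grads, vs))
        hvps = torch.autograd.grad(grad_dot_v, params, retain_graph=True)
        # v^T H v is an unbiased estimate of tr(H) for Rademacher v.
        trace += sum((h * v).sum().item() for h, v in zip(hvps, vs))
    return trace / n_samples
```

Under the abstract's claim, among models reaching the same minimal pre-training loss, a smaller Hessian trace (a flatter minimum) would be expected to go along with better downstream transfer.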

Authors (4)
  1. Hong Liu (395 papers)
  2. Sang Michael Xie (21 papers)
  3. Zhiyuan Li (304 papers)
  4. Tengyu Ma (117 papers)
Citations (37)