Is It Worth the (Environmental) Cost? Limited Evidence for Temporal Adaptation via Continuous Training (2210.07365v2)

Published 13 Oct 2022 in cs.CL

Abstract: Language is constantly changing and evolving, leaving LLMs to become quickly outdated. Consequently, we should continuously update our models with new data to expose them to new events and facts. However, that requires additional computation, which means new carbon emissions. Do any measurable benefits justify this cost? This paper looks for empirical evidence to support continuous training. We reproduce existing benchmarks and extend them to include additional time periods, models, and tasks. Our results show that the downstream task performance of temporally adapted English models on social media data does not improve over time. Pretrained models without temporal adaptation are actually significantly more effective and efficient. However, we also note a lack of suitable temporal benchmarks. Our findings invite a critical reflection on when and how to temporally adapt LLMs, accounting for sustainability.
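For context, the "continuous training" the paper evaluates is usually implemented as continued pretraining: resuming the model's original self-supervised objective on text from a newer time period. The sketch below illustrates one such round under a HuggingFace-style setup; the checkpoint name, data file, and hyperparameters are illustrative assumptions, not the paper's configuration.

```python
# A minimal sketch of temporal adaptation via continued pretraining.
# The checkpoint, file path, and hyperparameters are hypothetical.
from datasets import load_dataset
from transformers import (
    AutoModelForMaskedLM,
    AutoTokenizer,
    DataCollatorForLanguageModeling,
    Trainer,
    TrainingArguments,
)

checkpoint = "bert-base-uncased"  # any pretrained encoder
tokenizer = AutoTokenizer.from_pretrained(checkpoint)
model = AutoModelForMaskedLM.from_pretrained(checkpoint)

# Hypothetical corpus of recent text (e.g., last quarter's tweets),
# one document per line.
recent = load_dataset("text", data_files={"train": "tweets_2022Q3.txt"})

def tokenize(batch):
    return tokenizer(batch["text"], truncation=True, max_length=128)

tokenized = recent["train"].map(
    tokenize, batched=True, remove_columns=["text"]
)

# Continue the original masked-LM objective on the newer data.
collator = DataCollatorForLanguageModeling(tokenizer, mlm_probability=0.15)

trainer = Trainer(
    model=model,
    args=TrainingArguments(
        output_dir="temporally-adapted-model",
        per_device_train_batch_size=32,
        num_train_epochs=1,  # a single pass over the new time slice
        learning_rate=2e-5,
    ),
    train_dataset=tokenized,
    data_collator=collator,
)
trainer.train()
trainer.save_model()  # the adapted checkpoint for this time period
```

Each adaptation round repeats this loop on the next time slice; that recurring compute (and its carbon footprint) is exactly the cost the paper weighs against the lack of downstream gains.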

Authors (4)
  1. Giuseppe Attanasio (21 papers)
  2. Debora Nozza (17 papers)
  3. Federico Bianchi (47 papers)
  4. Dirk Hovy (57 papers)
Citations (3)

