Efficient Industrial sLLMs through Domain Adaptive Continual Pretraining: Method, Evaluation and Applications (2507.06795v1)

Published 9 Jul 2025 in cs.CL, cs.AI, and cs.LG

Abstract: The emergence of open-source LLMs has expanded opportunities for enterprise applications; however, many organizations still lack the infrastructure to deploy and maintain large-scale models. As a result, small LLMs (sLLMs) have become a practical alternative, despite their inherent performance limitations. While Domain Adaptive Continual Pretraining (DACP) has been previously explored as a method for domain adaptation, its utility in commercial applications remains under-examined. In this study, we validate the effectiveness of applying a DACP-based recipe across diverse foundation models and service domains. Through extensive experiments and real-world evaluations, we demonstrate that DACP-applied sLLMs achieve substantial gains in target domain performance while preserving general capabilities, offering a cost-efficient and scalable solution for enterprise-level deployment.

Collections

Summary

We haven't generated a summary for this paper yet.

Summarize Now

Follow-up Questions

We haven't generated follow-up questions for this paper yet.

Generate Now

Efficient Industrial sLLMs through Domain Adaptive Continual Pretraining: Method, Evaluation and Applications (2507.06795v1)

Collections

Summary

Follow-up Questions

Related Papers

Authors (10)