Enhancing Financial Domain Adaptation of Language Models via Model Augmentation (2411.09249v1)

Published 14 Nov 2024 in cs.CL and cs.AI

Abstract: The domain adaptation of LLMs, including LLMs, has become increasingly important as the use of such models continues to expand. This study demonstrates the effectiveness of Composition to Augment LLMs (CALM) in adapting to the financial domain. CALM is a model to extend the capabilities of existing models by introducing cross-attention between two LLMs with different functions. In our experiments, we developed a CALM to enhance the financial performance of an LLM with strong response capabilities by leveraging a financial-specialized LLM. Notably, the CALM was trained using a financial dataset different from the one used to train the financial-specialized LLM, confirming CALM's ability to adapt to various datasets. The models were evaluated through quantitative Japanese financial benchmarks and qualitative response comparisons, demonstrating that CALM enables superior responses with higher scores than the original models and baselines. Additionally, comparative experiments on connection points revealed that connecting the middle layers of the models is most effective in facilitating adaptation to the financial domain. These findings confirm that CALM is a practical approach for adapting LLMs to the financial domain.

Summary

We haven't generated a summary for this paper yet.

Summarize Now

Enhancing Financial Domain Adaptation of Language Models via Model Augmentation (2411.09249v1)

Summary

Related Papers