Generalizability of model merging beyond sentence classification
Determine whether a model-merging adaptation strategy generalizes to NLP tasks beyond sentence classification, in both monolingual and code-mixed settings. The strategy merges a continued pre-training checkpoint with a base multilingual model using Task Arithmetic or TIES and then fine-tunes the merged model on labeled data; it has been shown to improve performance on code-mixed sentence classification for English-Hindi and English-Spanish.
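For readers unfamiliar with the merging step, the following is a minimal sketch of Task Arithmetic on toy parameter dictionaries, with an optional TIES-style magnitude trim. It is an illustration under stated assumptions, not the paper's implementation: the function names, the parameter name `layer.weight`, the scale of 0.5, and the keep fraction of 0.5 are all hypothetical. With a single task vector, the sign-election step of TIES has nothing to resolve, so only trimming is shown. The subsequent fine-tuning on labeled data is omitted.

```python
from typing import Dict, List

# Toy stand-in for a model state dict: parameter name -> flat list of weights.
StateDict = Dict[str, List[float]]


def task_vector(base: StateDict, adapted: StateDict) -> StateDict:
    """Task vector: per-parameter difference (adapted - base)."""
    return {
        name: [a - b for a, b in zip(adapted[name], base[name])]
        for name in base
    }


def trim_by_magnitude(tau: StateDict, keep_fraction: float) -> StateDict:
    """TIES-style trim: keep the largest-magnitude entries, zero the rest.

    Entries tied with the threshold are kept, so slightly more than
    keep_fraction of the entries may survive.
    """
    magnitudes = sorted(
        (abs(v) for values in tau.values() for v in values), reverse=True
    )
    k = max(1, int(len(magnitudes) * keep_fraction))
    threshold = magnitudes[k - 1]
    return {
        name: [v if abs(v) >= threshold else 0.0 for v in values]
        for name, values in tau.items()
    }


def merge(base: StateDict, tau: StateDict, scale: float) -> StateDict:
    """Task Arithmetic merge: base + scale * task_vector."""
    return {
        name: [b + scale * t for b, t in zip(base[name], tau[name])]
        for name in base
    }


# Hypothetical weights: a base model and a continued pre-training checkpoint.
base = {"layer.weight": [0.0, 1.0, 2.0, 4.0]}
cpt = {"layer.weight": [0.5, 1.0, 1.0, 4.25]}

tau = task_vector(base, cpt)
merged_plain = merge(base, tau, scale=0.5)
merged_trimmed = merge(base, trim_by_magnitude(tau, keep_fraction=0.5), scale=0.5)
```

In this sketch, trimming leaves the smallest update (the last weight) at its base value, while plain Task Arithmetic moves every weight that changed during continued pre-training.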
References
Therefore, the generalizability of our findings to other NLP tasks is unclear.
— Adapting Multilingual Models to Code-Mixed Tasks via Model Merging
(2510.19782 - Kodali et al., 22 Oct 2025) in Section 6 (Discussion), Limitations