Generalization of LLaMA Pro block expansion beyond coding and mathematics
Ascertain whether the LLaMA Pro post-pretraining block expansion method remains effective in complex and open-ended domains outside coding and mathematics.
Sponsor
References
Therefore, the effectiveness of the block expansion method in more complex and open-ended domains is yet to be verified.
— Towards Incremental Learning in Large Language Models: A Critical Review
(2404.18311 - Jovanovic et al., 28 Apr 2024) in Section 2.1 (Continual Learning) – LLAMA PRO