Effectiveness of Model Tailor patches in complex and open-domain settings

Establish the effectiveness of Model Tailor’s extracted and decorated sparse parameter patches for mitigating catastrophic forgetting in more complex and open-domain multimodal tasks beyond the evaluated benchmarks.

Background

Model Tailor identifies a minimal set of finetuned parameters (a "patch") and decorates them to balance performance on current and prior tasks, showing improvements on VQA and image captioning with InstructBLIP and LLaVA-1.5.

The authors note that validation to date is limited and the method’s effectiveness remains to be demonstrated in broader, more complex domains.

References

However, the effectiveness of extracted and decorated patches remains to be verified in more complex and open domains.

Towards Incremental Learning in Large Language Models: A Critical Review (2404.18311 - Jovanovic et al., 28 Apr 2024) in Section 2.1 (Continual Learning) – Model Tailor