ColPali performance on underrepresented (low-resource) languages
Determine how well ColPali retrieves and generalizes on languages that are underrepresented in the pretraining corpus of its language backbone. ColPali is a late-interaction vision–language retriever built on PaliGemma-3B, whose language backbone is Gemma-2B, and the original work evaluates it only on high-resource languages (English and French).
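One way to frame such an evaluation is to score queries against document pages with late interaction and then report a ranking metric separately for each query language. The sketch below is illustrative only. It assumes query-token and page-patch embeddings have already been computed by some model and are passed in as plain lists of floats. The helper names (`maxsim`, `ndcg_at_k`, `ndcg_by_language`), the input layout, and the choice of binary-relevance nDCG@5 are assumptions made for this example, not the authors' evaluation code.

```python
import math
from collections import defaultdict


def maxsim(query_vecs, doc_vecs):
    """Late-interaction score: for every query token vector, take the
    maximum dot product over all document patch vectors, then sum."""
    return sum(
        max(sum(q * d for q, d in zip(qv, dv)) for dv in doc_vecs)
        for qv in query_vecs
    )


def ndcg_at_k(ranked_ids, relevant_ids, k=5):
    """Binary-relevance nDCG@k for a single query."""
    dcg = sum(
        1.0 / math.log2(rank + 2)
        for rank, doc_id in enumerate(ranked_ids[:k])
        if doc_id in relevant_ids
    )
    ideal = sum(
        1.0 / math.log2(rank + 2)
        for rank in range(min(k, len(relevant_ids)))
    )
    return dcg / ideal if ideal > 0 else 0.0


def ndcg_by_language(queries, corpus, k=5):
    """Average nDCG@k per query language.

    queries: list of dicts with hypothetical keys
        'lang'     -> language code of the query
        'vecs'     -> list of query token embeddings
        'relevant' -> set of relevant document ids
    corpus: dict mapping document id -> list of patch embeddings
    """
    per_lang = defaultdict(list)
    for q in queries:
        ranked = sorted(
            corpus,
            key=lambda doc_id: maxsim(q["vecs"], corpus[doc_id]),
            reverse=True,
        )
        per_lang[q["lang"]].append(ndcg_at_k(ranked, q["relevant"], k))
    return {lang: sum(s) / len(s) for lang, s in per_lang.items()}
```

Comparing the per-language averages for high-resource and low-resource query sets over the same corpus would give a first indication of how large the gap is, provided the query sets are of comparable difficulty.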
References
We also focus on high-resource languages (English and French) and although we have shown the capacity of the ColPali model to generalize to languages outside of its fine-tuning set, it is unclear how the model would perform on languages that are not as represented in the model's language backbone.
— ColPali: Efficient Document Retrieval with Vision Language Models
(2407.01449 - Faysse et al., 27 Jun 2024) in Section: Limitations