Unified-template synergy between prompts and sequence length in ColBERT pre-training
Establish whether ColBERT treats the combination of explicit pre-training prompts ("search_query:" and "search_document:") and increased sequence length as a unified template that yields synergistic performance gains, that is, gains larger than the sum of what each factor contributes on its own. Isolate the individual and combined mechanisms through controlled ablations that vary the prompts and the sequence length independently, across several prompt variations and incremental length scales.
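One way to frame the controlled ablations is as a factorial grid over prompt settings and sequence lengths, with synergy measured as an interaction term (a difference-in-differences on the evaluation metric). The sketch below is illustrative only: the two prompt strings come from the text above, while the length values, the setting names, and the helper functions `ablation_grid` and `interaction_effect` are hypothetical and not taken from any ColBERT codebase.

```python
from itertools import product

# Prompt strings are taken from the text; the setting names are hypothetical.
PROMPT_SETTINGS = {
    "none": ("", ""),
    "prompts": ("search_query: ", "search_document: "),
}

# Assumed incremental length scales, chosen only for illustration.
DOC_LENGTHS = [128, 256, 512]


def ablation_grid(prompt_settings=PROMPT_SETTINGS, doc_lengths=DOC_LENGTHS):
    """Return one configuration per (prompt setting, document length) pair."""
    return [
        {
            "prompt": name,
            "query_prefix": query_prefix,
            "doc_prefix": doc_prefix,
            "doc_length": length,
        }
        for (name, (query_prefix, doc_prefix)), length in product(
            prompt_settings.items(), doc_lengths
        )
    ]


def interaction_effect(baseline, prompt_only, length_only, combined):
    """Gain of the combined setting beyond the sum of the individual gains.

    Each argument is the same evaluation metric measured under one
    configuration. A value near zero means the two factors are additive.
    """
    prompt_gain = prompt_only - baseline
    length_gain = length_only - baseline
    combined_gain = combined - baseline
    return combined_gain - (prompt_gain + length_gain)
```

A clearly positive interaction effect, stable across random seeds and across neighboring length scales, would be consistent with the unified-template conjecture. A value near zero would suggest that prompts and sequence length contribute independently.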
References
We conjecture that ColBERT, unlike dense models, treats the pre-training configuration, specifically the combination of prompts and extended sequence length, as a unified template. However, this remains a preliminary conjecture; further investigation is required using a wider range of prompt variations and incremental length scales to definitively isolate the individual and combined mechanisms at play.