Applicability of ShufflEval to image captioning
Determine whether and how the ShufflEval evaluation methodology, which relies on segment-by-segment translation and order-based plausibility comparisons, can be applied to image captioning, where the source modality lacks an inherent linear order. If it is feasible, construct a concrete adaptation that enables reference-free evaluation of image captions.
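To make the difficulty concrete, here is a minimal sketch of an order-based plausibility comparison in Python. It is an illustration under stated assumptions, not the procedure from the cited paper: the function name `shuffle_score`, the `plausibility` callback, and the tie-handling rule are all hypothetical. The sketch takes translated segments in source order and reports how often that ordering is judged more plausible than a random permutation.

```python
import random
from typing import Callable, Sequence


def shuffle_score(
    segments: Sequence[str],
    plausibility: Callable[[Sequence[str]], float],
    n_shuffles: int = 100,
    seed: int = 0,
) -> float:
    """Fraction of random permutations that the source-order output beats.

    `segments` are per-segment translations in source order (assumption).
    `plausibility` is a hypothetical scorer, e.g. a language-model score
    of the concatenated text; higher means more plausible.
    Ties count as half a win, so an order-insensitive scorer yields 0.5.
    """
    rng = random.Random(seed)
    original = plausibility(list(segments))
    wins = 0.0
    for _ in range(n_shuffles):
        permuted = list(segments)
        rng.shuffle(permuted)  # in-place random permutation
        shuffled = plausibility(permuted)
        if original > shuffled:
            wins += 1.0
        elif original == shuffled:
            wins += 0.5
    return wins / n_shuffles
```

The sketch presupposes that `segments` arrives with a meaningful source order. A video supplies one through time, but a single image does not, so any adaptation to image captioning would first have to say what plays the role of the segments and their ordering.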
References
Put another way, it is not clear how to apply ShufflEval to image captioning, but it could be applied to describing videos.
— On Non-interactive Evaluation of Animal Communication Translators
(2510.15768 - Paradise et al., 17 Oct 2025) in Section 2.4 (Implications for ShufflEval)