Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
97 tokens/sec
GPT-4o
53 tokens/sec
Gemini 2.5 Pro Pro
43 tokens/sec
o3 Pro
4 tokens/sec
GPT-4.1 Pro
47 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

COCO-Counterfactuals: Automatically Constructed Counterfactual Examples for Image-Text Pairs (2309.14356v2)

Published 23 Sep 2023 in cs.LG, cs.CL, and cs.CV

Abstract: Counterfactual examples have proven to be valuable in the field of NLP for both evaluating and improving the robustness of LLMs to spurious correlations in datasets. Despite their demonstrated utility for NLP, multimodal counterfactual examples have been relatively unexplored due to the difficulty of creating paired image-text data with minimal counterfactual changes. To address this challenge, we introduce a scalable framework for automatic generation of counterfactual examples using text-to-image diffusion models. We use our framework to create COCO-Counterfactuals, a multimodal counterfactual dataset of paired image and text captions based on the MS-COCO dataset. We validate the quality of COCO-Counterfactuals through human evaluations and show that existing multimodal models are challenged by our counterfactual image-text pairs. Additionally, we demonstrate the usefulness of COCO-Counterfactuals for improving out-of-domain generalization of multimodal vision-LLMs via training data augmentation.

User Edit Pencil Streamline Icon: https://streamlinehq.com
Authors (3)
  1. Tiep Le (10 papers)
  2. Vasudev Lal (44 papers)
  3. Phillip Howard (28 papers)
Citations (18)

Summary

We haven't generated a summary for this paper yet.

X Twitter Logo Streamline Icon: https://streamlinehq.com