CPLIP: Zero-Shot Learning for Histopathology with Comprehensive Vision-Language Alignment (2406.05205v1)

Published 7 Jun 2024 in cs.CV, cs.CL, cs.LG, cs.MM, and eess.IV

Abstract: This paper proposes Comprehensive Pathology Language Image Pre-training (CPLIP), a new unsupervised technique designed to enhance the alignment of images and text in histopathology for tasks such as classification and segmentation. This methodology enriches vision-language models by leveraging extensive data without needing ground truth annotations. CPLIP involves constructing a pathology-specific dictionary, generating textual descriptions for images using LLMs, and retrieving relevant images for each text snippet via a pre-trained model. The model is then fine-tuned using a many-to-many contrastive learning method to align complex interrelated concepts across both modalities. Evaluated across multiple histopathology tasks, CPLIP shows notable improvements in zero-shot learning scenarios, outperforming existing methods in both interpretability and robustness and setting a higher benchmark for the application of vision-language models in the field. To encourage further research and replication, the code for CPLIP is available on GitHub at https://cplip.github.io/
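
The many-to-many contrastive step mentioned in the abstract can be illustrated with a generic multi-positive InfoNCE loss. This is only a minimal sketch under stated assumptions, not the CPLIP implementation: the function name `many_to_many_contrastive_loss`, the `pos_mask` matrix marking which image-text pairs count as positives, and the temperature value 0.07 are all hypothetical choices made for illustration.

```python
import numpy as np


def _log_softmax(x, axis):
    """Numerically stable log-softmax along the given axis."""
    shifted = x - x.max(axis=axis, keepdims=True)
    return shifted - np.log(np.exp(shifted).sum(axis=axis, keepdims=True))


def many_to_many_contrastive_loss(img_emb, txt_emb, pos_mask, temperature=0.07):
    """Symmetric multi-positive InfoNCE loss (illustrative, not the paper's exact loss).

    img_emb:  (N, d) image embeddings
    txt_emb:  (M, d) text embeddings
    pos_mask: (N, M) array, 1 where image i and text j are treated as a match
              (hypothetical input; each image may match several texts and vice versa)
    """
    # L2-normalise so the dot product is a cosine similarity.
    img = img_emb / np.linalg.norm(img_emb, axis=1, keepdims=True)
    txt = txt_emb / np.linalg.norm(txt_emb, axis=1, keepdims=True)
    logits = img @ txt.T / temperature
    pos = pos_mask.astype(float)

    # Image-to-text: average log-probability over all positive texts per image.
    # Rows or columns without positives contribute zero (count clamped to 1).
    i2t = -(pos * _log_softmax(logits, axis=1)).sum(axis=1) / np.maximum(pos.sum(axis=1), 1.0)
    # Text-to-image: the same, computed over the image axis.
    t2i = -(pos * _log_softmax(logits, axis=0)).sum(axis=0) / np.maximum(pos.sum(axis=0), 1.0)
    return 0.5 * (i2t.mean() + t2i.mean())


# Toy usage: two concepts, each with two images and two texts.
emb = np.array([[1.0, 0.0], [1.0, 0.0], [0.0, 1.0], [0.0, 1.0]])
block_mask = np.kron(np.eye(2), np.ones((2, 2)))  # positives within each concept
aligned_loss = many_to_many_contrastive_loss(emb, emb, block_mask)
misaligned_loss = many_to_many_contrastive_loss(emb, emb, 1.0 - block_mask)
```

In this toy setup the loss is small when the positive mask agrees with the embedding similarities and large when it contradicts them, which is the behaviour a contrastive fine-tuning objective of this kind relies on.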

Authors (6)
  1. Sajid Javed (39 papers)
  2. Arif Mahmood (50 papers)
  3. Iyyakutti Iyappan Ganapathi (6 papers)
  4. Fayaz Ali Dharejo (8 papers)
  5. Naoufel Werghi (43 papers)
  6. Mohammed Bennamoun (124 papers)
Citations (4)

