
Pathological Primitive Segmentation Based on Visual Foundation Model with Zero-Shot Mask Generation (2404.08584v1)

Published 12 Apr 2024 in cs.CV

Abstract: Medical image processing usually requires a model trained on carefully crafted datasets because of unique image characteristics and domain-specific challenges, especially in pathology. Primitive detection and segmentation in digitized tissue samples are essential for objective, automated diagnosis and prognosis of cancer. SAM (Segment Anything Model) was recently developed to segment general objects in natural images with high accuracy, but it requires human prompts to generate masks. In this work, we present a novel approach that adapts SAM's pre-trained natural-image encoder for detection-based region proposals. Regions proposed by the pre-trained encoder are sent to cascaded feature propagation layers for projection. Local semantic and global context are then aggregated across multiple scales for bounding box localization and classification. Finally, the SAM decoder uses the identified bounding boxes as prompts to generate a comprehensive primitive segmentation map. The base framework, SAM, requires no additional training or fine-tuning, yet the pipeline produces end-to-end results for two fundamental segmentation tasks in pathology. On the PanNuke dataset, our method is comparable to state-of-the-art models in F1 score for nuclei detection and in binary/multiclass panoptic quality (bPQ/mPQ) and mask quality (Dice) for segmentation, while offering end-to-end efficiency. Our model also improves Average Precision by 4.5% over Faster R-CNN on the secondary dataset (HuBMAP Kidney). The code is publicly available at https://github.com/learner-codec/autoprom_sam.
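
The segmentation metrics named in the abstract can be illustrated with a minimal sketch. It is not taken from the authors' repository: it assumes each mask is represented as a set of (row, col) pixel coordinates, uses the generic panoptic quality definition (matches require IoU above 0.5), and leaves out the per-image and per-class averaging that distinguishes bPQ from mPQ on PanNuke. The helper names `rect`, `dice`, `iou`, and `panoptic_quality` are hypothetical.

```python
def rect(r0, r1, c0, c1):
    """Hypothetical helper: pixel set of a rectangle, bounds inclusive."""
    return {(r, c) for r in range(r0, r1 + 1) for c in range(c0, c1 + 1)}


def dice(pred, truth):
    """Dice coefficient: 2|A ∩ B| / (|A| + |B|)."""
    if not pred and not truth:
        return 1.0  # two empty masks agree perfectly
    return 2 * len(pred & truth) / (len(pred) + len(truth))


def iou(a, b):
    """Intersection over union of two pixel sets."""
    union = len(a | b)
    return len(a & b) / union if union else 0.0


def panoptic_quality(pred_segments, true_segments, thresh=0.5):
    """Generic PQ: sum of matched IoUs / (TP + 0.5 * FP + 0.5 * FN)."""
    matched_true = set()
    iou_sum = 0.0
    tp = 0
    for pred in pred_segments:
        for j, truth in enumerate(true_segments):
            if j in matched_true:
                continue
            score = iou(pred, truth)
            if score > thresh:  # counts as a true positive
                matched_true.add(j)
                iou_sum += score
                tp += 1
                break
    fp = len(pred_segments) - tp  # predictions with no match
    fn = len(true_segments) - tp  # ground-truth segments missed
    denom = tp + 0.5 * fp + 0.5 * fn
    return iou_sum / denom if denom else 0.0


if __name__ == "__main__":
    predicted = rect(0, 1, 0, 3)      # 8 pixels
    ground_truth = rect(0, 1, 1, 4)   # 8 pixels, 6 shared with predicted
    spurious = rect(10, 11, 10, 11)   # an unmatched prediction
    print(dice(predicted, ground_truth))
    print(panoptic_quality([predicted, spurious], [ground_truth]))
```

With non-overlapping segments, an IoU above 0.5 can hold for at most one ground-truth segment per prediction, which is why the greedy matching loop is sufficient here.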

Authors (6)
  1. Abu Bakor Hayat Arnob
  2. Xiangxue Wang
  3. Yiping Jiao
  4. Xiao Gan
  5. Wenlong Ming
  6. Jun Xu