
3DSAM-adapter: Holistic adaptation of SAM from 2D to 3D for promptable tumor segmentation (2306.13465v2)

Published 23 Jun 2023 in cs.CV

Abstract: Although the Segment Anything Model (SAM) achieves impressive results on general-purpose semantic segmentation, with strong generalization on everyday images, its performance on medical image segmentation is less precise and less stable, especially on tumor segmentation tasks involving objects of small size, irregular shape, and low contrast. Notably, the original SAM architecture is designed for 2D natural images and therefore cannot effectively extract 3D spatial information from volumetric medical data. In this paper, we propose a novel adaptation method for transferring SAM from 2D to 3D for promptable medical image segmentation. Through a holistically designed scheme of architecture modifications, we transfer SAM to support volumetric inputs while retaining the majority of its pre-trained parameters for reuse. The fine-tuning process is conducted in a parameter-efficient manner: most of the pre-trained parameters remain frozen, and only a few lightweight spatial adapters are introduced and tuned. Despite the domain gap between natural and medical data and the disparity in spatial arrangement between 2D and 3D, the transformer trained on natural images can effectively capture the spatial patterns present in volumetric medical images with only lightweight adaptations. We conduct experiments on four open-source tumor segmentation datasets, and with a single click prompt, our model outperforms domain state-of-the-art medical image segmentation models on 3 out of 4 tasks, by 8.25%, 29.87%, and 10.11% for kidney tumor, pancreas tumor, and colon cancer segmentation respectively, and achieves similar performance for liver tumor segmentation. We also compare our adaptation method with existing popular adapters and observe significant performance improvements on most datasets.

Authors (8)
  1. Shizhan Gong
  2. Yuan Zhong
  3. Wenao Ma
  4. Jinpeng Li
  5. Zhao Wang
  6. Jingyang Zhang
  7. Pheng-Ann Heng
  8. Qi Dou
Citations (57)

Summary

Holistic Adaptation of SAM from 2D to 3D for Medical Image Segmentation

The paper presents an approach to adapting the Segment Anything Model (SAM) from 2D semantic segmentation to 3D medical image segmentation, targeting tumor segmentation in particular. Designed for 2D natural images, the original SAM architecture cannot capture the 3D spatial context of volumetric medical data, which is essential for accurate tumor identification. The transition from 2D to 3D is not straightforward: medical imaging modalities such as CT and MRI are inherently volumetric, so a holistic architectural adaptation is needed to extract 3D spatial information effectively.
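
One common way to reuse 2D pre-trained weights in a 3D network is to inflate each 2D kernel along a new depth axis, scaling so the 3D response on a depth-constant volume matches the original 2D response. This is an illustrative sketch of the general technique, not necessarily the paper's exact scheme:

```python
def inflate_kernel_2d_to_3d(kernel2d, depth):
    """Replicate a 2D kernel (list of rows) along a new depth axis.

    Each weight is divided by `depth` so that convolving a volume that
    is constant along depth reproduces the 2D kernel's response.
    """
    return [[[w / depth for w in row] for row in kernel2d]
            for _ in range(depth)]

# Example: a 2x2 kernel inflated to depth 2.
k3d = inflate_kernel_2d_to_3d([[1.0, 2.0], [3.0, 4.0]], depth=2)
```

Summing any weight across the depth slices recovers the original 2D weight, which is what preserves the pre-trained behavior at initialization.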

The authors introduce the 3DSAM-adapter, a parameter-efficient adaptation method that reuses the pre-trained SAM parameters and makes selective modifications to accommodate 3D structure. The tuning process avoids comprehensive retraining: the majority of SAM's pre-existing parameters are frozen, and only lightweight spatial adapters are inserted and trained. This design lets the model bridge both the domain gap between natural and medical images and the spatial disparity between 2D and 3D representations.
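
Lightweight adapters of this general kind are typically bottleneck residual blocks inserted into frozen transformer layers; only the small projection matrices are trained. The sketch below is illustrative (not the paper's exact module), with hypothetical weights `W_down` and `W_up`:

```python
def matvec(M, x):
    """Multiply matrix M (list of rows) by vector x."""
    return [sum(m * xi for m, xi in zip(row, x)) for row in M]

def relu(v):
    return [max(0.0, a) for a in v]

def adapter(x, W_down, W_up):
    """Bottleneck adapter: down-project, nonlinearity, up-project,
    add residual. Only W_down and W_up would be trained; the
    surrounding transformer layer stays frozen."""
    h = relu(matvec(W_down, x))   # project to a small bottleneck dim
    out = matvec(W_up, h)         # project back to the model dim
    return [xi + oi for xi, oi in zip(x, out)]
```

A common trick is to initialize `W_up` to zeros so the adapter starts as the identity and fine-tuning departs smoothly from the pre-trained model.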

Evaluation on four open-source tumor segmentation datasets demonstrated promising results. The 3DSAM-adapter outperformed state-of-the-art medical image segmentation models on 3 of the 4 datasets, with Dice score improvements of 8.25%, 29.87%, and 10.11% for kidney tumor, pancreas tumor, and colon cancer segmentation, respectively. For liver tumors, it reached comparable performance. These results indicate a significant gain in segmentation accuracy, especially given the challenge posed by tumors' small size, irregular shapes, and low contrast.
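
The Dice score used in these comparisons measures the overlap between a predicted mask and the ground truth; a minimal implementation over flattened binary masks:

```python
def dice_score(pred, target):
    """Dice coefficient between two binary masks, given as flat 0/1 lists.

    Dice = 2 * |pred ∩ target| / (|pred| + |target|); returns 1.0 when
    both masks are empty (perfect agreement on 'no tumor').
    """
    inter = sum(p * t for p, t in zip(pred, target))
    total = sum(pred) + sum(target)
    return 2.0 * inter / total if total else 1.0
```

For volumetric segmentation the masks are simply flattened 3D volumes; the formula is unchanged.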

The paper's contributions are several. First, it outlines a holistic framework for 2D-to-3D adaptation of segmentation models that adds only a marginal 7.79% increase in parameters while retaining most pre-trained weights. Second, it introduces an efficient fine-tuning method that tunes only 16.96% of the model's parameters, offering significant memory and computation savings without compromising accuracy. Third, a multi-layer aggregation mechanism in the decoder enhances the model's ability to exploit the high-resolution textures needed to delineate fine-grained tumor boundaries. Collectively, these findings support the viability of efficient adaptation strategies for domain-specific applications in medical imaging, with broader implications for volumetric image segmentation in the AI research community.
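
The parameter-efficiency figures above amount to freezing most weights and counting what remains trainable. A toy sketch of that bookkeeping, with hypothetical module names and counts:

```python
def trainable_fraction(param_counts, trainable):
    """Fraction of parameters left trainable after freezing the rest.

    param_counts: {module_name: parameter_count}
    trainable:    set of module names kept unfrozen (e.g. the adapters).
    """
    total = sum(param_counts.values())
    tuned = sum(n for name, n in param_counts.items() if name in trainable)
    return tuned / total

# Hypothetical counts: a large frozen encoder plus small tuned parts.
counts = {"image_encoder": 90_000_000,
          "adapters": 5_000_000,
          "mask_decoder": 5_000_000}
frac = trainable_fraction(counts, {"adapters", "mask_decoder"})  # 0.10
```

The numbers here are invented for illustration; the paper reports tuning 16.96% of the model's parameters.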

The results suggest that this adapted SAM could become a useful tool for medical image segmentation, opening avenues for integrating domain-specific knowledge with generalized pre-trained models. The emphasis on efficient parameter reuse and fine-tuning is promising for both future research and practical deployment, and could extend to other fields requiring volumetric analysis. More broadly, the work illustrates how large foundation models can be leveraged for specialized applications: their general capabilities are retained while the unique demands of a target domain are met with lightweight, targeted changes.