ViM-UNet: Vision Mamba for Biomedical Segmentation
Abstract: CNNs, most notably the UNet, are the default architecture for biomedical segmentation. Transformer-based approaches, such as UNETR, have been proposed to replace them; they benefit from a global field of view but suffer from longer runtimes and higher parameter counts. The recent Vision Mamba architecture offers a compelling alternative to transformers: it also provides a global field of view, but at higher efficiency. Here, we introduce ViM-UNet, a novel segmentation architecture based on Vision Mamba, and compare it to UNet and UNETR on two challenging microscopy instance segmentation tasks. We find that it performs similarly to or better than UNet, depending on the task, and outperforms UNETR while being more efficient. Our code is open source and documented at https://github.com/constantinpape/torch-em/blob/main/vimunet.md.
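The efficiency argument can be illustrated with a rough count of how work grows with image size. The sketch below is not taken from the ViM-UNet code. It assumes non-overlapping 16x16 patches, full self-attention over all tokens for the transformer, and a two-direction sequential scan for the state space model, and it ignores hidden dimensions and constant factors. The function names are hypothetical.

```python
def num_tokens(height: int, width: int, patch: int = 16) -> int:
    """Number of non-overlapping patch tokens (patch size 16 is an assumption)."""
    return (height // patch) * (width // patch)


def attention_pairs(n: int) -> int:
    """Token-pair interactions in full self-attention: quadratic in n."""
    return n * n


def scan_steps(n: int, directions: int = 2) -> int:
    """State updates in a scan over the token sequence: linear in n.

    directions=2 models a forward and a backward pass (an assumption here).
    """
    return directions * n


if __name__ == "__main__":
    for side in (256, 512, 1024):
        n = num_tokens(side, side)
        print(f"{side}x{side}: tokens={n}, "
              f"attention pairs={attention_pairs(n)}, scan steps={scan_steps(n)}")
```

Doubling the image side length quadruples the token count. Under these assumptions the attention pair count then grows 16-fold, while the scan step count grows 4-fold.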