MatSAM: Efficient Extraction of Microstructures of Materials via Visual Large Model (2401.05638v2)
Abstract: Efficient and accurate extraction of microstructures from micrographs of materials is essential for process optimization and for exploring structure-property relationships. Deep learning-based image segmentation techniques that rely on manual annotation are laborious and time-consuming, and they struggle to meet the demand for model transferability and generalization across images from varied sources. The Segment Anything Model (SAM), a large visual model with powerful deep feature representation and zero-shot generalization capabilities, offers a new approach to image segmentation. In this paper, we propose MatSAM, a general and efficient microstructure extraction solution based on SAM. We design a simple yet effective point-based prompt generation strategy grounded in the distribution and shape of microstructures. Specifically, in an unsupervised and training-free way, it adaptively generates prompt points for different microscopy images, fuses the centroid points of the coarsely extracted region of interest (ROI) with native grid points, and integrates corresponding post-processing operations for quantitative characterization of microstructures. For common microstructures, including grain boundaries and multiple phases, MatSAM achieves zero-shot segmentation performance superior to conventional rule-based methods and even preferable to supervised learning methods, as evaluated on 16 microscopy datasets whose micrographs were imaged with an optical microscope (OM) and a scanning electron microscope (SEM). Notably, on 4 public datasets, MatSAM is unexpectedly competitive with the specialist models for those datasets. We believe that, by removing the need for human labeling, MatSAM can significantly reduce the cost of quantitative characterization and statistical analysis of extensive microstructures of materials, and thus accelerate the design of new materials.
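The prompt-fusion idea described in the abstract (centroids of a coarsely extracted ROI combined with regular grid points) can be illustrated with a minimal, dependency-free sketch. Everything below is an assumption made for illustration and not the authors' implementation: the fixed intensity threshold used for the coarse ROI, the 4-connected region labeling, the `min_dist` de-duplication rule, and all function names are hypothetical. The fused points would then be passed to SAM as point prompts, which is not shown here.

```python
from collections import deque
from math import hypot


def coarse_roi_mask(image, threshold):
    """Binary mask that is True where pixel intensity exceeds the threshold.
    (Assumption: a fixed threshold stands in for the coarse ROI extraction.)"""
    return [[px > threshold for px in row] for row in image]


def roi_centroids(mask):
    """Return the (x, y) centroid of every 4-connected foreground region."""
    h, w = len(mask), len(mask[0])
    seen = [[False] * w for _ in range(h)]
    centroids = []
    for r in range(h):
        for c in range(w):
            if not mask[r][c] or seen[r][c]:
                continue
            # Breadth-first flood fill over one connected region.
            queue = deque([(r, c)])
            seen[r][c] = True
            sum_y = sum_x = count = 0
            while queue:
                y, x = queue.popleft()
                sum_y += y
                sum_x += x
                count += 1
                for ny, nx in ((y - 1, x), (y + 1, x), (y, x - 1), (y, x + 1)):
                    if 0 <= ny < h and 0 <= nx < w and mask[ny][nx] and not seen[ny][nx]:
                        seen[ny][nx] = True
                        queue.append((ny, nx))
            centroids.append((sum_x / count, sum_y / count))
    return centroids


def grid_points(width, height, points_per_side):
    """Regular grid of (x, y) points, one at the centre of each grid cell."""
    step_x = width / points_per_side
    step_y = height / points_per_side
    return [((i + 0.5) * step_x, (j + 0.5) * step_y)
            for j in range(points_per_side)
            for i in range(points_per_side)]


def fuse_prompts(centroids, grid, min_dist):
    """Keep every centroid, then add grid points that are at least
    min_dist away from all points kept so far (hypothetical fusion rule)."""
    fused = list(centroids)
    for gx, gy in grid:
        if all(hypot(gx - px, gy - py) >= min_dist for px, py in fused):
            fused.append((gx, gy))
    return fused


# Toy 6x6 "micrograph" with two bright 2x2 regions.
image = [
    [0, 0, 0, 0, 0, 0],
    [0, 9, 9, 0, 0, 0],
    [0, 9, 9, 0, 0, 0],
    [0, 0, 0, 0, 0, 0],
    [0, 0, 0, 0, 9, 9],
    [0, 0, 0, 0, 9, 9],
]
centroids = roi_centroids(coarse_roi_mask(image, threshold=5))
prompts = fuse_prompts(centroids, grid_points(6, 6, points_per_side=2), min_dist=1.0)
print(prompts)
```

In this toy case the two grid points that coincide with region centroids are dropped, so the grid only contributes points in areas the coarse ROI missed. Note that the centroid of a non-convex region can fall outside the region itself; a practical pipeline would need to handle that case.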