Towards Generalizable Tumor Synthesis (2402.19470v2)
Abstract: Tumor synthesis enables the creation of artificial tumors in medical images, facilitating the training of AI models for tumor detection and segmentation. However, success in tumor synthesis hinges on creating visually realistic tumors that are generalizable across multiple organs and, furthermore, the resulting AI models being capable of detecting real tumors in images sourced from different domains (e.g., hospitals). This paper made a progressive stride toward generalizable tumor synthesis by leveraging a critical observation: early-stage tumors (< 2cm) tend to have similar imaging characteristics in computed tomography (CT), whether they originate in the liver, pancreas, or kidneys. We have ascertained that generative AI models, e.g., Diffusion Models, can create realistic tumors generalized to a range of organs even when trained on a limited number of tumor examples from only one organ. Moreover, we have shown that AI models trained on these synthetic tumors can be generalized to detect and segment real tumors from CT volumes, encompassing a broad spectrum of patient demographics, imaging protocols, and healthcare facilities.
- Automatic detection of pancreatic lesions and main pancreatic duct dilatation on portal venous ct scans using deep learning. Investigative Radiology, 2023.
- The medical segmentation decathlon. arXiv preprint arXiv:2106.05735, 2021.
- ViViT: A video vision transformer. In Proceedings of the IEEE/CVF international conference on computer vision, pages 6836–6846, 2021.
- Diagnosis and staging of hepatocellular carcinoma (hcc): current guidelines. European journal of radiology, 101:72–81, 2018.
- Is space-time attention all you need for video understanding? In International Conference on Machine Learning, pages 813–824. PMLR, 2021.
- The liver tumor segmentation benchmark (lits). arXiv preprint arXiv:1901.04056, 2019.
- Synthseg: Segmentation of brain mri scans of any contrast and resolution without retraining. Medical image analysis, 86:102789, 2023.
- Harry B Burke. Outcome prediction and the future of the tnm staging system, 2004.
- Cancerunit: Towards a single unified model for effective detection, segmentation, and diagnosis of eight major cancers using a large collection of ct scans. In Proceedings of the IEEE/CVF International Conference on Computer Vision, pages 21327–21338, 2023.
- Synthetic data in machine learning for medicine and healthcare. Nature Biomedical Engineering, 5(6):493–497, 2021.
- A review of medical image data augmentation techniques for deep learning applications. Journal of Medical Imaging and Radiation Oncology, 65(5):545–563, 2021.
- Ct and mr imaging diagnosis and staging of hepatocellular carcinoma: part i. development, growth, and spread: key pathologic and imaging aspects. Radiology, 272(3):635–654, 2014.
- Acquiring weak annotations for tumor localization in temporal and volumetric data. Machine Intelligence Research, pages 1–13, 2024.
- Diagnosis and detection of pancreatic cancer. The Cancer Journal, 23(6):333–342, 2017.
- Utility of ct radiomics features in differentiation of pancreatic ductal adenocarcinoma from normal pancreatic tissue. American Journal of Roentgenology, 213(2):349–357, 2019.
- Generative adversarial networks: An overview. IEEE signal processing magazine, 35(1):53–65, 2018.
- Boosting dermatoscopic lesion segmentation via diffusion models with visual and textual prompts. arXiv preprint arXiv:2310.02906, 2023.
- Implicit generation and modeling with energy based models. Advances in Neural Information Processing Systems, 32, 2019.
- N Reed Dunnick. Renal cell carcinoma: staging and surveillance. Abdominal Radiology, 41:1079–1085, 2016.
- Imaging diagnosis and staging of pancreatic ductal adenocarcinoma: a comprehensive review. Insights into imaging, 11(1):1–13, 2020.
- Taming transformers for high-resolution image synthesis. In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pages 12873–12883, 2021.
- Can segmentation models be trained with fully synthetically generated data? In International Workshop on Simulation and Synthesis in Medical Imaging, pages 79–90. Springer, 2022.
- Tnm staging system for renal-cell carcinoma: current status and future perspectives. The lancet oncology, 8(6):554–558, 2007.
- Pathologic, molecular, and prognostic radiologic features of hepatocellular carcinoma. Radiographics, 41(6):1611–1631, 2021.
- Synthetic data accelerates the development of generalizable learning-based algorithms for x-ray image analysis. Nature Machine Intelligence, 5(3):294–308, 2023.
- Pet image denoising based on denoising diffusion probabilistic model. European Journal of Nuclear Medicine and Molecular Imaging, pages 1–11, 2023.
- Generative adversarial nets. Advances in neural information processing systems, 27, 2014.
- Deep learning. MIT press Cambridge, 2016.
- Generative adversarial networks. Communications of the ACM, 63(11):139–144, 2020.
- Synthesizing diverse lung nodules wherever massively: 3d multi-conditional gan-based ct image augmentation for object detection. In 2019 International Conference on 3D Vision (3DV), pages 729–737. IEEE, 2019.
- Swin unetr: Swin transformers for semantic segmentation of brain tumors in mri images. In International MICCAI Brainlesion Workshop, pages 272–284. Springer, 2021.
- An international challenge to use artificial intelligence to define the state-of-the-art in kidney and kidney tumor segmentation in ct imaging., 2020.
- Denoising diffusion probabilistic models. Advances in Neural Information Processing Systems, 33:6840–6851, 2020.
- Metgan: Generative tumour inpainting and modality synthesis in light sheet microscopy. In Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision, pages 227–237, 2022.
- Synthetic tumors make ai segment tumors better. NeurIPS Workshop on Medical Imaging meets NeurIPS, 2022.
- Label-free liver tumor segmentation. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pages 7422–7432, 2023a.
- Synthetic data as validation, 2023b.
- nnu-net: a self-configuring method for deep learning-based biomedical image segmentation. Nature Methods, 18(2):203–211, 2021.
- Deflating dataset bias using synthetic data augmentation. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops, pages 772–773, 2020.
- Free-form tumor synthesis in computed tomography images via richer generative adversarial network. Knowledge-Based Systems, 218:106753, 2021.
- Pate-gan: Generating synthetic data with differential privacy guarantees. In International conference on learning representations, 2018.
- Label-assemble: Leveraging multiple datasets with partial labels. In 2023 IEEE 20th International Symposium on Biomedical Imaging (ISBI), pages 1–5. IEEE, 2023.
- Diffusion adversarial representation learning for self-supervised vessel segmentation. arXiv preprint arXiv:2209.14566, 2022.
- Maximum likelihood training of parametrized diffusion model. 2021.
- Auto-encoding variational bayes. arXiv preprint arXiv:1312.6114, 2013.
- Semi-supervised learning with deep generative models. Advances in neural information processing systems, 27, 2014.
- An introduction to variational autoencoders. Foundations and Trends® in Machine Learning, 12(4):307–392, 2019.
- Normalizing flows: An introduction and review of current methods. IEEE transactions on pattern analysis and machine intelligence, 43(11):3964–3979, 2020.
- Combining in vitro diagnostics with in vivo imaging for earlier detection of pancreatic ductal adenocarcinoma: challenges and solutions. Radiology, 277(3):644–661, 2015.
- A tutorial on energy-based learning. Predicting structured data, 1(0), 2006.
- Imaging renal cell carcinoma with ultrasonography, ct and mri. Nature Reviews Urology, 7(6):311–325, 2010.
- Early detection and localization of pancreatic cancer by label-free tumor synthesis. MICCAI Workshop on Big Task Small Data, 1001-AI, 2023.
- How well do supervised models transfer to 3d image segmentation? In The Twelfth International Conference on Learning Representations, 2024.
- Clip-driven universal model for organ segmentation and tumor detection. In Proceedings of the IEEE/CVF International Conference on Computer Vision, pages 21152–21164, 2023.
- Pseudo-label guided image synthesis for semi-supervised covid-19 pneumonia infection segmentation. IEEE Transactions on Medical Imaging, 2022a.
- Learning from synthetic ct images via test-time training for liver tumor segmentation. IEEE transactions on medical imaging, 41(9):2510–2520, 2022b.
- Conversion between ct and mri images using diffusion and score-matching models. arXiv preprint arXiv:2209.12104, 2022.
- A novel unified conditional score-based generative framework for multi-modal medical image completion. arXiv preprint arXiv:2207.03430, 2022.
- Improved denoising diffusion probabilistic models. In International Conference on Machine Learning, pages 8162–8171. PMLR, 2021.
- Multi-domain adaptation in brain mri through paired consistency and adversarial learning. In Domain Adaptation and Representation Transfer and Medical Image Learning with Less Labels and Imperfect Data, pages 54–62. Springer, 2019.
- Unsupervised medical image translation with adversarial diffusion models. IEEE Transactions on Medical Imaging, 2023.
- Normalizing flows for probabilistic modeling and inference. The Journal of Machine Learning Research, 22(1):2617–2680, 2021.
- Abdomenatlas-8k: Annotating 8,000 abdominal ct volumes for multi-organ segmentation in three weeks. Conference on Neural Information Processing Systems, 2023.
- Hierarchical text-conditional image generation with clip latents. arXiv preprint arXiv:2204.06125, 1(2):3, 2022.
- Tnm staging of neoplasms of the endocrine pancreas: results from a large international cohort study. Journal of the National Cancer Institute, 104(10):764–777, 2012.
- High-resolution image synthesis with latent diffusion models. In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pages 10684–10695, 2022.
- Palette: Image-to-image diffusion models. In ACM SIGGRAPH 2022 Conference Proceedings, pages 1–10, 2022.
- Abnormal colon polyp image synthesis using conditional adversarial networks for improved detection performance. IEEE Access, 6:56007–56017, 2018.
- Learning fixed points in generative adversarial networks: From image-to-image translation to disease detection and localization. In Proceedings of the IEEE International Conference on Computer Vision, pages 191–200, 2019.
- Arthur T Skarin. Atlas of Diagnostic Oncology E-Book. Elsevier Health Sciences, 2015.
- Deep unsupervised learning using nonequilibrium thermodynamics. In International conference on machine learning, pages 2256–2265. PMLR, 2015.
- Score-based generative modeling through stochastic differential equations. arXiv preprint arXiv:2011.13456, 2020.
- Solving inverse problems in medical imaging with score-based generative models. arXiv preprint arXiv:2111.08005, 2021.
- Score-based generative modeling in latent space. Advances in Neural Information Processing Systems, 34:11287–11302, 2021.
- Computational radiomics system to decode the radiographic phenotype. Cancer research, 77(21):e104–e107, 2017.
- Comparison of machine learning methods for classifying mediastinal lymph node metastasis of non-small cell lung cancer from 18f-fdg pet/ct images. EJNMMI research, 7(1):1–11, 2017.
- Anomaly segmentation in retinal images with poisson-blending data augmentation. Medical Image Analysis, page 102534, 2022.
- CT and MRI of small renal masses. Br J Radiol, 91(1087):20180131, 2018.
- Pancreatic image augmentation based on local region texture synthesis for tumor segmentation. In International Conference on Artificial Neural Networks, pages 419–431. Springer, 2022.
- Diffusion models for implicit image segmentation ensembles. In International Conference on Medical Imaging with Deep Learning, pages 1336–1348. PMLR, 2022.
- Anoddpm: Anomaly detection with denoising diffusion probabilistic models using simplex noise. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pages 650–656, 2022.
- The felix project: Deep networks to detect pancreatic neoplasms. medRxiv, 2022.
- Squid: Deep feature in-painting for unsupervised anomaly detection. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pages 23890–23901, 2023.
- Measurement-conditioned denoising diffusion probabilistic model for under-sampled medical image reconstruction. In International Conference on Medical Image Computing and Computer-Assisted Intervention, pages 655–664. Springer, 2022.
- Class-aware adversarial lung nodule synthesis in ct images. In 2019 IEEE 16th International Symposium on Biomedical Imaging (ISBI 2019), pages 1348–1352. IEEE, 2019.
- Label-free segmentation of covid-19 lesions in lung ct. IEEE Transactions on Medical Imaging, 2021.
- Time-series generative adversarial networks. Advances in neural information processing systems, 32, 2019.
- Fastflow: Unsupervised anomaly detection and localization via 2d normalizing flows. arXiv preprint arXiv:2111.07677, 2021.
- Self-supervised tumor segmentation with sim2real adaptation. IEEE Journal of Biomedical and Health Informatics, 2023a.
- Continual learning for abdominal multi-organ and tumor segmentation. In International conference on medical image computing and computer-assisted intervention, pages 35–45. Springer, 2023b.
- Unsupervised liver tumor segmentation with pseudo anomaly synthesis. In International Workshop on Simulation and Synthesis in Medical Imaging, pages 86–96. Springer, 2023c.
- Energy-based generative adversarial network. arXiv preprint arXiv:1609.03126, 2016.
- Zongwei Zhou. Towards Annotation-Efficient Deep Learning for Computer-Aided Diagnosis. PhD thesis, Arizona State University, 2021.
- Interpreting medical images. In Intelligent Systems in Medicine and Health, pages 343–371. Springer, 2022.