
A Closer Look at Spatial-Slice Features Learning for COVID-19 Detection (2404.01643v2)

Published 2 Apr 2024 in cs.CV, eess.IV, and cs.LG

Abstract: Conventional Computed Tomography (CT) image recognition faces two significant challenges: (1) The resolution and size of CT scans vary considerably, which imposes strict requirements on the input size and adaptability of models. (2) A CT scan contains a large number of out-of-distribution (OOD) slices, and the crucial features may be present only in specific spatial regions and slices of the entire scan. How can we effectively locate them? To address this, we introduce an enhanced Spatial-Slice Feature Learning (SSFL++) framework specifically designed for CT scans. It aims to filter out OOD data within the whole CT scan, enabling us to select the crucial spatial regions and slices for analysis while reducing redundancy by 70% in total. Meanwhile, we propose a Kernel-Density-based slice Sampling (KDS) method to improve stability during the training and inference stages, thereby speeding up convergence and boosting performance. The experiments demonstrate the promising performance of our approach using a simple EfficientNet-2D (E2D) model, even with only 1% of the training data. The efficacy of our approach has been validated on the COVID-19-CT-DB datasets provided by the DEF-AI-MIA workshop, held in conjunction with CVPR 2024. Our source code is available at https://github.com/ming053l/E2D
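
The abstract names Kernel-Density-based slice Sampling (KDS) but does not spell out how it is computed. The sketch below shows one generic way to sample slices in proportion to a kernel density estimate; it is an illustration under stated assumptions, not the authors' implementation. The function names, the optional per-slice `weights` (for example, a lung-area score), the Gaussian kernel, and the Scott's-rule bandwidth with a one-slice floor are all hypothetical choices made here.

```python
import numpy as np


def slice_sampling_probs(indices, weights=None, bandwidth=None):
    """Return sampling probabilities for candidate slices from a Gaussian KDE.

    indices:   1-D positions of candidate slices along the scan axis.
    weights:   optional non-negative per-slice scores (hypothetical, e.g. lung area).
    bandwidth: kernel width in slices; defaults to Scott's rule (an assumption).
    """
    x = np.asarray(indices, dtype=float)
    if x.size == 0:
        raise ValueError("need at least one candidate slice")
    w = np.ones_like(x) if weights is None else np.asarray(weights, dtype=float)
    if w.shape != x.shape or np.any(w < 0) or w.sum() <= 0:
        raise ValueError("weights must be non-negative, match indices, and not all be zero")
    if bandwidth is None:
        # Scott's rule for 1-D data, floored at one slice to avoid a degenerate kernel.
        spread = x.std(ddof=1) if x.size > 1 else 0.0
        bandwidth = max(spread * x.size ** (-0.2), 1.0)
    # Weighted Gaussian kernel density evaluated at every candidate slice.
    z = (x[:, None] - x[None, :]) / bandwidth
    density = (np.exp(-0.5 * z ** 2) * w[None, :]).sum(axis=1)
    return density / density.sum()


def sample_slices(indices, num_samples, weights=None, bandwidth=None, seed=None):
    """Draw slice indices with probability proportional to the estimated density."""
    rng = np.random.default_rng(seed)
    idx = np.asarray(indices)
    probs = slice_sampling_probs(idx, weights, bandwidth)
    # Sample without replacement unless more slices are requested than exist.
    replace = num_samples > idx.size
    chosen = rng.choice(idx.size, size=num_samples, replace=replace, p=probs)
    return np.sort(idx[chosen])


if __name__ == "__main__":
    candidates = np.arange(40)
    # Hypothetical scores that peak in the middle of the scan.
    scores = np.exp(-0.5 * ((candidates - 20) / 5.0) ** 2)
    print(sample_slices(candidates, 8, weights=scores, seed=0))
```

Under these assumptions, slices near the high-scoring region are drawn more often than slices at the edges of the scan, and a fixed number of slices is returned regardless of scan length, which is one way to handle scans of varying size.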

