Streaming Lossless Volumetric Compression of Medical Images Using Gated Recurrent Convolutional Neural Network (2311.16200v1)
Abstract: Deep learning-based lossless compression methods offer substantial advantages in compressing medical volumetric images. Nevertheless, many learning-based algorithms encounter a trade-off between practicality and compression performance. This paper introduces a hardware-friendly streaming lossless volumetric compression framework, utilizing merely one-thousandth of the model weights compared to other learning-based compression frameworks. We propose a gated recurrent convolutional neural network that combines diverse convolutional structures and fusion gate mechanisms to capture the inter-slice dependencies in volumetric images. Based on such contextual information, we can predict the pixel-by-pixel distribution for entropy coding. Guided by hardware/software co-design principles, we implement the proposed framework on Field Programmable Gate Array to achieve enhanced real-time performance. Extensive experimental results indicate that our method outperforms traditional lossless volumetric compressors and state-of-the-art learning-based lossless compression methods across various medical image benchmarks. Additionally, our method exhibits robust generalization ability and competitive compression speed
- Imaging of covid-19: Ct, mri, and pet. In Seminars in Nuclear Medicine, volume 51, pages 312–320. Elsevier, 2021.
- Automated segmentation of tissues using ct and mri: a systematic review. Academic radiology, 26(12):1695–1706, 2019.
- Whole liver segmentation based on deep learning and manual adjustment for clinical use in sirt. European journal of nuclear medicine and molecular imaging, 47:2742–2752, 2020.
- The current role of image compression standards in medical imaging. Information, 8(4):131, 2017.
- Overview of the dicom standard. In 2008 50th International Symposium ELMAR, volume 1, pages 39–44. IEEE, 2008.
- Greg Roelofs. PNG: the definitive guide. O’Reilly & Associates, Inc., 1999.
- The loco-i lossless image compression algorithm: Principles and standardization into jpeg-ls. IEEE Transactions on Image processing, 9(8):1309–1324, 2000.
- Jpeg2000: Standard for interactive imaging. Proceedings of the IEEE, 90(8):1336–1357, 2002.
- Jp3d–extensions for three-dimensional data (part 10). The JPEG 2000 Suite, pages 199–227, 2009.
- Overview of the high efficiency video coding (hevc) standard. IEEE Transactions on circuits and systems for video technology, 22(12):1649–1668, 2012.
- Michael Niedermayer. Ffv1 video codec specification. URL https://www. ffmpeg. org/michael/ffv1. html, 2013.
- Low dose abdominal ct image reconstruction: An unsupervised learning based approach. In 2019 IEEE international conference on image processing (ICIP), pages 1351–1355. IEEE, 2019.
- Pixelcnn++: Improving the pixelcnn with discretized logistic mixture likelihood and other modifications. arXiv preprint arXiv:1701.05517, 2017.
- Practical full resolution learned lossless image compression. In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pages 10629–10638, 2019.
- Pilc: Practical image lossless compression with an end-to-end gpu oriented neural framework. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pages 3739–3748, 2022.
- Integer discrete flows and lossless compression. Advances in Neural Information Processing Systems, 32, 2019.
- Lossless coding of point cloud geometry using a deep generative model. IEEE Transactions on Circuits and Systems for Video Technology, 31(12):4617–4629, 2021.
- Exploiting intra-slice and inter-slice redundancy for learning-based lossless volumetric image compression. IEEE Transactions on Image Processing, 31:1697–1707, 2022.
- aiwave: Volumetric image compression with 3-d trained affine wavelet-like transform. IEEE Transactions on Medical Imaging, 42(3):606–618, 2022.
- Volumetric end-to-end optimized compression for brain images. In 2020 IEEE International Conference on Visual Communications and Image Processing (VCIP), pages 503–506. IEEE, 2020.
- Lossless compression of medical images using 3-d predictors. IEEE transactions on medical imaging, 36(11):2250–2260, 2017.
- Lossy image compression with normalizing flows. arXiv preprint arXiv:2008.10486, 2020.
- Deep double descent: Where bigger models and more data hurt. Journal of Statistical Mechanics: Theory and Experiment, 2021(12):124003, 2021.
- Predicting on the edge: Identifying where a larger model does better. arXiv preprint arXiv:2202.07652, 2022.
- Conditional image generation with pixelcnn decoders. Advances in neural information processing systems, 29, 2016.
- Realizing transparent os/apps compression in mobile devices at zero latency overhead. IEEE Transactions on Computers, 66(7):1188–1199, 2017.
- Neural discrete representation learning. Advances in neural information processing systems, 30, 2017.
- The expressive power of neural networks: A view from the width. Advances in neural information processing systems, 30, 2017.
- Claude Elwood Shannon. A mathematical theory of communication. The Bell system technical journal, 27(3):379–423, 1948.
- Arithmetic coding for data compression. Communications of the ACM, 30(6):520–540, 1987.
- François Chollet. Xception: Deep learning with depthwise separable convolutions. In Proceedings of the IEEE conference on computer vision and pattern recognition, pages 1251–1258, 2017.
- Empirical evaluation of gated recurrent neural networks on sequence modeling. arXiv preprint arXiv:1412.3555, 2014.
- Binarized neural networks. Advances in neural information processing systems, 29, 2016.
- Paul J Werbos. Backpropagation through time: what it does and how to do it. Proceedings of the IEEE, 78(10):1550–1560, 1990.
- Long short-term memory. Supervised sequence labelling with recurrent neural networks, pages 37–45, 2012.
- Deep-learning-assisted diagnosis for knee magnetic resonance imaging: development and retrospective validation of mrnet. PLoS medicine, 15(11):e1002699, 2018.
- Chaos challenge-combined (ct-mr) healthy abdominal organ segmentation. Medical Image Analysis, 69:101950, 2021.
- Deeplesion: automated mining of large-scale lesion annotations and universal lesion detection with deep learning. Journal of medical imaging, 5(3):036501–036501, 2018.
- Qianhao Chen (2 papers)
- Jietao Chen (2 papers)