Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
153 tokens/sec
GPT-4o
7 tokens/sec
Gemini 2.5 Pro Pro
45 tokens/sec
o3 Pro
4 tokens/sec
GPT-4.1 Pro
38 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

ECRF: Entropy-Constrained Neural Radiance Fields Compression with Frequency Domain Optimization (2311.14208v1)

Published 23 Nov 2023 in cs.CV and eess.IV

Abstract: Explicit feature-grid based NeRF models have shown promising results in terms of rendering quality and significant speed-up in training. However, these methods often require a significant amount of data to represent a single scene or object. In this work, we present a compression model that aims to minimize the entropy in the frequency domain in order to effectively reduce the data size. First, we propose using the discrete cosine transform (DCT) on the tensorial radiance fields to compress the feature-grid. This feature-grid is transformed into coefficients, which are then quantized and entropy encoded, following a similar approach to the traditional video coding pipeline. Furthermore, to achieve a higher level of sparsity, we propose using an entropy parameterization technique for the frequency domain, specifically for DCT coefficients of the feature-grid. Since the transformed coefficients are optimized during the training phase, the proposed model does not require any fine-tuning or additional information. Our model only requires a lightweight compression pipeline for encoding and decoding, making it easier to apply volumetric radiance field methods for real-world applications. Experimental results demonstrate that our proposed frequency domain entropy model can achieve superior compression performance across various datasets. The source code will be made publicly available.

Definition Search Book Streamline Icon: https://streamlinehq.com
References (44)
  1. End-to-end optimized image compression. In International Conference on Learning Representations (ICLR), 2017.
  2. Variational image compression with a scale hyperprior. In International Conference on Learning Representations (ICLR), 2018.
  3. Nonlinear transform coding. IEEE Journal of Selected Topics in Signal Processing, 15(2):339–353, 2020.
  4. Mip-nerf: A multiscale representation for anti-aliasing neural radiance fields. In Proceedings of the IEEE/CVF International Conference on Computer Vision (ICCV), pages 5855–5864, 2021.
  5. Mip-nerf 360: Unbounded anti-aliased neural radiance fields. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), pages 5470–5479, 2022.
  6. 3d scene compression through entropy penalized neural representation functions. In 2021 Picture Coding Symposium (PCS), pages 1–5. IEEE, 2021.
  7. Overview of the versatile video coding (vvc) standard and its applications. IEEE Transactions on Circuits and Systems for Video technology (TCSVT), 31(10):3736–3764, 2021.
  8. Efficient geometry-aware 3d generative adversarial networks. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), pages 16123–16133, 2022.
  9. Tensorf: Tensorial radiance fields. In European Conference on Computer Vision (ECCV), pages 333–350. Springer, 2022.
  10. Compressing explicit voxel grid representations: fast nerfs become also small. In Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision (WACV), pages 1236–1245, 2023.
  11. K-planes: Explicit radiance fields in space, time, and appearance. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pages 12479–12488, 2023.
  12. Lilnetx: Lightweight networks with EXtreme model compression and structured sparsification. In International Conference on Learning Representations (ICLR), 2023.
  13. Tanks and temples: Benchmarking large-scale scene reconstruction. ACM Transactions on Graphics (ToG), 36(4):1–13, 2017.
  14. Glen G. Langdon. An introduction to arithmetic coding. IBM Journal of Research and Development, 28(2):135–149, 1984.
  15. Compressing volumetric radiance fields to 1 mb. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), pages 4222–4231, 2023.
  16. Neural sparse voxel fields. Advances in neural information processing systems (NIPS), 33:15651–15663, 2020.
  17. An overview of jpeg-2000. In Proceedings DCC 2000. Data Compression Conference, pages 523–541. IEEE, 2000.
  18. High-fidelity generative image compression. Advances in neural information processing systems (NIPS), 33, 2020.
  19. Nerf: Representing scenes as neural radiance fields for view synthesis. Communications of the ACM, 65(1):99–106, 2021.
  20. Joint autoregressive and hierarchical priors for learned image compression. Advances in Neural Information Processing Systems (NIPS), 31, 2018.
  21. Instant neural graphics primitives with a multiresolution hash encoding. ACM Transactions on Graphics (ToG), 41(4):1–15, 2022.
  22. Scalable model compression by entropy penalized reparameterization. In International Conference on Learning Representations (ICLR), 2020.
  23. Masked wavelet representation for compact neural radiance fields. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), pages 20680–20690, 2023.
  24. Plenoxels: Radiance fields without neural networks. In Proceedings of the IEEE/CVF International Conference on Computer Vision (ICCV), 2022.
  25. Binary radiance fields. arXiv preprint arXiv:2306.07581, 2023.
  26. Implicit neural representations with periodic activation functions. Advances in neural information processing systems (NIPS), 33:7462–7473, 2020.
  27. Overview of the high efficiency video coding (hevc) standard. IEEE Transactions on circuits and systems for video technology (TCSVT), 22(12):1649–1668, 2012.
  28. Direct voxel grid optimization: Super-fast convergence for radiance fields reconstruction. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), pages 5459–5469, 2022a.
  29. Improved direct voxel grid optimization for radiance fields reconstruction. arXiv preprint arXiv:2206.05085, 2022b.
  30. Neural geometric level of detail: Real-time rendering with implicit 3d shapes. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), pages 11358–11367, 2021.
  31. Variable bitrate neural fields. In ACM SIGGRAPH 2022 Conference Proceedings, pages 1–9, 2022.
  32. Fourier features let networks learn high frequency functions in low dimensional domains. Advances in neural information processing systems (NIPS), 33:7537–7547, 2020.
  33. Compressible-composable nerf via rank-residual decomposition. Advances in neural information processing systems (NIPS), 35:14798–14809, 2022.
  34. Advances in neural rendering. In Computer Graphics Forum, pages 703–735. Wiley Online Library, 2022.
  35. Rtmv: A ray-traced multi-view synthetic dataset for novel view synthesis. IEEE/CVF European Conference on Computer Vision Workshop (Learn3DG ECCVW), 2022, 2022.
  36. Gregory K Wallace. The jpeg still picture compression standard. IEEE transactions on consumer electronics, 38(1):xviii–xxxiv, 1992.
  37. Neural trajectory fields for dynamic novel view synthesis. arXiv preprint arXiv:2105.05994, 2021.
  38. Fourier plenoctrees for dynamic radiance field rendering in real-time. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), pages 13524–13534, 2022.
  39. Neural residual radiance fields for streamably free-viewpoint videos. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), pages 76–87, 2023.
  40. Neural fourier filter bank. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), pages 14153–14163, 2023.
  41. An introduction to neural data compression. Foundations and Trends in Computer Graphics and Vision, 15(2):113–200, 2023.
  42. Plenoctrees for real-time rendering of neural radiance fields. In Proceedings of the IEEE/CVF International Conference on Computer Vision (ICCV), pages 5752–5761, 2021.
  43. Nerf++: Analyzing and improving neural radiance fields. arXiv preprint arXiv:2010.07492, 2020.
  44. Tinynerf: Towards 100 x compression of voxel radiance fields. In Proceedings of the AAAI Conference on Artificial Intelligence, pages 3588–3596, 2023.
Citations (3)

Summary

We haven't generated a summary for this paper yet.