Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
169 tokens/sec
GPT-4o
7 tokens/sec
Gemini 2.5 Pro Pro
45 tokens/sec
o3 Pro
4 tokens/sec
GPT-4.1 Pro
38 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

NeRFCodec: Neural Feature Compression Meets Neural Radiance Fields for Memory-Efficient Scene Representation (2404.02185v1)

Published 2 Apr 2024 in cs.CV, cs.GR, and eess.IV

Abstract: The emergence of Neural Radiance Fields (NeRF) has greatly impacted 3D scene modeling and novel-view synthesis. As a kind of visual media for 3D scene representation, compression with high rate-distortion performance is an eternal target. Motivated by advances in neural compression and neural field representation, we propose NeRFCodec, an end-to-end NeRF compression framework that integrates non-linear transform, quantization, and entropy coding for memory-efficient scene representation. Since training a non-linear transform directly on a large scale of NeRF feature planes is impractical, we discover that pre-trained neural 2D image codec can be utilized for compressing the features when adding content-specific parameters. Specifically, we reuse neural 2D image codec but modify its encoder and decoder heads, while keeping the other parts of the pre-trained decoder frozen. This allows us to train the full pipeline via supervision of rendering loss and entropy loss, yielding the rate-distortion balance by updating the content-specific parameters. At test time, the bitstreams containing latent code, feature decoder head, and other side information are transmitted for communication. Experimental results demonstrate our method outperforms existing NeRF compression methods, enabling high-quality novel view synthesis with a memory budget of 0.5 MB.

Definition Search Book Streamline Icon: https://streamlinehq.com
References (45)
  1. The jpeg ai standard: Providing efficient human and machine visual data consumption. IEEE Multimedia, 30(1):100–111, 2023.
  2. Nonlinear transform coding. IEEE Journal of Selected Topics in Signal Processing, 15(2):339–353, 2020.
  3. Variational image compression with a scale hyperprior. In Proc. of the International Conf. on Learning Representations (ICLR), 2018.
  4. Compressai: a pytorch library and evaluation platform for end-to-end compression research. arXiv preprint arXiv:2011.03029, 2020.
  5. 3d scene compression through entropy penalized neural representation functions. In 2021 Picture Coding Symposium (PCS), pages 1–5. IEEE, 2021.
  6. Overview of the versatile video coding (vvc) standard and its applications. IEEE Transactions on Circuits and Systems for Video Technology, 31(10):3736–3764, 2021.
  7. Hexplane: A fast representation for dynamic scenes. In Proc. IEEE Conf. on Computer Vision and Pattern Recognition (CVPR), pages 130–141, 2023.
  8. Efficient geometry-aware 3d generative adversarial networks. In Proc. IEEE Conf. on Computer Vision and Pattern Recognition (CVPR), 2022.
  9. Tensorf: Tensorial radiance fields. In Proc. of the European Conf. on Computer Vision (ECCV), 2022.
  10. Learned image compression with discretized gaussian mixture likelihoods and attention modules. In Proc. IEEE Conf. on Computer Vision and Pattern Recognition (CVPR), pages 7939–7948, 2020.
  11. Binaryconnect: Training deep neural networks with binary weights during propagations. Advances in Neural Information Processing Systems (NeurIPS), 28, 2015.
  12. Elements of Information Theory. John Wiley & Sons, 2012.
  13. Compression of 3d point clouds using a region-adaptive hierarchical transform. IEEE Transactions on Image Processing, 25(8):3947–3956, 2016.
  14. Objaverse: A universe of annotated 3d objects. In Proc. IEEE Conf. on Computer Vision and Pattern Recognition (CVPR), pages 13142–13153, 2023.
  15. Compressing explicit voxel grid representations: fast nerfs become also small. In Proc. of the IEEE Winter Conference on Applications of Computer Vision (WACV), pages 1236–1245, 2023.
  16. Acrf: Compressing explicit neural radiance fields via attribute compression. In Proc. of the International Conf. on Learning Representations (ICLR), 2024.
  17. K-planes: Explicit radiance fields in space, time, and appearance. In Proc. IEEE Conf. on Computer Vision and Pattern Recognition (CVPR), pages 12479–12488, 2023.
  18. Vivek K Goyal. Theoretical foundations of transform coding. IEEE Signal Processing Magazine, 18(5):9–21, 2001.
  19. Elic: Efficient learned image compression with unevenly grouped space-channel contextual adaptive coding. In Proc. IEEE Conf. on Computer Vision and Pattern Recognition (CVPR), pages 5718–5727, 2022.
  20. Learning end-to-end lossy image compression: A benchmark. IEEE Trans. on Pattern Analysis and Machine Intelligence (PAMI), 44(8):4194–4211, 2021.
  21. Relu fields: The little non-linearity that could. In ACM Trans. on Graphics, pages 1–9, 2022.
  22. Tanks and temples: Benchmarking large-scale scene reconstruction. ACM Transactions on Graphics (ToG), 36(4):1–13, 2017.
  23. Image compression using the 2-d wavelet transform. IEEE Transactions on image Processing, 1(2):244–250, 1992.
  24. Compressing volumetric radiance fields to 1 mb. In Proc. IEEE Conf. on Computer Vision and Pattern Recognition (CVPR), pages 4222–4231, 2023.
  25. Learned image compression with mixed transformer-cnn architectures. In Proc. IEEE Conf. on Computer Vision and Pattern Recognition (CVPR), pages 14388–14397, 2023.
  26. Neural sparse voxel fields. In Hugo Larochelle, Marc’Aurelio Ranzato, Raia Hadsell, Maria-Florina Balcan, and Hsuan-Tien Lin, editors, Advances in Neural Information Processing Systems (NeurIPS), 2020.
  27. High-fidelity generative image compression. Advances in Neural Information Processing Systems (NeurIPS), 33:11913–11924, 2020.
  28. Nerf: Representing scenes as neural radiance fields for view synthesis. In Proc. of the European Conf. on Computer Vision (ECCV), 2020.
  29. Joint autoregressive and hierarchical priors for learned image compression. Advances in Neural Information Processing Systems (NeurIPS), 31, 2018.
  30. Instant neural graphics primitives with a multiresolution hash encoding. ACM Trans. on Graphics, 2022.
  31. Scalable model compression by entropy penalized reparameterization. In Proc. of the International Conf. on Learning Representations (ICLR), 2019.
  32. Masked wavelet representation for compact neural radiance fields. In Proc. IEEE Conf. on Computer Vision and Pattern Recognition (CVPR), pages 20680–20690, 2023.
  33. Binary radiance fields. Advances in Neural Information Processing Systems (NeurIPS), 2023.
  34. The jpeg 2000 still image compression standard. IEEE Signal Processing Magazine, 18(5):36–58, 2001.
  35. Overview of the high efficiency video coding (hevc) standard. IEEE Transactions on circuits and systems for video technology, 22(12):1649–1668, 2012.
  36. Direct voxel grid optimization: Super-fast convergence for radiance fields reconstruction. In Proc. IEEE Conf. on Computer Vision and Pattern Recognition (CVPR), 2022.
  37. Variable bitrate neural fields. In ACM Trans. on Graphics, pages 1–9, 2022.
  38. Neural geometric level of detail: Real-time rendering with implicit 3d shapes. In Proc. IEEE Conf. on Computer Vision and Pattern Recognition (CVPR), pages 11358–11367, 2021.
  39. Compressible-composable nerf via rank-residual decomposition. Advances in Neural Information Processing Systems (NeurIPS), 35:14798–14809, 2022.
  40. Andrew B Watson et al. Image compression using the discrete cosine transform. Mathematica journal, 4(1):81, 1994.
  41. Omniobject3d: Large-vocabulary 3d object dataset for realistic perception, reconstruction and generation. In Proc. IEEE Conf. on Computer Vision and Pattern Recognition (CVPR), pages 803–814, 2023.
  42. Point-nerf: Point-based neural radiance fields. In Proc. IEEE Conf. on Computer Vision and Pattern Recognition (CVPR), pages 5438–5448, 2022.
  43. Plenoxels: Radiance fields without neural networks. Proc. IEEE Conf. on Computer Vision and Pattern Recognition (CVPR), 2022.
  44. Mvimgnet: A large-scale dataset of multi-view images. In Proc. IEEE Conf. on Computer Vision and Pattern Recognition (CVPR), pages 9150–9161, 2023.
  45. Differentiable point-based radiance fields for efficient view synthesis. In SIGGRAPH Asia 2022 Conference Papers, pages 1–12, 2022.
Citations (6)

Summary

We haven't generated a summary for this paper yet.