Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
139 tokens/sec
GPT-4o
7 tokens/sec
Gemini 2.5 Pro Pro
46 tokens/sec
o3 Pro
4 tokens/sec
GPT-4.1 Pro
38 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

Distilled Low Rank Neural Radiance Field with Quantization for Light Field Compression (2208.00164v3)

Published 30 Jul 2022 in cs.CV and cs.MM

Abstract: We propose in this paper a Quantized Distilled Low-Rank Neural Radiance Field (QDLR-NeRF) representation for the task of light field compression. While existing compression methods encode the set of light field sub-aperture images, our proposed method learns an implicit scene representation in the form of a Neural Radiance Field (NeRF), which also enables view synthesis. To reduce its size, the model is first learned under a Low-Rank (LR) constraint using a Tensor Train (TT) decomposition within an Alternating Direction Method of Multipliers (ADMM) optimization framework. To further reduce the model's size, the components of the tensor train decomposition need to be quantized. However, simultaneously considering the optimization of the NeRF model with both the low-rank constraint and rate-constrained weight quantization is challenging. To address this difficulty, we introduce a network distillation operation that separates the low-rank approximation and the weight quantization during network training. The information from the initial LR-constrained NeRF (LR-NeRF) is distilled into a model of much smaller dimension (DLR-NeRF) based on the TT decomposition of the LR-NeRF. We then learn an optimized global codebook to quantize all TT components, producing the final QDLR-NeRF. Experimental results show that our proposed method yields better compression efficiency compared to state-of-the-art methods, and it additionally has the advantage of allowing the synthesis of any light field view with high quality.

Definition Search Book Streamline Icon: https://streamlinehq.com
References (44)
  1. HEVC-based light field image coding with bi-predicted self-similarity compensation, in: IEEE Int. Conf. on Multimedia Expo. Workshops (ICMEW), 2016, pp. 1–4.
  2. Computationally efficient light field image compression using a multiview HEVC framework, IEEE Access 7 (2019).
  3. Lenselet image compression scheme based on subaperture images streaming, in: IEEE Int. Conf. on Image Processing (ICIP), 2015, pp. 4733–4737.
  4. High efficient snake order pseudo-sequence based light field image compression, in: IEEE Data Compression Conf. (DCC), 2018, pp. 397–397.
  5. Light field compression using fourier disparity layers, in: IEEE Int. Conf. on Image Processing (ICIP), 2019, pp. 3751–3755.
  6. Light field compression with homography-based low-rank approximation, IEEE J. Sel. Topics Signal Process. (JSTSP) 11 (2017) 1132–1145.
  7. Steered mixture-of-experts for light field images and video: Representation and coding, IEEE Trans. on Multimedia (TMM) 22 (2020) 579–593.
  8. A 4D DCT-based lenslet light field codec, in: IEEE Int. Conf. on Image Processing (ICIP), 2018, pp. 435–439.
  9. Progressive compression and rendering of light fields, Conf. on Vision, Modeling & Visualization (VMV) (2000).
  10. Compression of plenoptic point cloud attributes using 6-d point clouds and 6-d transforms, IEEE Trans. on Multimedia (TMM) (2021).
  11. Content-based light field image compression method with gaussian process regression, IEEE Trans. on Multimedia (TMM) 22 (2019) 846–859.
  12. Shearlet transform based prediction scheme for light field compression, in: Data Compression Conf. (DCC), 2018, pp. 396–396.
  13. Light field compression using depth image based view synthesis, in: IEEE Int. Conf. on Multimedia Expo Workshops (ICMEW), 2017, pp. 19–24.
  14. Light field compression with graph learning and dictionary-guided sparse coding, IEEE Trans. on Multimedia (TMM) (2022).
  15. Low bitrate light field compression with geometry and content consistency, IEEE Trans. on Multimedia (TMM) 24 (2022) 152–165.
  16. NeRF: Representing scenes as neural radiance fields for view synthesis, in: Eur. Conf. on Computer Vision (ECCV), 2020, pp. 405–421.
  17. pixelneRF: Neural radiance fields from one or few images, IEEE. Int. Conf. on Computer Vision and Pattern Recognition (CVPR) (2021) 4576–4585.
  18. Neural sparse voxel fields, Advances in Neural Information Processing Systems (NIPS) (2020).
  19. MVSNeRF: Fast generalizable radiance field reconstruction from multi-view stereo, in: IEEE Int. Conf. on Computer Vision (ICCV), 2021.
  20. NeRF−⁣−--- -: Neural radiance fields without known camera parameters, arXiv preprint arXiv:2102.07064 (2021).
  21. X-Fields: Implicit neural view-, light- and time-image interpolation, ACM Trans. on Graphics (TOG) 39 (2020).
  22. Self-calibrating neural radiance fields, in: IEEE Int. Conf. on Computer Vision (ICCV), 2021, pp. 5846–5854.
  23. Deep compression: Compressing deep neural networks with pruning, trained quantization and huffman coding, Int. Conf. on Learning Representations (ICLR) (2016).
  24. Distilling the knowledge in a neural network, ArXiv abs/1503.02531 (2015).
  25. Overview of the high efficiency video coding (HEVC) standard, EEE Trans. Circuits Syst. Video Technol. (TCSVT) 22 (2012) 1649–1668.
  26. Learning for video compression with hierarchical quality and recurrent enhancement, in: IEEE. Int. Conf. on Computer Vision and Pattern Recognition (CVPR), 2020.
  27. Learning for video compression with recurrent auto-encoder and recurrent probability model, IEEE J. Sel. Topics Signal Process. (JSTSP) 15 (2021) 388–401.
  28. OpenDVC: An open source implementation of the DVC video compression method, CoRR abs/2006.15862 (2020).
  29. DVC: An end-to-end deep video compression framework, in: IEEE. Int. Conf. on Computer Vision and Pattern Recognition (CVPR), 2019, pp. 10998–11007.
  30. J. Shi, C. Guillemot, Light field compression via compact neural scene representation, in: IEEE Int. Conf. on Acoustics, Speech and Signal Processing (ICASSP), IEEE, 2023, pp. 1–5.
  31. M. Levoy, P. Hanrahan, Light field rendering, in: Proc. 23rd Annu. Conf. Comput. Graph. Interact. Techn.(CCGIT), 1996, pp. 31–42.
  32. J. Schonberger, J. Frahm, Structure-from-motion revisited, in: IEEE. Int. Conf. on Computer Vision and Pattern Recognition (CVPR), 2016, pp. 4104–4113.
  33. I. Oseledets, Tensor-train decomposition, SIAM Journal on Scientific Computing 33 (2011) 2295–2317.
  34. Towards efficient tensor decomposition-based dnn model compression with optimization framework, in: IEEE. Int. Conf. on Computer Vision and Pattern Recognition (CVPR), 2021, pp. 10674–10683.
  35. Training with quantization noise for extreme model compression, Int. Conf. on Learning Representations (ICLR) (2020).
  36. D. Huffman, A method for the construction of minimum-redundancy codes, Proceedings of the IRE 40 (1952) 1098–1101.
  37. A dataset and evaluation methodology for depth estimation on 4D light fields, in: Asian Conf. on Computer Vision (ACCV), 2016, pp. 19–34.
  38. M. Rerabek, T. Ebrahimi, New light field image dataset, in: Int. Conf. on Quality of Multimedia Experience (QoMEX), EPFL-CONF-218363, 2016.
  39. Impact of light field compression on focus stack and extended focus images, in: IEEE Eur. Signal Process. Conf. (EUSIPCO), 2016, pp. 898–902.
  40. A framework for learning depth from a flexible subset of dense and sparse light field views, IEEE Trans. Image Process. (TIP) 28 (2019) 5867–5880.
  41. Delving deep into rectifiers: Surpassing human-level performance on imagenet classification, in: IEEE Int. Conf. on Computer Vision (ICCV), 2015, pp. 1026–1034.
  42. Implicit neural representations with periodic activation functions, in: Advances in Neural Information Processing Systems (NIPS), 2020.
  43. Plenoctrees for real-time rendering of neural radiance fields, in: IEEE Int. Conf. on Computer Vision (ICCV), 2021, pp. 5752–5761.
  44. B. Y. Feng, A. Varshney, SIGNET: Efficient neural representation for light fields, in: IEEE Int. Conf. on Computer Vision (ICCV), 2021, pp. 14224–14233.
Citations (5)

Summary

We haven't generated a summary for this paper yet.