Cicero: Addressing Algorithmic and Architectural Bottlenecks in Neural Rendering by Radiance Warping and Memory Optimizations (2404.11852v1)
Abstract: Neural Radiance Field (NeRF) is widely seen as an alternative to traditional physically-based rendering. However, NeRF has not yet seen its adoption in resource-limited mobile systems such as Virtual and Augmented Reality (VR/AR), because it is simply extremely slow. On a mobile Volta GPU, even the state-of-the-art NeRF models generally execute only at 0.8 FPS. We show that the main performance bottlenecks are both algorithmic and architectural. We introduce, CICERO, to tame both forms of inefficiencies. We first introduce two algorithms, one fundamentally reduces the amount of work any NeRF model has to execute, and the other eliminates irregular DRAM accesses. We then describe an on-chip data layout strategy that eliminates SRAM bank conflicts. A pure software implementation of CICERO offers an 8.0x speed-up and 7.9x energy saving over a mobile Volta GPU. When compared to a baseline with a dedicated DNN accelerator, our speed-up and energy reduction increase to 28.2x and 37.8x, respectively - all with minimal quality loss (less than 1.0 dB peak signal-to-noise ratio reduction).
- ``Agisoft metashape,'' https://www.agisoft.com/.
- ``Apple A15 Die Shot and Annotation - IP Block Area Analysis.'' [Online]. Available: https://www.semianalysis.com/p/apple-a15-die-shot-and-annotation
- ``Micron 178-Ball, Single-Channel Mobile LPDDR3 SDRAM Features.'' [Online]. Available: https://www.micron.com/-/media/client/global/documents/products/data-sheet/dram/mobile-dram/low-power-dram/lpddr3/178b_8-16gb_2c0f_mobile_lpddr3.pdf
- ``Micron System Power Calculators.'' [Online]. Available: https://www.micron.com/support/tools-and-utilities/power-calc
- ``Nvidia reveals xavier soc details.'' [Online]. Available: https://www.forbes.com/sites/moorinsights/2018/08/24/nvidia-reveals-xavier-soc-details/amp/
- ``NVIDIA’s Xavier System-on-Chip, HotChips 30.'' [Online]. Available: https://fuse.wikichip.org/news/1618/hot-chips-30-nvidia-xavier-soc/
- PCL, ``Spatial partitioning and search operations with octrees,'' https://pcl.readthedocs.io/projects/tutorials/en/latest/octree.html.