
LeanGaussian: Breaking Pixel or Point Cloud Correspondence in Modeling 3D Gaussians (2404.16323v4)

Published 25 Apr 2024 in cs.CV

Abstract: Recently, Gaussian splatting has demonstrated significant success in novel view synthesis. Current methods often regress Gaussians with pixel or point cloud correspondence, linking each Gaussian to a pixel or a 3D point. As a result, redundant Gaussians are spent overfitting the correspondence rather than representing the objects themselves, which wastes resources and yields inaccurate geometry or textures. In this paper, we introduce LeanGaussian, a novel approach that treats each query in a deformable Transformer as one 3D Gaussian ellipsoid, breaking the pixel or point cloud correspondence constraints. We leverage a deformable decoder to iteratively refine the Gaussians layer by layer, with the image features as keys and values. Notably, the center of each 3D Gaussian is defined as a 3D reference point, which is then projected onto the image for deformable attention in 2D space. On both the ShapeNet SRN dataset (category level) and the Google Scanned Objects dataset (open-category level, trained with the Objaverse dataset), our approach outperforms prior methods by approximately 6.1%, achieving PSNRs of 25.44 and 22.36, respectively. Additionally, our method achieves a 3D reconstruction speed of 7.2 FPS and a rendering speed of 500 FPS. Code is available at https://github.com/jwubz123/LeanGaussian.
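
To make the projection step in the abstract concrete, here is a minimal sketch of how 3D Gaussian centers could be turned into 2D reference points for deformable attention. It is not taken from the LeanGaussian code. The function name `project_centers`, the simple pinhole camera with a single focal length and a centered principal point, and the normalization to [0, 1] are all illustrative assumptions.

```python
from typing import List, Tuple

Point3D = Tuple[float, float, float]
Point2D = Tuple[float, float]


def project_centers(
    centers: List[Point3D], focal: float, width: int, height: int
) -> List[Point2D]:
    """Project camera-space 3D centers to normalized 2D reference points.

    Assumes (hypothetically) a pinhole camera looking down +z with the
    principal point at the image center. Outputs lie in [0, 1] when the
    point falls inside the image.
    """
    cx, cy = width / 2.0, height / 2.0  # assumed principal point
    refs: List[Point2D] = []
    for x, y, z in centers:
        if z <= 0.0:
            raise ValueError("center must be in front of the camera (z > 0)")
        # Perspective division followed by a shift to pixel coordinates.
        u = focal * x / z + cx
        v = focal * y / z + cy
        # Normalize by the image size, as deformable attention usually
        # samples around reference points given in relative coordinates.
        refs.append((u / width, v / height))
    return refs


if __name__ == "__main__":
    # A center on the optical axis projects to the middle of the image.
    print(project_centers([(0.0, 0.0, 2.0)], focal=100.0, width=128, height=128))
```

In a decoder of the kind the abstract describes, each layer would update the Gaussian centers and then re-project them this way, so the sampling locations on the image follow the refined 3D positions.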
