Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
156 tokens/sec
GPT-4o
7 tokens/sec
Gemini 2.5 Pro Pro
45 tokens/sec
o3 Pro
4 tokens/sec
GPT-4.1 Pro
38 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

ContraNeRF: 3D-Aware Generative Model via Contrastive Learning with Unsupervised Implicit Pose Embedding (2304.14005v2)

Published 27 Apr 2023 in cs.CV

Abstract: Although 3D-aware GANs based on neural radiance fields have achieved competitive performance, their applicability is still limited to objects or scenes with the ground-truths or prediction models for clearly defined canonical camera poses. To extend the scope of applicable datasets, we propose a novel 3D-aware GAN optimization technique through contrastive learning with implicit pose embeddings. To this end, we first revise the discriminator design and remove dependency on ground-truth camera poses. Then, to capture complex and challenging 3D scene structures more effectively, we make the discriminator estimate a high-dimensional implicit pose embedding from a given image and perform contrastive learning on the pose embedding. The proposed approach can be employed for the dataset, where the canonical camera pose is ill-defined because it does not look up or estimate camera poses. Experimental results show that our algorithm outperforms existing methods by large margins on the datasets with multiple object categories and inconsistent canonical camera poses.

Definition Search Book Streamline Icon: https://streamlinehq.com
References (48)
  1. Gaudi: A neural architect for immersive 3d scene generation, 2022.
  2. Large scale GAN training for high fidelity natural image synthesis. CoRR, abs/1809.11096, 2018.
  3. Pix2nerf: Unsupervised conditional p-gan for single image to neural radiance fields translation. In CVPR, 2022.
  4. Efficient geometry-aware 3d generative adversarial networks. In CVPR, 2022.
  5. pi-gan: Periodic implicit generative adversarial networks for 3d-aware image synthesis. In CVPR, 2021.
  6. A simple framework for contrastive learning of visual representations. In ICML, 2020.
  7. Stargan v2: Diverse image synthesis for multiple domains. In CVPR, 2020.
  8. Gram: Generative radiance manifolds for 3d-aware image generation. In CVPR, 2022.
  9. Unconstrained scene generation with locally conditioned radiance fields. In ICCV, 2021.
  10. Generative temporal models with spatial memory for partially observed environments, 2018.
  11. Generative adversarial nets. In NeurIPS, 2014.
  12. Stylepeople: A generative model of fullbody human avatars. In CVPR, 2021.
  13. Stylenerf: A style-based 3d-aware generator for high-resolution image synthesis. In ICLR, 2022.
  14. Dual contrastive learning for unsupervised image-to-image translation. In ]CVPR, 2021.
  15. Momentum contrast for unsupervised visual representation learning. In CVPR, 2020.
  16. Gans trained by a two time-scale update rule converge to a local nash equilibrium. In NeurIPS, 2017.
  17. Eva3d: Compositional 3d human generation from 2d image collections, 2022.
  18. Contragan: Contrastive learning for conditional image generation. In NeurIPS, 2020.
  19. Training generative adversarial networks with limited data. In NeurIPS, 2020.
  20. A style-based generator architecture for generative adversarial networks. In CVPR, 2019.
  21. Analyzing and improving the image quality of stylegan. In CVPR, 2020.
  22. Adam: A method for stochastic optimization. In ICLR, 2015.
  23. Self-supervised dense consistency regularization for image-to-image translation. In CVPR, 2022.
  24. Depthgan: Gan-based depth generation of indoor scenes from semantic layouts. In ICCV, 2022.
  25. Smpl: A skinned multi-person linear model. In TOG, 2015.
  26. Which training methods for gans do actually converge? In ICML, 2018.
  27. Nerf: Representing scenes as neural radiance fields for view synthesis. In ECCV, 2020.
  28. Campari: Camera-aware decomposed generative neural radiance fields. In International Conference on 3D Vision (3DV), 2021.
  29. Giraffe: Representing scenes as compositional generative neural feature fields. In CVPR, 2021.
  30. Differentiable volumetric rendering: Learning implicit 3d representations without 3d supervision. In CVPR, 2020.
  31. Representation learning with contrastive predictive coding. arXiv preprint arXiv:1807.03748, 2018.
  32. Stylesdf: High-resolution 3d-consistent image and geometry generation. In CVPR, 2022.
  33. A shading-guided generative implicit model for shape-accurate 3d-aware image synthesis. In NeurIPS, 2021.
  34. Deepsdf: Learning continuous signed distance functions for shape representation. In CVPR, 2019.
  35. Contrastive learning for unpaired image-to-image translation. In ECCV, 2020.
  36. Pytorch: An imperative style, high-performance deep learning library. In NeurIPS, 2019.
  37. Graf: Generative radiance fields for 3d-aware image synthesis. In NeurIPS, 2020.
  38. Epigraf: Rethinking training of 3d gans. In NeurIPS, 2022.
  39. Fenerf: Face editing in neural radiance fields. In CVPR, 2022.
  40. The caltech-ucsd birds-200-2011 dataset. 2011.
  41. Unsupervised feature learning via non-parametric instance discrimination. In CVPR, 2018.
  42. Giraffe-hd: A high-resolution 3d-aware generative model. In CVPR, 2022.
  43. Learning to recover 3d scene shape from a single image. In CVPR, 2021.
  44. Lsun: Construction of a large-scale image dataset using deep learning with humans in the loop. arXiv preprint arXiv:1506.03365, 2015.
  45. Self-attention generative adversarial networks. In ICML, 2019.
  46. Cross-modal contrastive learning for text-to-image generation. In CVPR, 2021.
  47. Avatargen: A 3d generative model for animatable human avatars, 2022.
  48. Image augmentations for gan training. arXiv preprint arXiv:2006.02595, 2020.

Summary

We haven't generated a summary for this paper yet.