
High-Fidelity 3D Head Avatars Reconstruction through Spatially-Varying Expression Conditioned Neural Radiance Field (2310.06275v1)

Published 10 Oct 2023 in cs.CV

Abstract: One crucial aspect of 3D head avatar reconstruction lies in the details of facial expressions. Although recent NeRF-based photo-realistic 3D head avatar methods achieve high-quality avatar rendering, they still encounter challenges retaining intricate facial expression details because they overlook the potential of specific expression variations at different spatial positions when conditioning the radiance field. Motivated by this observation, we introduce a novel Spatially-Varying Expression (SVE) conditioning. The SVE can be obtained by a simple MLP-based generation network, encompassing both spatial positional features and global expression information. Benefiting from rich and diverse information of the SVE at different positions, the proposed SVE-conditioned neural radiance field can deal with intricate facial expressions and achieve realistic rendering and geometry details of high-fidelity 3D head avatars. Additionally, to further elevate the geometric and rendering quality, we introduce a new coarse-to-fine training strategy, including a geometry initialization strategy at the coarse stage and an adaptive importance sampling strategy at the fine stage. Extensive experiments indicate that our method outperforms other state-of-the-art (SOTA) methods in rendering and geometry quality on mobile phone-collected and public datasets.
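
The abstract describes the SVE as the output of a simple MLP that takes both spatial positional features and a global expression code. A minimal sketch of that idea follows, in pure Python. It is not the authors' implementation: the layer sizes, the sin/cos positional encoding, the expression dimension and every function name here are assumptions chosen for illustration, and the weights are random rather than trained.

```python
import math
import random

NUM_FREQS = 4   # assumed number of positional-encoding frequencies
EXPR_DIM = 8    # hypothetical size of the global expression code
SVE_DIM = 16    # hypothetical size of the per-point SVE feature


def positional_encoding(point, num_freqs=NUM_FREQS):
    """Encode a 3D point as its raw coordinates plus sin/cos at octave frequencies."""
    feats = list(point)
    for k in range(num_freqs):
        for x in point:
            feats.append(math.sin((2.0 ** k) * x))
            feats.append(math.cos((2.0 ** k) * x))
    return feats


def make_mlp(sizes, seed=0):
    """Build randomly initialised layers; a real model would learn these weights."""
    rng = random.Random(seed)
    layers = []
    for n_in, n_out in zip(sizes[:-1], sizes[1:]):
        scale = 1.0 / math.sqrt(n_in)
        weights = [[rng.uniform(-scale, scale) for _ in range(n_in)]
                   for _ in range(n_out)]
        biases = [0.0] * n_out
        layers.append((weights, biases))
    return layers


def mlp_forward(layers, x):
    """Apply each linear layer, with ReLU on all but the last."""
    for i, (weights, biases) in enumerate(layers):
        x = [sum(w * v for w, v in zip(row, x)) + b
             for row, b in zip(weights, biases)]
        if i < len(layers) - 1:
            x = [max(0.0, v) for v in x]
    return x


def sve_feature(layers, point, expression):
    """Combine the position encoding with the global expression code.

    The same expression therefore yields a different feature at each 3D point.
    """
    return mlp_forward(layers, positional_encoding(point) + list(expression))


in_dim = 3 + 3 * 2 * NUM_FREQS + EXPR_DIM
net = make_mlp([in_dim, 32, SVE_DIM])
expr = [0.1 * i for i in range(EXPR_DIM)]

sve_a = sve_feature(net, (0.0, 0.1, 0.2), expr)
sve_b = sve_feature(net, (0.5, -0.3, 0.1), expr)
```

In this sketch `sve_a` and `sve_b` come from one shared expression code evaluated at two sample points, which is the sense in which the conditioning varies spatially. In a full pipeline such a feature would be passed to the radiance field alongside the sample position in place of a single global expression vector.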
