MonoHair: High-Fidelity Hair Modeling from a Monocular Video (2403.18356v1)

Published 27 Mar 2024 in cs.CV

Abstract: Undoubtedly, high-fidelity 3D hair is crucial for achieving realism, artistic expression, and immersion in computer graphics. While existing 3D hair modeling methods have achieved impressive performance, the challenge of achieving high-quality hair reconstruction persists: they either require strict capture conditions, making practical applications difficult, or rely heavily on learned prior data, obscuring fine-grained details in images. To address these challenges, we propose MonoHair, a generic framework to achieve high-fidelity hair reconstruction from a monocular video, without specific requirements for environments. Our approach bifurcates the hair modeling process into two main stages: precise exterior reconstruction and interior structure inference. The exterior is meticulously crafted using our Patch-based Multi-View Optimization (PMVO). This method strategically collects and integrates hair information from multiple views, independent of prior data, to produce a high-fidelity exterior 3D line map. This map not only captures intricate details but also facilitates the inference of the hair's inner structure. For the interior, we employ a data-driven, multi-view 3D hair reconstruction method. This method utilizes 2D structural renderings derived from the reconstructed exterior, mirroring the synthetic 2D inputs used during training. This alignment effectively bridges the domain gap between our training data and real-world data, thereby enhancing the accuracy and reliability of our interior structure inference. Lastly, we generate a strand model and resolve the directional ambiguity with our hair growth algorithm. Our experiments demonstrate that our method exhibits robustness across diverse hairstyles and achieves state-of-the-art performance. For more results, please refer to our project page: https://keyuwu-cs.github.io/MonoHair/.
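
The last step of the abstract, resolving directional ambiguity while growing strands, can be illustrated with a toy example. The sketch below is not the paper's hair growth algorithm; it only shows the general idea under simple assumptions: a 3D line map stores directions whose sign is ambiguous (d and -d describe the same line), and a strand grown from a root can fix the sign of each segment by requiring it to agree with the previous one. The function names, the `seed_dir` parameter, and the fixed step size are all hypothetical.

```python
import numpy as np


def orient_consistently(line_dirs, seed_dir):
    """Flip sign-ambiguous unit directions so each agrees with its predecessor.

    Hypothetical helper for illustration; not taken from the paper.
    """
    oriented = []
    prev = np.asarray(seed_dir, dtype=float)
    for d in np.asarray(line_dirs, dtype=float):
        # d and -d describe the same 3D line, so choose the sign that
        # keeps the strand from reversing on itself.
        if np.dot(d, prev) < 0:
            d = -d
        oriented.append(d)
        prev = d
    return np.array(oriented)


def grow_strand(root, line_dirs, seed_dir, step=1.0):
    """Integrate consistently oriented directions from a root point."""
    root = np.asarray(root, dtype=float)
    dirs = orient_consistently(line_dirs, seed_dir)
    # Each strand point is the root plus the running sum of the steps taken.
    return np.vstack([root, root + step * np.cumsum(dirs, axis=0)])


if __name__ == "__main__":
    # The middle direction is stored with the opposite sign on purpose.
    dirs = [[0.0, 0.0, -1.0], [0.0, 0.0, 1.0], [0.0, 0.0, -1.0]]
    print(grow_strand([0.0, 0.0, 0.0], dirs, seed_dir=[0.0, 0.0, -1.0]))
```

In this toy setup the seed direction stands in for whatever cue indicates the growth direction at the root (for example, the scalp normal). A real system would also need to sample directions from a spatial field and decide when a strand should stop.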
