HeadCraft: Modeling High-Detail Shape Variations for Animated 3DMMs (2312.14140v1)

Published 21 Dec 2023 in cs.CV

Abstract: Current advances in human head modeling make it possible to generate plausible-looking 3D head models via neural representations. Nevertheless, constructing complete high-fidelity head models with explicitly controlled animation remains an issue. Furthermore, completing the head geometry based on a partial observation, e.g. from a depth sensor, while preserving details is often problematic for existing methods. We introduce a generative model for detailed 3D head meshes on top of an articulated 3DMM, which allows explicit animation and high-detail preservation at the same time. Our method is trained in two stages. First, we register a parametric head model with vertex displacements to each mesh of the recently introduced NPHM dataset of accurate 3D head scans. The estimated displacements are baked into a hand-crafted UV layout. Second, we train a StyleGAN model to generalize over the UV maps of displacements. The decomposition into a parametric model and high-quality vertex displacements allows us to animate the model and modify it semantically. We demonstrate the results of unconditional generation and fitting to full or partial observations. The project page is available at https://seva100.github.io/headcraft.

Summary

  • The paper presents a two-stage approach that first aligns a parametric head template with 3D scans and then refines surface displacements via UV maps.
  • The method employs a StyleGAN-based generative model to learn intricate head variations, including complex hairstyles and subtle facial details.
  • Quantitative and visual evaluations demonstrate the model's improved realism and adaptability for animation in various applications.

Introduction to Generative Head Modeling

Advances in neural modeling techniques have made it possible to create increasingly realistic 3D human head models. Such models matter not only in entertainment and virtual reality but also in industries such as medical simulation and digital communication. The crucial challenge is generating models that are highly detailed yet can be easily manipulated for animation and tracking without losing that detail.

Methodology

Crafting Detailed Displacements

HeadCraft creates detailed 3D human head models in two main stages. First, a parametric head template, which supports animation and captures coarse shape variation, is registered to a comprehensive database of 3D head scans. This fitting deforms the template mesh toward each scanned head, letting vertices move freely to capture fine surface detail. In the first phase of fitting, these deformations are regularized to avoid mesh self-intersections; a second phase then refines the displacements along the head's surface normals.
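As a concrete illustration, below is a minimal sketch of such a two-stage fitting loop written with PyTorch3D (which appears in the paper's references). The function and variable names (`fit_displacements`, `template_verts`, `scan_points`), the loss weights, and the choice of Laplacian smoothing as the stage-one regularizer are assumptions for illustration, not the authors' implementation.

```python
# Hedged sketch of two-stage template-to-scan fitting; assumes PyTorch3D.
# template_verts: (V, 3), faces: (F, 3), scan_points: (P, 3) -- illustrative names.
import torch
from pytorch3d.structures import Meshes
from pytorch3d.ops import sample_points_from_meshes
from pytorch3d.loss import chamfer_distance, mesh_laplacian_smoothing

def fit_displacements(template_verts, faces, scan_points, iters=500):
    # Stage 1: free per-vertex offsets; the smoothness term regulates the
    # deformation (a stand-in for the paper's anti-self-intersection terms).
    offsets = torch.zeros_like(template_verts, requires_grad=True)
    opt = torch.optim.Adam([offsets], lr=1e-3)
    for _ in range(iters):
        mesh = Meshes(verts=[template_verts + offsets], faces=[faces])
        pts = sample_points_from_meshes(mesh, num_samples=20000)
        data_loss, _ = chamfer_distance(pts, scan_points[None])
        loss = data_loss + 0.1 * mesh_laplacian_smoothing(mesh)
        opt.zero_grad(); loss.backward(); opt.step()

    # Stage 2: refine with one scalar offset per vertex along the surface
    # normal; normals are kept fixed here for simplicity.
    base = (template_verts + offsets).detach()
    normals = Meshes(verts=[base], faces=[faces]).verts_normals_packed()
    scale = torch.zeros(base.shape[0], 1, requires_grad=True)
    opt = torch.optim.Adam([scale], lr=1e-3)
    for _ in range(iters):
        mesh = Meshes(verts=[base + scale * normals], faces=[faces])
        pts = sample_points_from_meshes(mesh, num_samples=20000)
        loss, _ = chamfer_distance(pts, scan_points[None])
        opt.zero_grad(); loss.backward(); opt.step()
    return base + scale.detach() * normals  # final detailed vertices
```

Freezing the normals between the two stages keeps stage two a simple per-vertex scalar optimization; recomputing them each step would be closer to a true normal-direction refinement.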

To capture and reproduce the intricate variation observed across human heads, these displacements are baked into 2D UV maps that a generative model can learn from. A generative model based on the StyleGAN architecture is then trained on these UV displacement maps, encoding the rich surface detail in a form that generalizes across identities.
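As a rough illustration of the baking step, the sketch below splats per-vertex displacements into a texture under the assumption of a known per-vertex UV layout; a production bake would rasterize each triangle with barycentric interpolation, and all names here are hypothetical.

```python
import numpy as np

def bake_displacements(uv, displacements, res=256):
    """Splat per-vertex 3D displacement vectors into a UV texture.

    uv: (V, 2) coordinates in [0, 1] from the hand-crafted layout (assumed given).
    displacements: (V, 3) offsets from the registration stage.
    """
    tex = np.zeros((res, res, 3), dtype=np.float32)
    px = np.clip((uv * (res - 1)).round().astype(int), 0, res - 1)
    tex[px[:, 1], px[:, 0]] = displacements  # nearest-texel splat, one vertex per texel
    return tex
```

The resulting three-channel textures can then be fed to a StyleGAN training pipeline much as ordinary RGB images would be.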

Versatility and Adaptability

Learning from 2D UV displacement maps lets the method produce high-resolution detail that extends well beyond the coarse geometry of the parametric model, and it introduces considerable shape variation, including complex hairstyles. This yields a high degree of detail and variability when generating new head models or adapting existing ones to different shapes.
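Applying a map (generated or baked) back to the template is the inverse lookup: sample the displacement texture at each vertex's UV coordinate and offset the vertex. A minimal bilinear version using `torch.nn.functional.grid_sample` is sketched below; the tensor shapes and names are assumptions.

```python
import torch
import torch.nn.functional as F

def apply_displacement_map(verts, uv, disp_map):
    """Offset template vertices by displacements sampled from a UV texture.

    verts: (V, 3) template vertices; uv: (V, 2) in [0, 1];
    disp_map: (1, 3, H, W), e.g. a StyleGAN sample.
    """
    grid = (uv * 2.0 - 1.0).view(1, -1, 1, 2)                  # grid_sample expects [-1, 1]
    disp = F.grid_sample(disp_map, grid, align_corners=True)   # (1, 3, V, 1)
    return verts + disp[0, :, :, 0].t()                        # (V, 3)
```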

Evaluation and Applications

To demonstrate the effectiveness and practicality of the generated models, several evaluations are conducted. The diversity and fidelity of the generated heads are compared with existing methods and measured against real human 3D scans, quantified with standard metrics and inspected visually in both UV-map space and rendered-image space.

The applications demonstrate the model's capacity to generate 3D heads unconditionally and to fit them to complete or partial observations, such as point clouds from depth sensors. Particularly noteworthy is the ability to animate and manipulate the generated heads: the underlying parametric template allows explicit expression and pose adjustments.
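A hedged sketch of fitting to a partial observation, reusing `apply_displacement_map` from above: optimize the latent code of a trained generator so that the displaced template explains the observed point cloud. Here `generator` stands for any callable mapping a latent vector to a `(1, 3, H, W)` displacement map; its interface, the latent dimensionality, and the symmetric Chamfer loss are assumptions (for truly partial scans, a one-sided scan-to-mesh distance would be preferable).

```python
import torch
from pytorch3d.structures import Meshes
from pytorch3d.ops import sample_points_from_meshes
from pytorch3d.loss import chamfer_distance

def fit_latent(generator, template_verts, faces, uv, observed_pts,
               latent_dim=512, iters=300):
    z = torch.zeros(1, latent_dim, requires_grad=True)  # latent to optimize
    opt = torch.optim.Adam([z], lr=1e-2)
    for _ in range(iters):
        disp_map = generator(z)                          # (1, 3, H, W), assumed API
        verts = apply_displacement_map(template_verts, uv, disp_map)
        mesh = Meshes(verts=[verts], faces=[faces])
        pts = sample_points_from_meshes(mesh, num_samples=20000)
        loss, _ = chamfer_distance(observed_pts[None], pts)
        opt.zero_grad(); loss.backward(); opt.step()
    return z.detach()
```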

Conclusions

The two-stage registration procedure that aligns the parametric model with scanned data yields detailed displacement maps that significantly enhance the model's realism and variability. By training a StyleGAN to generalize over these high-resolution maps, the authors establish a method capable of generating detailed, animatable 3D head models. Quantitative and visual results showing high levels of detail and diversity highlight the method's utility across a range of settings and applications.
