- The paper's main contribution is a network built on spatially adaptive instance normalization (SPAdaIN) that eliminates the need for manual mesh correspondences in 3D pose transfer.
- It pairs a permutation-invariant pose feature extractor with a style-transfer decoder to encode and combine pose and identity features, even for subjects unseen during training.
- Empirical results show lower Point-wise Mesh Euclidean Distance (PMD) than prior approaches, along with generalization to noisy and diverse 3D meshes.
Neural Pose Transfer by Spatially Adaptive Instance Normalization
The paper "Neural Pose Transfer by Spatially Adaptive Instance Normalization" introduces a sophisticated approach to the problem of pose transfer between 3D meshes, a task integral to fields such as animation, robotics, and virtual reality. This work leverages advanced techniques from deep learning to address and overcome limitations inherent in traditional methods, particularly focusing on human meshes but with potential applicability to non-human subjects as well.
Summary
In traditional pose transfer pipelines, establishing point-wise correspondences between source and target meshes requires substantial manual effort. This paper presents a neural network architecture that transfers poses without any predefined correspondences or auxiliary information. The key innovation is spatially adaptive instance normalization (SPAdaIN), a mechanism adapted from the style-transfer literature, which lets the model generalize pose transfer to identities not encountered during training.
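To make the mechanism concrete, the sketch below shows one way a SPAdaIN layer can be written in PyTorch: the pose features are instance-normalized and then re-modulated with per-vertex scale and bias predicted from the identity mesh's vertex coordinates. The channel widths, the choice of raw vertex coordinates as conditioning input, and the 1x1-convolution predictors are illustrative assumptions, not the authors' exact implementation.

```python
import torch.nn as nn

class SPAdaIN(nn.Module):
    """Spatially adaptive instance normalization (illustrative sketch).

    Normalizes per-vertex pose features with instance normalization, then
    modulates them with per-vertex scale and bias predicted from the
    identity mesh, so the identity condition varies across the surface.
    """
    def __init__(self, feat_channels, cond_channels=3):
        super().__init__()
        self.norm = nn.InstanceNorm1d(feat_channels, affine=False)
        # 1x1 convolutions over the vertex dimension produce spatially
        # varying modulation parameters from the identity mesh.
        self.gamma = nn.Conv1d(cond_channels, feat_channels, kernel_size=1)
        self.beta = nn.Conv1d(cond_channels, feat_channels, kernel_size=1)

    def forward(self, pose_feat, identity_verts):
        # pose_feat:      (batch, feat_channels, num_vertices)
        # identity_verts: (batch, cond_channels, num_vertices), e.g. xyz coords
        normalized = self.norm(pose_feat)
        return self.gamma(identity_verts) * normalized + self.beta(identity_verts)
```

Because the modulation parameters vary across vertices, the identity condition is injected spatially rather than as a single global statistic, which is what distinguishes this spatially adaptive variant from plain AdaIN.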
The network pairs a permutation-invariant pose feature extractor with a style-transfer decoder; the decoder stacks SPAdaIN-conditioned blocks that inject identity information into the extracted pose features. Because it relies on neither fixed correspondences nor additional inputs, the approach remains practical for real-world settings where meshes arrive in arbitrary vertex orders and from diverse sources.
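Building on the SPAdaIN layer sketched above, the following schematic shows how a decoder might stack SPAdaIN-conditioned residual blocks and project the result back to per-vertex coordinates. The block count, channel widths, and residual structure are placeholders chosen for illustration, not the paper's exact configuration.

```python
import torch.nn as nn

class SPAdaINResBlock(nn.Module):
    """Residual block conditioned on the identity mesh via SPAdaIN (sketch)."""
    def __init__(self, channels, cond_channels=3):
        super().__init__()
        self.norm1 = SPAdaIN(channels, cond_channels)
        self.norm2 = SPAdaIN(channels, cond_channels)
        self.conv1 = nn.Conv1d(channels, channels, kernel_size=1)
        self.conv2 = nn.Conv1d(channels, channels, kernel_size=1)
        self.act = nn.ReLU()

    def forward(self, x, identity_verts):
        h = self.conv1(self.act(self.norm1(x, identity_verts)))
        h = self.conv2(self.act(self.norm2(h, identity_verts)))
        return x + h

class Decoder(nn.Module):
    """Maps per-vertex pose features to output vertex coordinates."""
    def __init__(self, feat_channels=1024, cond_channels=3):
        super().__init__()
        self.blocks = nn.ModuleList(
            [SPAdaINResBlock(feat_channels, cond_channels) for _ in range(3)]
        )
        self.out = nn.Conv1d(feat_channels, 3, kernel_size=1)  # xyz per vertex

    def forward(self, pose_feat, identity_verts):
        x = pose_feat
        for block in self.blocks:
            x = block(x, identity_verts)
        return self.out(x)  # (batch, 3, num_vertices)

# Illustrative usage with random tensors (batch of 2 meshes, 6890 vertices):
# pose_feat      = torch.randn(2, 1024, 6890)  # from the pose feature extractor
# identity_verts = torch.randn(2, 3, 6890)     # identity mesh vertex coordinates
# out_verts      = Decoder()(pose_feat, identity_verts)  # (2, 3, 6890)
```

In this sketch the pose features and identity vertices are assumed to share the same vertex count so the elementwise modulation is well defined; the generated mesh then reuses the identity mesh's connectivity.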
Key Findings
The proposed model shows clear improvements over traditional pose transfer techniques such as Deformation Transfer, which depends on additional control points and manual annotation to work well. Empirical evaluations indicate that the model achieves lower Point-wise Mesh Euclidean Distance (PMD), a per-vertex error metric between the generated and ground-truth meshes, while remaining computationally practical.
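As a reference point, a minimal sketch of how such a metric can be computed is shown below, assuming the predicted and ground-truth meshes share the same vertex ordering; the paper's exact definition (for instance, whether distances are squared or averaged differently) may differ.

```python
import torch

def pointwise_mesh_euclidean_distance(pred_verts, gt_verts):
    """Point-wise Mesh Euclidean Distance (PMD), illustrative version.

    pred_verts, gt_verts: (num_vertices, 3) tensors with vertices in the
    same order. Computed here as the mean per-vertex Euclidean distance.
    """
    return torch.linalg.norm(pred_verts - gt_verts, dim=-1).mean()
```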
Ablation studies confirm that the SPAdaIN component is critical to maintaining performance on unseen identities and poses. Robustness is further demonstrated through successful pose transfer on noisy input meshes and generalization to meshes outside the training distribution, such as those from FAUST and the MG-dataset, underscoring the method's versatility.
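To make the noisy-input setting concrete, one simple way to probe this kind of robustness is to perturb the input vertex coordinates with Gaussian noise before running a trained model; the helper and noise level below are illustrative assumptions rather than the paper's evaluation protocol.

```python
import torch

def add_vertex_noise(verts: torch.Tensor, std: float = 0.01) -> torch.Tensor:
    """Perturb mesh vertices with isotropic Gaussian noise.

    `std` is a hypothetical noise level, not a value from the paper; feeding
    such perturbed meshes to a trained model shows how gracefully the
    transferred pose degrades as the input gets noisier.
    """
    return verts + std * torch.randn_like(verts)
```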
Implications and Future Work
The method outlined in this paper has major implications for fields that demand robust and flexible 3D pose transfer. Its independence from explicit mesh correspondences considerably lowers the bar for deploying pose transfer in interactive and real-time applications, and the architecture suggests extensions beyond human figures to other non-rigid objects or characters, with applications in autonomous systems and human-machine interaction.
Future work could adapt and refine SPAdaIN for other shape domains, improve scalability so that high-resolution meshes can be handled directly, and reduce training cost so that broader datasets can be used without intensive computational resources. Extending the method to global features learned across multiple source identities and poses could further improve its generalization and accuracy.
In conclusion, integrating spatially adaptive instance normalization into the pose transfer pipeline is a substantive advance toward realistic and precise mesh deformations, and a notable contribution to 3D deep learning and computer graphics.