Papers
Topics
Authors
Recent
Search
2000 character limit reached

CLOAF: CoLlisiOn-Aware Human Flow

Published 14 Mar 2024 in cs.CV | (2403.09050v1)

Abstract: Even the best current algorithms for estimating body 3D shape and pose yield results that include body self-intersections. In this paper, we present CLOAF, which exploits the diffeomorphic nature of Ordinary Differential Equations to eliminate such self-intersections while still imposing body shape constraints. We show that, unlike earlier approaches to addressing this issue, ours completely eliminates the self-intersections without compromising the accuracy of the reconstructions. Being differentiable, CLOAF can be used to fine-tune pose and shape estimation baselines to improve their overall performance and eliminate self-intersections in their predictions. Furthermore, we demonstrate how our CLOAF strategy can be applied to practically any motion field induced by the user. CLOAF also makes it possible to edit motion to interact with the environment without worrying about potential collision or loss of body-shape prior.

Definition Search Book Streamline Icon: https://streamlinehq.com
References (36)
  1. Inverse kinematics techniques in computer graphics: A survey. In Computer Graphics Forum. Wiley Online Library, 2018.
  2. An Inverse Kinematics Architecture for Enforcing an Arbitrary Number of Strict Priority Levels. The Visual Computer, 2004.
  3. Motion Capture of Hands in Action Using Discriminative Salient Points. In European Conference on Computer Vision, pages 640–653, 2012.
  4. Keep It SMPL: Automatic Estimation of 3D Human Pose and Shape from a Single Image. In European Conference on Computer Vision, 2016.
  5. Neural Ordinary Differential Equations. In Advances in Neural Information Processing Systems, 2018.
  6. Beyond Static Features for Temporally Consistent 3D Human Pose and Shape from a Video. In Conference on Computer Vision and Pattern Recognition, 2021.
  7. Tangent-space optimization for interactive animation control. ACM Transactions on Graphics, 2019.
  8. Theory of Ordinary Differential Equations. Tata McGrawHill Education, 1955.
  9. G. Farin. Curves and Surfaces for Computer Added Geometric Design. Academic Press, 1992.
  10. Humans in 4D: Reconstructing and Tracking Humans with Transformers. In International Conference on Computer Vision, 2023.
  11. Style-Based Inverse Kinematics. In ACM SIGGRAPH, pages 522–531, 2004.
  12. Neural Mesh Flow: 3D Manifold Mesh Generation via Diffeomorphic Flows. In Advances in Neural Information Processing Systems, 2020.
  13. Resolving 3D Human Pose Ambiguities with 3D Scene Constraints. In International Conference on Computer Vision, 2019.
  14. MeshODE: A Robust and Scalable Framework for Mesh Deformation. arXiv Preprint, 2020.
  15. Shapeflow: Learnable deformation flows among 3d shapes. In Advances in Neural Information Processing Systems, 2020.
  16. Exemplar Fine-Tuning for 3D Human Pose Fitting Towards In-the-Wild 3D Human Pose Estimation. In International Conference on 3D Vision, 2020.
  17. End-To-End Recovery of Human Shape and Pose. In Conference on Computer Vision and Pattern Recognition, 2018.
  18. VIBE: Video Inference for Human Body Pose and Shape Estimation. In Conference on Computer Vision and Pattern Recognition, pages 5252–5262, 2020.
  19. PARE: Part attention regressor for 3D human body estimation. In International Conference on Computer Vision, 2021.
  20. Learning to Reconstruct 3D Human Pose and Shape via Model-Fitting in the Loop. In Conference on Computer Vision and Pattern Recognition, 2019.
  21. Hybrik: A hybrid analytical-neural inverse kinematics solution for 3d human pose and shape estimation. In Conference on Computer Vision and Pattern Recognition, 2021.
  22. SMPL: A Skinned Multi-Person Linear Model. ACM SIGGRAPH Asia, 34(6), 2015.
  23. AMASS: Archive of Motion Capture as Surface Shapes. In International Conference on Computer Vision, pages 5442–5451, 2019.
  24. COAP: Compositional articulated occupancy of people. In Conference on Computer Vision and Pattern Recognition, 2022.
  25. I2L-MeshNet: Image-to-Lixel Prediction Network for Accurate 3D Human Pose and Mesh Estimation from a Single RGB Image. In European Conference on Computer Vision, 2020.
  26. On self-contact and human pose. In Conference on Computer Vision and Pattern Recognition, 2021.
  27. Occupancy Flow: 4D Reconstruction by Learning Particle Dynamics. In International Conference on Computer Vision, 2019.
  28. Expressive Body Capture: 3D Hands, Face, and Body from a Single Image. In Conference on Computer Vision and Pattern Recognition, 2019.
  29. Can Convolutional Resnets Approximately Preserve Input Distances? a Frequency Analysis Perspective. In arXiv Preprint, 2021.
  30. Fourier features let networks learn high frequency functions in low dimensional domains. Advances in Neural Information Processing Systems, 2020.
  31. Capturing Hands in Action Using Discriminative Salient Points and Physics Simulation. International Journal of Computer Vision, 118(2):172–193, 2016.
  32. Lipschitz Regularity of Deep Neural Networks: Analysis and Efficient Estimation. In Advances in Neural Information Processing Systems, 2018.
  33. Vector Field based Shape Deformations. ACM Transactions on Graphics, 2006.
  34. Recovering Accurate 3D Human Pose in the Wild Using IMUs and a Moving Camera. In European Conference on Computer Vision, 2018.
  35. Capturing Humans in Motion: Temporal-Attentive 3D Human Pose and Shape Estimation from Monocular Video. In Conference on Computer Vision and Pattern Recognition, 2022.
  36. We are more than our joints: Predicting how 3d bodies move. In Conference on Computer Vision and Pattern Recognition, 2021.

Summary

  • The paper introduces CLOAF, a novel ODE-based method that prevents self-intersections in 3D human pose and shape estimation.
  • It integrates SMPL body models into an ODE formulation, ensuring continuous and geometrically constrained deformations.
  • CLOAF enhances pose estimation via differentiable end-to-end training, promising realistic motion for applications in animation and robotics.

CoLlisiOn-Aware Human Flow: A New Approach to Eliminating Self-Intersections in 3D Human Pose and Shape Estimation

Introduction

The estimation of 3D human pose and shape from single images has made significant strides, with state-of-the-art methods such as transformer-based architectures achieving impressive accuracies. However, these models, even those pre-trained on extensive image datasets and fine-tuned on 3D body shape datasets, sometimes produce estimates with self-intersections of body parts. These unrealistic poses pose challenges in applications like robotics and animation, where maintaining realistic human motions is crucial. In this context, we introduce a novel approach, CoLlisiOn-Aware Flow (CLOAF), which leverages the properties of Ordinary Differential Equations (ODEs) to eliminate self-intersections in a differentiable and geometrically constrained manner.

Current Approaches to Handling Self-Intersections

Existing methods to address the issue of self-intersections in body shape and pose estimation primarily adopt iterative optimization techniques. These include penalizing self-intersections explicitly through loss functions or modifying the training data to reduce the likelihood of self-intersections. However, these approaches either lack differentiability, precluding their use in training deep networks, or do not offer guarantees against self-intersections at inference time.

The CLOAF Method

CLOAF operates by formulating the body shape deformation process as the solution to an ODE, inherently preventing self-intersections due to the diffeomorphic nature of ODE solutions. Given an initial body representation devoid of self-intersections, CLOAF can interpolate towards a target pose, ensuring that all intermediate poses adhere to realistic body shape constraints without self-intersections.

At the core of CLOAF is the integration of a body model, such as SMPL, into the ODE formulation. This integration allows us to compute trajectories in the parameter space of the body model, ensuring the imposition of the body shape prior throughout the deformation process. Through differentiable operations, CLOAF can be incorporated into the training pipelines of pose estimation networks to enhance their performance and reduce self-intersections.

Practical Implications and Future Directions

Our approach is not only capable of refining single-frame pose estimations to eliminate self-intersections but also introduces a methodology that can be integrated into the pose estimation networks' training process. This ability to refine and improve baseline methods through end-to-end training represents a significant advance in the pursuit of more accurate and realistic human pose and shape estimation techniques.

Moreover, the flexibility of CLOAF allows it to be applied to various motion fields induced by the user, paving the way for interactive applications where body movements can be edited in a collision-aware manner. This includes scenarios where human figures interact with environments or objects in a physically plausible way.

Conclusion

In summary, CLOAF represents an innovative step forward in addressing the persistent challenge of self-intersections in 3D human pose and shape estimation. By leveraging the properties of ODEs within a differentiable framework that respects geometric body shape constraints, CLOAF offers a robust solution that enhances the realism and accuracy of pose estimation models. Moving forward, the potential applications of CLOAF in the realms of animation, virtual reality, and robotics are vast and promising, heralding a new era of interaction between digital human representations and their environments.

Paper to Video (Beta)

Whiteboard

No one has generated a whiteboard explanation for this paper yet.

Open Problems

We haven't generated a list of open problems mentioned in this paper yet.

Continue Learning

We haven't generated follow-up questions for this paper yet.

Collections

Sign up for free to add this paper to one or more collections.

Tweets

Sign up for free to view the 2 tweets with 40 likes about this paper.