
Multimodal Active Measurement for Human Mesh Recovery in Close Proximity (2310.08116v5)

Published 12 Oct 2023 in cs.RO and cs.CV

Abstract: For physical human-robot interactions (pHRI), a robot needs to accurately estimate the body pose of a target person. However, in these pHRI scenarios, the robot cannot fully observe the target person's body with its equipped cameras because the target person must be close to the robot for physical interaction. This close distance leads to severe truncation and occlusion and thus results in poor accuracy of human pose estimation. For better accuracy in this challenging environment, we propose an active measurement and sensor fusion framework that combines the equipped cameras with touch and ranging sensors such as 2D LiDAR. Touch and ranging sensor measurements are sparse but reliable and informative cues for localizing human body parts. In our active measurement process, camera viewpoints and sensor placements are dynamically optimized to measure body parts with higher estimation uncertainty, which is closely related to truncation or occlusion. In our sensor fusion process, assuming that the measurements of touch and ranging sensors are more reliable than the camera-based estimations, we fuse the sensor measurements with the camera-based estimated pose by aligning the estimated pose towards the measured points. Our proposed method outperformed previous methods on the standard occlusion benchmark with simulated active measurement. Furthermore, our method reliably estimated human poses using a real robot, even with practical constraints such as occlusion by blankets.
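
The two ideas in the abstract (measure the body part with the highest estimation uncertainty, then align the camera-based pose towards the measured points) can be illustrated with a toy sketch. This is not the paper's implementation: the function names, the per-joint uncertainty vector, and the translation-only alignment are all assumptions made for illustration. The actual method presumably operates on a full body mesh and optimizes camera viewpoints and sensor placements, which this sketch does not attempt.

```python
import numpy as np


def select_target_joint(joint_uncertainty):
    """Pick the joint to measure next: the one with the largest uncertainty.

    `joint_uncertainty` is a hypothetical 1-D array with one score per joint.
    """
    return int(np.argmax(joint_uncertainty))


def fuse_measurements(est_joints, measured_idx, measured_pts, weight=1.0):
    """Shift the camera-based estimate towards sparse sensor measurements.

    Assumption: a single rigid translation is applied to all joints, equal to
    the mean residual between measured points and the corresponding estimated
    joints. `weight` in [0, 1] controls how far the estimate is moved.
    """
    est = np.asarray(est_joints, dtype=float)
    measured = np.asarray(measured_pts, dtype=float)
    residual = measured - est[measured_idx]   # one row per measured joint
    offset = weight * residual.mean(axis=0)   # average translation
    return est + offset


# Toy example with three joints in 3D (values are made up).
est_joints = np.array([[0.0, 0.0, 0.0],
                       [1.0, 0.0, 0.0],
                       [0.0, 1.0, 0.0]])
uncertainty = np.array([0.1, 0.9, 0.3])

target = select_target_joint(uncertainty)          # joint to measure next
touch_point = np.array([[1.2, 0.0, 0.1]])          # simulated touch reading
fused = fuse_measurements(est_joints, [target], touch_point)
```

With `weight=1.0` and a single measurement, the measured joint lands exactly on the sensor reading, which mirrors the abstract's assumption that touch and ranging measurements are more reliable than the camera-based estimate.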

