Deep Learning-Based Human Pose Estimation: A Survey (2012.13392v5)
Abstract: Human pose estimation aims to locate the human body parts and build human body representation (e.g., body skeleton) from input data such as images and videos. It has drawn increasing attention during the past decade and has been utilized in a wide range of applications including human-computer interaction, motion analysis, augmented reality, and virtual reality. Although the recently developed deep learning-based solutions have achieved high performance in human pose estimation, there still remain challenges due to insufficient training data, depth ambiguities, and occlusion. The goal of this survey paper is to provide a comprehensive review of recent deep learning-based solutions for both 2D and 3D pose estimation via a systematic analysis and comparison of these solutions based on their input data and inference procedures. More than 250 research papers since 2014 are covered in this survey. Furthermore, 2D and 3D human pose estimation datasets and evaluation metrics are included. Quantitative performance comparisons of the reviewed methods on popular datasets are summarized and discussed. Finally, the challenges involved, applications, and future research directions are concluded. A regularly updated project page is provided: \url{https://github.com/zczcwh/DL-HPE}
- PoseTrack: A Benchmark for Human Pose Estimation and Tracking. In CVPR.
- Posetrack: A benchmark for human pose estimation and tracking. In CVPR.
- 2d human pose estimation: New benchmark and state of the art analysis. In CVPR.
- Actionxpose: A novel 2d multi-view pose-based algorithm for real-time human action recognition. In arXiv preprint arXiv:1810.12126.
- Exploiting Temporal Context for 3D Human Pose Estimation in the Wild. In CVPR.
- Bruno Artacho and Andreas Savakis. 2020. UniPose: Unified Human Pose Estimation in Single Images and Videos. In CVPR.
- Vasileios Belagiannis and Andrew Zisserman. 2017. Recurrent human pose estimation. In FG.
- PandaNet: Anchor-Based Single-Shot Multi-Person 3D Pose Estimation. In CVPR.
- Learning temporal pose estimation from sparsely-labeled videos. In NeurIPS.
- Keep it SMPL: Automatic Estimation of 3D Human Pose and Shape from a Single Image. In ECCV.
- Adrian Bulat and Georgios Tzimiropoulos. 2016. Human pose estimation via convolutional part heatmap regression. In ECCV.
- 3D Pictorial Structures for Multiple View Articulated Pose Estimation. In CVPR.
- Exploiting Spatial-Temporal Relationships for 3D Pose Estimation via Graph Convolutional Networks. In ICCV.
- Learning Delicate Local Representations for Multi-Person Pose Estimation. In arXiv preprint arXiv:2003.04030.
- Long-term human motion prediction with scene context. In ECCV.
- Realtime multi-person 2d pose estimation using part affinity fields. In CVPR.
- Human pose estimation with iterative error feedback. In CVPR.
- Ching-Hang Chen and Deva Ramanan. 2017. 3D Human Pose Estimation = 2D Pose Estimation + Matching. In CVPR.
- Unsupervised 3D Pose Estimation With Geometric Self-Supervision. In CVPR.
- Multi-person 3D Pose Estimation in Crowded Scenes Based on Multi-View Geometry. In ECCV.
- Patient-specific pose estimation in clinical environments. In JTEHM, Vol. 6. 1–11.
- Cross-View Tracking for Multi-Human 3D Pose Estimation at Over 100 FPS. In CVPR.
- Anatomy-aware 3D Human Pose Estimation with Bone-based Pose Decomposition. In TCSVT, Vol. 32. 198–209.
- A simple framework for contrastive learning of visual representations. In arXiv preprint arXiv:2002.05709.
- Fall Detection Based on Key Points of Human-Skeleton Using OpenPose. In Symmetry, Vol. 12. 744.
- Xianjie Chen and Alan L Yuille. 2014. Articulated pose estimation by a graphical model with image dependent pairwise relations. In NeurIPS.
- Adversarial posenet: A structure-aware convolutional network for human pose estimation. In ICCV.
- Monocular human pose estimation: A survey of deep learning-based methods. In CVIU, Vol. 192. 102897.
- Cascaded pyramid network for multi-person pose estimation. In CVPR.
- Towards Part-aware Monocular 3D Human Pose Estimation: An Architecture Search Approach. In ECCV.
- HigherHRNet: Scale-Aware Representation Learning for Bottom-Up Human Pose Estimation. In CVPR.
- Graph and Temporal Convolutional Networks for 3D Multi-person Pose Estimation in Monocular Videos. In AAAI.
- Monocular 3D Multi-Person Pose Estimation by Integrating Top-Down and Bottom-Up Networks. In CVPR.
- Occlusion-Aware Networks for 3D Human Pose Estimation in Video. In ICCV.
- Cross-Attention of Disentangled Modalities for 3D Human Mesh Recovery with Transformers. In ECCV.
- Beyond Static Features for Temporally Consistent 3D Human Pose and Shape From a Video. In CVPR.
- Pose2Mesh: Graph Convolutional Network for 3D Human Pose and Mesh Recovery from a 2D Human Pose. In ECCV.
- Self adversarial training for human pose estimation. In APSIPA ASC.
- Structured feature learning for pose estimation. In CVPR.
- Multi-context attention for human pose estimation. In CVPR.
- Optimizing Network Structure for 3D Human Pose Estimation. In ICCV.
- Bodies at Rest: 3D Human Pose and Shape Estimation From a Pressure Image Using Synthetic Data. In CVPR.
- Learning 3D Human Pose from Structure and Motion. In ECCV.
- VPN: Learning Video-Pose Embedding for Activities of Daily Living. In ECCV.
- Adapting MobileNets for mobile based upper body pose estimation. In AVSS.
- Joint flow: Temporal flow fields for multi person tracking. In arXiv preprint arXiv:1805.04596.
- Fast and Robust Multi-Person 3D Pose Estimation From Multiple Views. In CVPR.
- Shape-aware Multi-Person Pose Estimation from Multi-view Images. In ICCV.
- Can 3d pose be learned from 2d projections alone?. In ECCV.
- 2d articulated human pose estimation and retrieval in (almost) unconstrained still images. In IJCV, Vol. 99. 190–214.
- Neural Architecture Search: A Survey. In JMLR, Vol. 20. 1997–2017.
- Compressed Volumetric Heatmaps for Multi-Person 3D Pose Estimation. In CVPR.
- Combining local appearance and holistic view: Dual-source deep neural networks for human pose estimation. In CVPR.
- Rmpe: Regional multi-person pose estimation. In ICCV.
- Learning to refine human pose estimation. In CVPR Workshops.
- Martin Fisch and Ronald Clark. 2020. Orientation Keypoints for 6D Human Pose Estimation. In arXiv preprint arXiv:2009.04930.
- Hierarchical Kinematic Human Mesh Recovery. In ECCV.
- Chained predictions using convolutional neural networks. In ECCV.
- Human pose estimation from monocular images: A comprehensive survey. In Sensors, Vol. 16. 1966.
- Generative adversarial nets. In NeurIPS.
- Home-based physical therapy with an interactive computer vision system. In ICCV Workshops.
- Multi-domain pose network for multi-person pose estimation and tracking. In ECCV Workshops.
- In the Wild Human Pose Estimation Using Explicit 2D Features and Intermediate 3D Representations. In CVPR.
- Resolving 3D Human Pose Ambiguities with 3D Scene Constraints. In ICCV.
- Deep residual learning for image recognition. In CVPR.
- Human pose estimation and activity recognition from multi-view videos: Comparative explorations of recent developments. In IEEE J Sel Top Signal Process, Vol. 6. 538–552.
- Part Aware Contrastive Learning for Self-Supervised Action Recognition. In IJCAI.
- End-to-end Dynamic Matching Network for Multi-view Multi-person 3d Pose Estimation. In ECCV.
- DeepFuse: An IMU-Aware Network for Real-Time 3D Human Pose Estimation from Multi-View Image. In WACV.
- The Devil is in the Details: Delving into Unbiased Data Processing for Human Pose Estimation. In CVPR.
- A coarse-fine network for keypoint localization. In ICCV.
- Invariant Representation Learning for Infant Pose Estimation with Small Data. In arXiv preprint arXiv:2010.06100.
- Deep Inertial Poser: Learning to Reconstruct Human Pose from Sparse Inertial Measurements in Real Time. In ACM TOG.
- Arttrack: Articulated multi-person tracking in the wild. In CVPR.
- Deepercut: A deeper, stronger, and faster multi-person pose estimation model. In ECCV.
- Human3.6M: Large Scale Datasets and Predictive Methods for 3D Human Sensing in Natural Environments. In IEEE TPAMI, Vol. 36. 1325–1339.
- Umar Iqbal and Juergen Gall. 2016. Multi-person pose estimation with local joint-to-person associations. In ECCV.
- Optical Non-Line-of-Sight Physics-Based 3D Human Pose Estimation. In CVPR.
- Ehsan Jahangiri and Alan L Yuille. 2017. Generating multiple diverse hypotheses for human 3d pose consistent with 2d joint detections. In ICCV Workshops.
- Modeep: A deep learning framework using motion features for human pose estimation. In ACCV.
- On the Robustness of Human Pose Estimation. In CVPR Workshops.
- Towards understanding action recognition. In ICCV.
- A survey on monocular 3D human pose estimation. In Virtual Reality &\&& Intelligent Hardware, Vol. 2. 471–500.
- Xiaofei Ji and Honghai Liu. 2009. Advances in view-invariant human motion analysis: a review. In IEEE TSMC, Vol. 40. 13–24.
- Skeleton-Aware 3D Human Shape Reconstruction From Point Clouds. In ICCV.
- Coherent Reconstruction of Multiple Humans From a Single Image. In CVPR.
- Differentiable Hierarchical Graph Grouping for Multi-Person Pose Estimation. In arXiv preprint arXiv:2007.11864.
- Whole-Body Human Pose Estimation in the Wild. In arXiv preprint arXiv:2007.11858.
- Sam Johnson and Mark Everingham. 2010. Clustered Pose and Nonlinear Appearance Models for Human Pose Estimation. In BMVC.
- Sam Johnson and Mark Everingham. 2011. Learning effective human pose estimation from inaccurate annotation. In CVPR.
- Panoptic Studio: A Massively Multiview System for Social Interaction Capture. In IEEE TPAMI. 3334–3342.
- Total Capture: A 3D Deformation Model for Tracking Faces, Hands, and Bodies. In CVPR.
- A Multi-view RGB-D Approach for Human Pose Estimation in Operating Rooms. In WACV.
- Ladislav Kavan. 2014. Part I: direct skinning methods and deformation primitives. In ACM SIGGRAPH.
- Multi-scale structure-aware network for human pose estimation. In ECCV.
- Transformers in vision: A survey. In arXiv preprint arXiv:2101.01169.
- VIBE: Video inference for human body pose and shape estimation. In CVPR.
- Multiposenet: Fast multi-person pose estimation using pose residual network. In ECCV.
- Self-Supervised Learning of 3D Human Pose Using Multi-View Geometry. In CVPR.
- Learning to Reconstruct 3D Human Pose and Shape via Model-Fitting in the Loop. In ICCV.
- Convolutional Mesh Regression for Single-Image Human Shape Reconstruction. In CVPR.
- Probabilistic Modeling for Human Mesh Recovery. In ICCV.
- Pifpaf: Composite fields for human pose estimation. In CVPR.
- Imagenet classification with deep convolutional neural networks. In NeurIPS.
- Unsupervised Cross-Modal Alignment for Multi-Person 3D Pose Estimation. In ECCV.
- Self-Supervised 3D Human Pose Estimation via Part Guided Novel Image Synthesis. In CVPR.
- Kinematic-Structure-Preserved Representation for Unsupervised 3D Human Pose Estimation. In AAAI.
- Unite the People: Closing the Loop Between 3D and 2D Human Representations. In CVPR.
- Propagating LSTM: 3D Pose Estimation based on Joint Interdependency. In ECCV.
- Chen Li and Gim Hee Lee. 2019. Generating Multiple Hypotheses for 3D Human Pose Estimation With Mixture Density Network. In CVPR.
- Human pose regression with residual log-likelihood estimation. In ICCV.
- HMOR: Hierarchical Multi-Person Ordinal Relations for Monocular Multi-Person 3D Pose Estimation. In ECCV.
- Crowdpose: Efficient crowded scenes pose estimation and a new benchmark. In ICCV.
- HybrIK: A Hybrid Analytical-Neural Inverse Kinematics Solution for 3D Human Pose and Shape Estimation. In CVPR.
- Pose Recognition with Cascade Transformers. In CVPR.
- Sijin Li and Antoni B. Chan. 2014. 3D Human Pose Estimation from Monocular Images with Deep Convolutional Neural Network. In ACCV.
- Heterogeneous multi-task learning for human pose estimation with deep convolutional neural network. In CVPR Workshops.
- Maximum-Margin Structured Learning With Deep Networks for 3D Human Pose Estimation. In ICCV.
- MHFormer: Multi-Hypothesis Transformer for 3D Human Pose Estimation. In CVPR.
- Rethinking on multi-stage networks for human pose estimation. In arXiv preprint arXiv:1901.00148.
- Test-time personalization with a transformer for human pose estimation. In NeurIPS.
- Dense Intrinsic Appearance Flow for Human Pose Transfer. In CVPR.
- SimCC: A Simple Coordinate Classification Perspective for Human Pose Estimation. In ECCV.
- TokenPose: Learning Keypoint Tokens for Human Pose Estimation. In ICCV.
- A novel joint points and silhouette-based method to estimate 3D human pose and shape. In arXiv preprint arXiv:2012.06109.
- On Boosting Single-Frame 3D Human Pose Estimation via Monocular Videos. In ICCV.
- Junbang Liang and Ming C. Lin. 2019. Shape-Aware Human Pose and Shape Reconstruction Using Multi-View Images. In ICCV.
- Human pose estimation using deep consensus voting. In ECCV.
- End-to-end human pose and mesh reconstruction with transformers. In CVPR.
- Mesh Graphormer. In ICCV.
- Microsoft coco: Common objects in context. In ECCV.
- Human in events: A large-scale benchmark for human-centric video analysis in complex events. In arXiv preprint arXiv:2005.04490.
- Polarized self-attention: towards high-quality pixel-wise regression. In arXiv preprint arXiv:2107.00782.
- Adversarial Attack on Skeleton-based Human Action Recognition. In arXiv preprint arXiv:1909.06500.
- PoseTween: Pose-driven Tween Animation. In ACM UIST.
- A Comprehensive Study of Weight Sharing in Graph Networks for 3D Human Pose Estimation. In ECCV.
- Attention Mechanism Exploits Temporal Contexts: Real-Time 3D Human Pose Reconstruction. In CVPR.
- Deep dual consecutive network for human pose estimation. In CVPR.
- Temporal Feature Alignment and Mutual Information Maximization for Video-Based Human Pose Estimation. In CVPR.
- A survey of human pose estimation: the body parts parsing based methods. In JVCIR, Vol. 32. 10–19.
- Fully Convolutional Networks for Semantic Segmentation. In CVPR.
- SMPL: A Skinned Multi-Person Linear Model. In ACM TOG, Vol. 34. 1–16.
- Vision-based Estimation of MDS-UPDRS Gait Scores for Assessing Parkinson’s Disease Motor Severity. In arXiv preprint arXiv:2007.08920.
- Lstm pose machines. In CVPR.
- Rethinking the heatmap regression for bottom-up human pose estimation. In CVPR.
- 2d/3d pose estimation and action recognition using multitask deep learning. In CVPR.
- Human pose regression by combining indirect part detection and contextual information. In Computers & Graphics, Vol. 85. 15–22.
- Transfusion: Cross-view fusion with transformer for 3d human pose estimation. In BMVC.
- PPT: token-Pruned Pose Transformer for monocular and multi-view human pose estimation. In ECCV.
- Enhancing Human Pose Estimation in Ancient Vase Paintings via Perceptually-grounded Style Transfer Learning. In arXiv preprint arXiv:2012.05616.
- AMASS: Archive of Motion Capture as Surface Shapes. In ICCV.
- Tfpose: Direct human pose estimation with transformers. In arXiv preprint arXiv:2103.15320.
- Poseur: Direct Human Pose Regression with Transformers. In ECCV.
- Graph Embedded Pose Clustering for Anomaly Detection. In CVPR.
- A simple yet effective baseline for 3d human pose estimation. In ICCV.
- Monocular 3D Human Pose Estimation in the Wild Using Improved CNN Supervision. In 3DV.
- XNect: Real-time Multi-Person 3D Motion Capture with a Single RGB Camera. In ACM TOG, Vol. 39. 82–1.
- Single-Shot Multi-Person 3D Pose Estimation From Monocular RGB. In 3DV.
- Vnect: Real-time 3d human pose estimation with a single rgb camera. In ACM TOG, Vol. 36. 1–14.
- Real-time upper body detection and 3D pose estimation in monoscopic images. In ECCV.
- Multiview-Consistent Semi-Supervised Learning for 3D Human Pose Estimation. In CVPR.
- Thomas B Moeslund and Erik Granum. 2001. A survey of computer vision-based human motion capture. In CVIU, Vol. 81. 231–268.
- A survey of advances in vision-based human motion capture and analysis. In CVIU, Vol. 104. 90–126.
- Camera Distance-aware Top-down Approach for 3D Multi-person Pose Estimation from a Single RGB Image. In ICCV.
- Posefix: Model-agnostic general human pose refinement network. In CVPR.
- Gyeongsik Moon and Kyoung Mu Lee. 2020. I2L-MeshNet: Image-to-Lixel Prediction Network for Accurate 3D Human Pose and Mesh Estimation from a Single RGB Image. In ECCV.
- Francesc Moreno-Noguer. 2017. 3d human pose estimation from a single image via distance matrix regression. In CVPR.
- The Progress of Human Pose Estimation: A Survey and Taxonomy of Models Applied in 2D Human Pose Estimation. In IEEE Access, Vol. 8. 133330–133348.
- Associative embedding: End-to-end learning for joint detection and grouping. In NeurIPS.
- Stacked hourglass networks for human pose estimation. In ECCV.
- Numerical coordinate regression with convolutional neural networks. In arXiv preprint arXiv:1801.07372.
- Monocular 3D Human Pose Estimation by Predicting Depth on Joints. In ICCV.
- Unsupervised Human 3D Pose Representation with Viewpoint and Pose Disentanglement. In ECCV.
- Single-Stage Multi-Person Pose Machines. In ICCV.
- Neural Body Fitting: Unifying Deep Learning and Model-Based Human Pose and Shape Estimation. In 3DV.
- STAR: A Spare Trained Articulated Human Body Regressor. In ECCV.
- Paschalis Panteleris and Antonis Argyros. 2021. PE-former: Pose Estimation Transformer. In arXiv preprint arXiv:2112.04981.
- Personlab: Person pose estimation and instance segmentation with a bottom-up, part-based, geometric embedding model. In ECCV.
- Towards Accurate Multi-Person Pose Estimation in the Wild. In CVPR.
- TailorNet: Predicting Clothing in 3D as a Function of Human Pose, Shape and Garment Style. In CVPR.
- Expressive Body Capture: 3D Hands, Face, and Body from a Single Image. In CVPR.
- Ordinal Depth Supervision for 3D Human Pose Estimation. In CVPR.
- Coarse-to-Fine Volumetric Prediction for Single-Image 3D Human Pose. In CVPR.
- Harvesting Multiple Views for Marker-Less 3D Human Pose Annotations. In CVPR.
- Learning to Estimate 3D Human Pose and Shape from a Single Color Image. In CVPR.
- 3D human pose estimation in video with temporal convolutions and semi-supervised training. In CVPR.
- Jointly optimize data augmentation and network training: Adversarial data augmentation in human pose estimation. In CVPR.
- Flowing convnets for human pose estimation in videos. In ICCV.
- Deep convolutional neural networks for efficient pose estimation in gesture videos. In ACCV.
- Domes to Drones: Self-Supervised Active Triangulation for 3D Human Pose Reconstruction. In NeurIPS.
- Deepcut: Joint subset partition and labeling for multi person pose estimation. In CVPR.
- Dyna: A Model of Dynamic Human Shape in Motion. In ACM TOG, Vol. 34. 1–14.
- Ronald Poppe. 2007. Vision-based human motion analysis: An overview. In CVIU, Vol. 108. 4–18.
- Ammar Qammaz and Antonis A Argyros. 2019. MocapNET: Ensemble of SNN Encoders for 3D Human Pose Estimation in RGB Images. In BMVC.
- Pointnet: Deep learning on point sets for 3d classification and segmentation. In CVPR.
- Pointnet++: Deep hierarchical feature learning on point sets in a metric space. In NeurIPS.
- Cross View Fusion for 3D Human Pose Estimation. In ICCV.
- Peeking into occluded joints: A novel framework for crowd pose estimation. In arXiv preprint arXiv:2003.10506.
- Pose machines: Articulated pose estimation via inference machines. In ECCV.
- Mir Rayat Imtiaz Hossain and James J. Little. 2018. Exploiting temporal information for 3D human pose estimation. In ECCV.
- TesseTrack: End-to-End Learnable Multi-Person Articulated 3D Pose Tracking. In CVPR.
- Lightweight Multi-View 3D Pose Estimation Through Camera-Disentangled Representation. In CVPR.
- Faster r-cnn: Towards real-time object detection with region proposal networks. In NeurIPS.
- Unsupervised Geometry-Aware Representation for 3D Human Pose Estimation. In ECCV.
- Learning Monocular 3D Human Pose Estimation From Multi-View Images. In CVPR.
- LCR-Net: Localization-Classification-Regression for Human Pose. In CVPR.
- LCR-Net++: Multi-person 2D and 3D Pose Detection in Natural Images. In IEEE TPAMI, Vol. 42. 1146–1161.
- Sebastian Ruder. 2017. An overview of multi-task learning in deep neural networks. In arXiv preprint arXiv:1706.05098.
- Markerless Outdoor Human Motion Capture Using Multiple Autonomous Micro Aerial Vehicles. In ICCV.
- PIFu: Pixel-Aligned Implicit Function for High-Resolution Clothed Human Digitization. In ICCV.
- PIFuHD: Multi-Level Pixel-Aligned Implicit Function for High-Resolution 3D Human Digitization. In CVPR.
- Ben Sapp and Ben Taskar. 2013. Modec: Multimodal decomposable models for human pose estimation. In CVPR.
- 3d human pose estimation: A review of the literature and analysis of covariates. In CVIU, Vol. 152. 1–20.
- Monocular 3D Human Pose Estimation by Generation and Ordinal Ranking. In ICCV.
- End-to-End Multi-Person Pose Estimation With Transformers. In CVPR.
- HumanEva: Synchronized video and motion capture dataset and baseline algorithm for evaluation of articulated human motion. In IJCV, Vol. 87. 4.
- 15 keypoints is all you need. In CVPR.
- Multi-person pose estimation with enhanced channel-wise and spatial information. In CVPR.
- View-Invariant Probabilistic Embedding for Human Pose. In ECCV.
- Deep high-resolution representation learning for human pose estimation. In CVPR.
- Compositional human pose regression. In ICCV.
- Wei Tang and Ying Wu. 2019. Does Learning Specific Features for Related Parts Help Human Pose Estimation?. In CVPR.
- Deeply learned compositional models for human pose estimation. In ECCV.
- Structured Prediction of 3D Human Pose with Deep Neural Networks. In BMVC.
- Learning to fuse 2d and 3d image cues for monocular body pose estimation. In ICCV.
- Direct Prediction of 3D Body Poses From Motion Compensated Sequences. In CVPR.
- DirectPose: Direct End-to-End Multi-Person Pose Estimation. In arXiv preprint arXiv:1911.07451.
- SelfPose: 3D Egocentric Pose Estimation from a Headset Mounted Camera. In arXiv preprint arXiv:2011.01519.
- xR-EgoPose: Egocentric 3D Human Pose from an HMD Camera. In ICCV.
- Efficient object localization using convolutional networks. In CVPR.
- Joint training of a convolutional network and a graphical model for human pose estimation. In NeurIPS.
- Alexander Toshev and Christian Szegedy. 2014. Deeppose: Human pose estimation via deep neural networks. In CVPR.
- Total Capture: 3D Human Pose Estimation Fusing Video and Inertial Sensors. In BMVC.
- VoxelPose: Towards Multi-Camera 3D Human Pose Estimation in Wild Environment. In ECCV.
- Self-Supervised Learning of Motion Capture. In NeurIPS.
- Self-supervised Keypoint Correspondences for Multi-Person Pose Estimation and Tracking in Videos. In arXiv preprint arXiv:2004.12652.
- Learning from synthetic humans. In CVPR.
- Ignas Budvytis Vince Tan and Roberto Cipolla. 2017. Indirect deep structured learning for 3D human body shape and pose prediction. In BMVC.
- Recovering Accurate 3D Human Pose in The Wild Using IMUs and a Moving Camera. In ECCV.
- Sparse inertial poser: Automatic 3d human pose estimation from sparse imus. In Computer Graphics Forum.
- Bastian Wandt and Bodo Rosenhahn. 2019. RepNet: Weakly Supervised Training of an Adversarial Reprojection Network for 3D Human Pose Estimation. In CVPR.
- BLSM: A Bone-Level Skinned Model of the Human Mesh. In ECCV.
- Regularizing Vector Embedding in Bottom-Up Human Pose Estimation. In ECCV.
- Not All Parts Are Created Equal: 3D Pose Estimation by Modeling Bi-Directional Dependencies of Body Parts. In ICCV.
- Graph-PCNN: Two Stage Human Pose Estimation with Graph Pose Refinement. In arXiv preprint arXiv:2007.10599.
- AI Coach: Deep Human Pose Estimation and Analysis for Personalized Athletic Training Assistance. In ACM MM.
- Motion Guided 3D Pose Estimation from Videos. In ECCV.
- Sequential 3D Human Pose and Shape Estimation From Point Clouds. In CVPR.
- DRPose3D: Depth Ranking in 3D Human Pose Estimation. In IJCAI.
- Direct Multi-view Multi-person 3D Human Pose Estimation. In NeurIPS.
- Lite pose: Efficient architecture design for 2d human pose estimation. In CVPR.
- Distribution-Aware Single-Stage Models for Multi-Person 3D Pose Estimation. In CVPR.
- Convolutional pose machines. In CVPR.
- Photo Wake-Up: 3D Character Animation From a Single Photo. In CVPR.
- Pose2Pose: pose selection and transfer for 2D character animation. In IUI.
- Ai challenger: A large-scale dataset for going deeper in image understanding. In arXiv preprint arXiv:1711.06475.
- Monocular total capture: Posing face, body, and hands in the wild. In CVPR.
- Simple Baselines for Human Pose Estimation and Tracking. In ECCV.
- MetaFuse: A Pre-trained Fusion Model for Human Pose Estimation. In CVPR.
- A2J: Anchor-to-Joint Regression Network for 3D Articulated Pose Estimation from a Single Depth Image. In ICCV.
- GHUM &\&& GHUML: Generative 3D Human Shape and Articulated Pose Models. In CVPR.
- Deep Kinematics Analysis for Monocular 3D Human Pose Estimation. In CVPR.
- Vipnas: Efficient video pose estimation via neural architecture search. In CVPR.
- Mo 2 cap 2: Real-time mobile 3d motion capture with a cap-mounted fisheye camera. In IEEE TVCG Proc. VR, Vol. 25. 2093–2101.
- 3D Human Shape and Pose from a Single Low-Resolution Image with Self-Supervised Learning. In ECCV.
- Spatial temporal graph convolutional networks for skeleton-based action recognition. In AAAI.
- Transpose: Keypoint localization via transformer. In ICCV.
- Learning feature pyramids for human pose estimation. In ICCV.
- End-to-end learning of deformable mixture of parts and deep convolutional neural networks for human pose estimation. In CVPR.
- 3D Human Pose Estimation in the Wild by Adversarial Learning. In CVPR.
- Yi Yang and Deva Ramanan. 2012. Articulated human detection with flexible mixtures of parts. In IEEE TPAMI, Vol. 35. 2878–2890.
- Faster VoxelPose: Real-time 3D Human Pose Estimation by Orthographic Projection. In ECCV.
- Lite-hrnet: A lightweight high-resolution network. In CVPR.
- DoubleFusion: Real-time Capture of Human Performances with Inner Body Shapes from a Single Depth Sensor. In IEEE TPAMI. 7287–7296.
- Simulcap: Single-view human performance capture with cloth simulation. In CVPR.
- Hrformer: High-resolution vision transformer for dense predict. In NeurIPS.
- Weakly Supervised 3D Human Pose and Shape Reconstruction with Normalizing Flows. In ECCV.
- Monocular 3D Pose and Shape Estimation of Multiple People in Natural Scenes: The Importance of Multiple Scene Constraints. In CVPR.
- Deep Network for the Integrated 3D Sensing of Multiple People in Natural Images. In NeurIPS.
- SRNet: Improving Generalization in 3D Human Pose Estimation with a Split-and-Recombine Approach. In ECCV.
- 3D Human Mesh Regression With Dense Correspondence. In CVPR.
- Distribution-aware coordinate representation for human pose estimation. In CVPR.
- Fast human pose estimation. In CVPR.
- Human pose estimation with spatial contextual information. In arXiv preprint arXiv:1901.01760.
- Vid2Player: Controllable Video Sprites that Behave and Appear like Professional Tennis Players. In arXiv preprint arXiv:2008.04524.
- PyMAF: 3D Human Pose and Shape Regression with Pyramidal Mesh Alignment Feedback Loop. In ICCV.
- MixSTE: Seq2seq Mixed Spatio-Temporal Encoder for 3D Human Pose Estimation in Video. In CVPR.
- Object-Occluded Human Shape and Pose Estimation From a Single Color Image. In CVPR.
- EfficientPose: Efficient Human Pose Estimation with Neural Architecture Search. In arXiv preprint arXiv:2012.07086.
- From actemes to action: A strongly-supervised representation for detailed action understanding. In ICCV.
- 4D Association Graph for Realtime Multi-person Motion Capture Using Multiple Video Cameras. In CVPR.
- Key Frame Proposal Network for Efficient Pose Estimation in Videos. In arXiv preprint arXiv:2007.15217.
- Fusing Wearable IMUs With Multi-View Images for Human Pose Estimation: A Geometric Approach. In CVPR.
- AdaFuse: Adaptive Multiview Fusion for Accurate Human Pose Estimation in the Wild. In IJCV, Vol. 129. 703–718.
- Semantic Graph Convolutional Networks for 3D Human Pose Regression. In CVPR.
- Through-Wall Human Mesh Recovery Using Radio Signals. In ICCV.
- RF-based 3D skeletons. In SIGCOMM.
- PoseFormerV2: Exploring Frequency Domain for Efficient and Robust 3D Human Pose Estimation. In CVPR.
- SMAP: Single-Shot Multi-Person Absolute 3D Pose Estimation. In ECCV.
- POTTER: Pooling Attention Transformer for Efficient Human Mesh Recovery. In CVPR.
- A Lightweight Graph Transformer Network for Human Mesh Reconstruction from 2D Human Pose. In ACM Multimedia.
- FeatER: An Efficient Network for Human Reconstruction via Feature Map-Based TransformER. In CVPR.
- 3D Human Pose Estimation with Spatial and Temporal Transformers. In ICCV.
- TexMesh: Reconstructing Detailed Human Texture and Geometry from RGB-D Video. In ECCV.
- Unsupervised Shape and Pose Disentanglement for 3D Meshes. In arXiv preprint arXiv:2007.11341.
- HEMlets Pose: Learning Part-Centric Heatmap Triplets for Accurate 3D Human Pose Estimation. In ICCV.
- Towards 3D Human Pose Estimation in the Wild: A Weakly-Supervised Approach. In ICCV.
- Deep kinematic pose regression. In ECCV.
- Objects as points. In arXiv preprint arXiv:1904.07850.
- Sparseness Meets Deepness: 3D Human Pose Estimation from Monocular Video. In CVPR.
- Monocap: Monocular human motion capture using a cnn coupled with a geometric prior. In IEEE TPAMI, Vol. 41. 901–914.
- Detailed human shape estimation from a single image by hierarchical mesh deformation. In CVPR.
- Reconstructing NBA players. In ECCV.
- Multi-person pose estimation for posetrack with enhanced part affinity fields. In ICCV PoseTrack Workshop.
- Zhiming Zou and Wei Tang. 2021. Modulated Graph Convolutional Network for 3D Human Pose Estimation. In ICCV.
- Silvia Zuffi and Michael J. Black. 2015. The Stitched Puppet: A Graphical Model of 3D Human Shape and Pose. In CVPR.
- Ce Zheng (45 papers)
- Wenhan Wu (9 papers)
- Chen Chen (753 papers)
- Taojiannan Yang (26 papers)
- Sijie Zhu (27 papers)
- Ju Shen (9 papers)
- Nasser Kehtarnavaz (15 papers)
- Mubarak Shah (208 papers)