Ecosyste.ms: Awesome

An open API service indexing awesome lists of open source software.

Awesome Lists | Featured Topics | Projects

https://github.com/nxsEdson/awesome-3D-human

A detailed survey on 3D human pose estimation
https://github.com/nxsEdson/awesome-3D-human

List: awesome-3D-human

Last synced: 3 months ago
JSON representation

A detailed survey on 3D human pose estimation

Awesome Lists containing this project

README

        

# Awesome-3D-Human-Pose
A summary on 3D human pose estimation

## Survey
* [Recovering 3D Human Mesh from Monocular Images: A Survey](https://arxiv.org/abs/2203.01923)
Yating Tian, Hongwen Zhang, Yebin Liu, Limin Wang

## Body Models
* **\[SCAPE\]** [SCAPE: Shape Completion and Animation of People.](https://ai.stanford.edu/~drago/Papers/shapecomp.pdf) D. Anguelov, P. Srinivasan, D. Koller, S. Thrun, J. Rodgers, and J. Davis. *ACM Trans. Graphics, 2005*
**One Sentence Summary**: First body model disentangling human body into rigid transformation of pose, id-related shape, and pose-related shape.

* **\[SMPL\]** [SMPL: A Skinned Multi-Person Linear Model.](https://smpl.is.tue.mpg.de) Loper, Matthew and Mahmood, Naureen and Romero, Javier and Pons-Moll, Gerard and Black, Michael J. *ACM Trans. Graphics, 2015*
**One Sentence Summary**: The most widely-used body model which can be easily used in rendering engines for animation (with bones).

* **\[SMPL-X\]** [Expressive Body Capture: 3D Hands, Face, and Body from a Single Image](https://smpl-x.is.tue.mpg.de/) Pavlakos, Georgios and Choutas, Vasileios and Ghorbani, Nima and Bolkart, Timo and Osman, Ahmed A. A. and Tzionas, Dimitrios and Black, Michael J. *CVPR 2019*
**One Sentence Summary**: SMPL + MANO (hand model) + FLAME (head model)

* **\[STAR\]** [STAR: A Sparse Trained Articulated Human Body Regressor](https://star.is.tue.mpg.de) Osman, Ahmed A A and Bolkart, Timo and Black, Michael J. *ECCV 2020*
**One Sentence Summary**: Disentangling the pose-related blend shapes in SMPL to per-joint pose-related blend shapes

* **\[DeepDaz\]** [UltraPose: Synthesizing Dense Pose with 1 Billion Points by Human-body Decoupling 3D Model](https://github.com/MomoAILab/ultrapose) Haonan Yan, Jiaqi Chen, Xujie Zhang, Shengkai Zhang, Nianhong Jiao, Xiaodan Liang, Tianxiang Zheng. *ICCV 2021*
**One Sentence Summary**: Human body model with parameters having a specific physical meaning and decoupled with each other (based on [Daz](https://www.daz3d.com/) model)

* **\[GHUM\]** [GHUM & GHUML: Generative 3D Human Shape and Articulated Pose Models](https://openaccess.thecvf.com/content_CVPR_2020/papers/Xu_GHUM__GHUML_Generative_3D_Human_Shape_and_Articulated_Pose_CVPR_2020_paper.pdf) Hongyi Xu, Eduard Gabriel Bazavan, Andrei Zanfir, William T. Freeman, Rahul Sukthankar, Cristian Sminchisescu. *CVPR 2020*
**One Sentence Summary**: Human body model with non-linear (VAEs) id-related shape and face expression embedding spaces.

## Human Mesh Recovery

### From Single Images

* **\[SMPLify\]** [Keep it SMPL: Automatic Estimation of 3D Human Pose and Shape from a Single Image.](https://smplify.is.tue.mpg.de) Bogo, Federica and Kanazawa, Angjoo and Lassner, Christoph and Gehler, Peter and Romero, Javier and Black, Michael J. *ECCV 2016*
**One Sentence Summary**: One optimizion-based method using the reprojection loss of keypoints as well as several regularization terms.

* **\[HMR\]** [End-to-end Recovery of Human Shape and Pose.](https://openaccess.thecvf.com/content_cvpr_2018/papers/Kanazawa_End-to-End_Recovery_of_CVPR_2018_paper.pdf) Angjoo Kanazawa, Michael J. Black, David W. Jacobs, Jitendra Malik. *CVPR 2018*
**One Sentence Summary**: Human mesh recovery using reprojection loss of keypoints and adversary training to avoid unreasonable pose.

* **\[\]** [Learning to Estimate 3D Human Pose and Shape from a Single Color Image.](https://openaccess.thecvf.com/content_cvpr_2018/papers/Pavlakos_Learning_to_Estimate_CVPR_2018_paper.pdf) Georgios Pavlakos, Luyang Zhu, Xiaowei Zhou, Kostas Daniilidis *CVPR 2018*
**One Sentence Summary**: Training the network with keypoint heatmaps and masks as supervision for SMPL parameters regression.

* **\[SPIN\]** [Learning to Reconstruct 3D Human Pose and Shape via Model-fitting in the Loop.](https://openaccess.thecvf.com/content_ICCV_2019/papers/Kolotouros_Learning_to_Reconstruct_3D_Human_Pose_and_Shape_via_Model-Fitting_ICCV_2019_paper.pdf) Kolotouros, Nikos and Pavlakos, Georgios and Black, Michael J and Daniilidis, Kostas. *ICCV 2019*
**One Sentence Summary**: HMR + SMPLify (HMR is used to inilize the body model parameters & SMPLify is used to refine these parameters. The refined parameters are further used as the surpervision for the network.)

* **\[\]** [On the Continuity of Rotation Representations in Neural Networks.](https://openaccess.thecvf.com/content_CVPR_2019/papers/Zhou_On_the_Continuity_of_Rotation_Representations_in_Neural_Networks_CVPR_2019_paper.pdf) Yi Zhou, Connelly Barnes, Jingwan Lu, Jimei Yang, and Hao Li. *CVPR 2019*
**One Sentence Summary**: A new continuous representations for joint rotation.

* **\[GraphCMR\]** [Convolutional Mesh Regression for Single-Image Human Shape Reconstruction.](https://github.com/nkolot/GraphCMR) Nikos Kolotouros, Georgios Pavlakos, Kostas Daniilidis. *CVPR 2019*
**One Sentence Summary**: Directly regressing the meshes of human body with graph convolutions, then using meshes to regress SMPL parameters.

* **\[\]** [Delving Deep into Hybrid Annotations for 3D Human Recovery in the Wild.](https://penincillin.github.io/dct_iccv2019) Yu Rong, Ziwei Liu, Cheng Li, Kaidi Cao, Chen Change Loy. *ICCV 2019*
**One Sentence Summary**: A comprehensive study on the cost and effectiveness of different annotations for in-the-wild images. (Dense correspondence is effective.)

* **\[HoloPose\]** [HoloPose: Holistic 3D Human Reconstruction In-The-Wild.](https://www.arielai.com/holopose/) Rıza Alp Guler, and Iasonas Kokkinos. *CVPR 2019*
**One Sentence Summary**: Regressing the body model parameters from body-part features with reprojection loss of densepose and key points.

* **\[DecoMR\]** [3D Human Mesh Regression with Dense Correspondence.](https://openaccess.thecvf.com/content_CVPR_2020/papers/Zeng_3D_Human_Mesh_Regression_With_Dense_Correspondence_CVPR_2020_paper.pdf) Wang Zeng, Wanli Ouyang, Ping Luo, Wentao Liu, and Xiaogang Wang. *CVPR 2020*
**One Sentence Summary**: Recovering human mesh using the aligned features in UV space.

* **\[HKMR\]** [Hierarchical Kinematic Human Mesh Recovery.](https://www.ecva.net/papers/eccv_2020/papers_ECCV/papers/123620749.pdf) Georgios Georgakis, Ren Li, Srikrishna Karanam, Terrence Chen, Jana Ko seck a, and Ziyan Wu. *ECCV 2020*
**One Sentence Summary**: Optimizing the SMPL body pose parameters seperatly based on different parts of body.

* **\[PARE\]** [PARE: Part Attention Regressor for 3D Human Body Estimation.](https://pare.is.tue.mpg.de) Muhammed Kocabas, Chun-Hao P. Huang, Otmar Hilliges, and Michael J. Black. *ICCV 2021*
**One Sentence Summary**: Handling the occlusion problem using part attention. (Attention maps are inilized by segmentation mask and trained with the 3D branch jointly.)

* **\[DSR\]** [Learning to Regress Bodies from Images using Differentiable Semantic Rendering.](https://dsr.is.tue.mpg.de) Sai Kumar Dwivedi, Nikos Athanasiou, Muhammed Kocabas, Michael J. Black. *ICCV 2021*
**One Sentence Summary**: Using differentiable rendering to supervise the training of HMR with the semantic prior of clothes (calculated from AGORA).

* **\[Skeleton2Mesh\]** [Skeleton2Mesh: Kinematics Prior Injected Unsupervised Human Mesh Recovery.](https://dsr.is.tue.mpg.de) Zhenbo Yu, Junjie Wang, Jingwei Xu, Bingbing Ni, Chenglong Zhao, Minsi Wang, Wenjun Zhang. *ICCV 2021*
**One Sentence Summary**: 3D human pose estimation using differentiable IK.

* **\[PyMAF\]** [PyMAF: 3D Human Pose and Shape Regression with Pyramidal Mesh Alignment Feedback Loop.](https://github.com/HongwenZhang/PyMAF) Zhang, Hongwen and Tian, Yating and Zhou, Xinchi and Ouyang, Wanli and Liu, Yebin and Wang, Limin and Sun, Zhenan. *ICCV 2021*
**One Sentence Summary**: HMR (body model parameters regression network) using mesh-aligned multi-scale features & densepose supervisions.

* **\[METRO\]** [End-to-End Human Pose and Mesh Reconstruction with Transformers.]() Kevin Lin Lijuan Wang Zicheng Liu. *CVPR 2021*
**One Sentence Summary**: 3D human pose estimation using transformer. (3D joints/vertics locations are used as position embeddings.)

### From Videos

* **\[HMMR\]** [Learning 3D Human Dynamics from Video.](https://openaccess.thecvf.com/content_CVPR_2019/papers/Kanazawa_Learning_3D_Human_Dynamics_From_Video_CVPR_2019_paper.pdf) Angjoo Kanazawa, Jason Y. Zhang, Panna Felsen, Jitendra Malik *CVPR 2019*
**One Sentence Summary**: A temporal encoder with sliding windows and a hallucinator for the current time step to predict the pose of current and adjacent frames.

* **\[\]** [Exploiting temporal context for 3D human pose estimation in the wild.](https://openaccess.thecvf.com/content_CVPR_2019/papers/Arnab_Exploiting_Temporal_Context_for_3D_Human_Pose_Estimation_in_the_CVPR_2019_paper.pdf) Anurag Arnab, Carl Doersch, and Andrew Zisserman *CVPR 2019*
**One Sentence Summary**: Using buddle adjustment to add temporal smooth for pose estimation from video.

* **\[\]** [Human Mesh Recovery from Monocular Images via a Skeleton-disentangled Representation.](https://openaccess.thecvf.com/content_ICCV_2019/papers/Sun_Human_Mesh_Recovery_From_Monocular_Images_via_a_Skeleton-Disentangled_Representation_ICCV_2019_paper.pdf) Yu Sun, Yun Ye, Wu Liu, Wenpeng Gao, YiLi Fu, and Tao Mei *ICCV 2019*
**One Sentence Summary**: Extracting the skeleton and the rest detailed features separately, then using self-attention, temporal shuffling and adversarial training with these features to train the temporal human pose estimation model.

* **\[\]** [Occlusion-aware networks for 3d human pose estimation in video.](https://openaccess.thecvf.com/content_ICCV_2019/papers/Cheng_Occlusion-Aware_Networks_for_3D_Human_Pose_Estimation_in_Video_ICCV_2019_paper.pdf) Yu Cheng, Bo Yang, Bo Wang, Wending Yan, and Robby T. Tan. *ICCV 2019*
**One Sentence Summary**: 3D human pose estimating using 2D confidence heatmaps of keypoints and optical flow.

* **\[VIBE\]** [VIBE: Video Inference for Human Body Pose and Shape Estimation.](https://openaccess.thecvf.com/content_CVPR_2020/papers/Kocabas_VIBE_Video_Inference_for_Human_Body_Pose_and_Shape_Estimation_CVPR_2020_paper.pdf) Muhammed Kocabas, Nikos Athanasiou, Michael J. Black. *CVPR 2020*
**One Sentence Summary**: Temporal HMR (CNN+GRU for parameters regression & adversary training to avoid unreasonable temporal actions.)

* **\[iMoCap\]** [Motion Capture from Internet Videos.](https://zju3dv.github.io/iMoCap/) Junting Dong, Qing Shuai, Yuanqing Zhang, Xian Liu, Xiaowei Zhou, Hujun Bao. *ECCV 2020*
**One Sentence Summary**: Using the multi-view videos of the same celebrity performing a specific action to reconstruct the 3d human mesh using an optimization-based method.

* **\[TCMR\]** [Beyond Static Features for Temporally Consistent 3D Human Pose and Shape from a Video.](https://github.com/hongsukchoi/TCMR_RELEASE) Choi, Hongsuk and Moon, Gyeongsik and Chang, Ju Yong and Lee, Kyoung Mu. *CVPR 2021*
**One Sentence Summary**: VIBE + supervising the future and past SMPL body parameters with the overall predicted body model parameters.

* **\[\]** [Uncertainty-aware human mesh recovery from video by learning part-based 3D dynamics.](https://openaccess.thecvf.com/content/ICCV2021/papers/Lee_Uncertainty-Aware_Human_Mesh_Recovery_From_Video_by_Learning_Part-Based_3D_ICCV_2021_paper.pdf) Gun-Hee Lee, Seong-Whan Lee. *ICCV 2021*
**One Sentence Summary**: Using part-based features with uncertainty-aware mechanism as well as the optical flow features for body model parameters prediction.

* **\[SmoothNet\]** [SmoothNet: A Plug-and-Play Network for Refining Human Poses in Videos.](https://github.com/cure-lab/SmoothNet) Zeng, Ailing and Yang, Lei and Ju, Xuan and Li, Jiefeng and Wang, Jianyi and Xu, Qiang. *ECCV 2022*
**One Sentence Summary**: A plugin module to reduce the temporal jitter noises.

* **\[GLAMR\]** [GLAMR: Global Occlusion-Aware Human Mesh Recovery with Dynamic Cameras.](https://github.com/NVlabs/GLAMR) Ye Yuan, Umar Iqbal, Pavlo Molchanov, Kris Kitani, Jan Kautz. *CVPR 2022*
**One Sentence Summary**: Solving the occlusion problem with the prior of dynamic camera.

* **\[\]** [Human Mesh Recovery from Multiple Shots.](https://geopavlakos.github.io/multishot/) Georgios Pavlakos Jitendra Malik Angjoo Kanazawa *CVPR 2022*
**One Sentence Summary**: Using SPIN with smoothness term of canonical frame to get the ground-truth. Then using the ground-truth pose to train temporal HMR with transformer.

### For Multiple Persons

* **\[\]** [Monocular 3d pose and shape estimation of multiple people in natural scenes-the importance of multiple scene constraints.](https://openaccess.thecvf.com/content_cvpr_2018/papers/Zanfir_Monocular_3D_Pose_CVPR_2018_paper.pdf) Zanfir A, Marinoiu E, Sminchisescu C. *CVPR 2018*
**One Sentence Summary**: Optimizing the pose of multiple person with the plane-groud assumption, occupancy avoidance, and temporal smoothness.

* **\[HMOR\]** [HMOR: Hierarchical Multi-Person Ordinal Relations for Monocular Multi-Person 3D Pose Estimation.](https://www.ecva.net/papers/eccv_2020/papers_ECCV/papers/123480256.pdf) Can Wang, Jiefeng Li, Wentao Liu, Chen Qian, and Cewu Lu. *ECCV 2020*
**One Sentence Summary**: Improving multi-person 3D pose estimation using multi-person interaction relations considering instance and joints depth relations and body parts angle relations.

* **\[\]** [Coherent Reconstruction of Multiple Humans from a Single Image.](https://openaccess.thecvf.com/content_CVPR_2020/papers/Jiang_Coherent_Reconstruction_of_Multiple_Humans_From_a_Single_Image_CVPR_2020_paper.pdf) Wen Jiang, Nikos Kolotouros, Georgios Pavlakos, Xiaowei Zhou, and Kostas Daniilidis. *CVPR 2020*
**One Sentence Summary**: Training the network to estimate the SMPL parameters of all multiple persons with interpenetration loss (based on SDF) and depth-aware loss (based on instance segmentation.).

* **\[BMP\]** [Body Meshes as Points.](https://github.com/jfzhang95/BMP) Zhang, Jianfeng and Yu, Dongdong and Liew, Jun Hao and Nie, Xuecheng and Feng, Jiashi *CVPR 2021*
**One Sentence Summary**: One-stage model to estimate multiple persons' 3D body using the similar idea as one stage detection。

* **\[ROMP\]** [Monocular, One-stage, Regression of Multiple 3D People.](https://github.com/Arthur151/ROMP) Sun, Yu and Bao, Qian and Liu, Wu and Fu, Yili and Michael J., Black and Mei, Tao. *ICCV 2021*
**One Sentence Summary**: One-stage model to estimate multiple persons' 3D body using the similar idea as CenterNet.

* **\[BEV\]** [Putting People in their Place: Monocular Regression of 3D People in Depth.](https://arthur151.github.io/BEV/BEV.html) Sun, Yu and Liu, Wu and Bao, Qian and Fu, Yili and Mei, Tao and Black, Michael J.. *CVPR 2022*
**One Sentence Summary**: Age-aware SMPL model (adding age-related offsets) + estimating bird's eye view feature to help to refine the depth relationship for multiple subjects.

### Considering Environment

* **\[\]** [Resolving 3D Human Pose Ambiguities with 3D Scene Constraints.](https://prox.is.tue.mpg.de) Mohamed Hassan, Vasileios Choutas, Dimitrios Tzionas and Michael J. Black *ICCV 2019*
**One Sentence Summary**: Improving the recovery of 3D human in a given 3D scene by considering the interation of human and scene layouts.

* **\[\]** [The One Where They Reconstructed 3D Humans and Environments in TV Shows.](https://ethanweber.me/sitcoms3D/) Georgios Pavlakos and Ethan Weber and and Matthew Tancik and Angjoo Kanazawa *ECCV 2022*
**One Sentence Summary**: Improving the recovery of 3D human for TV shows by reconstructing the environment and estimating the camera and body scale information.

### Beyond Body Models (meshes, voxel, and etc.)

* **\[BodyNet\]** [BodyNet: Volumetric Inference of 3D Human Body Shapes.](https://github.com/gulvarol/bodynet) Gül Varol, Duygu Ceylan, Bryan Russell, Jimei Yang, Ersin Yumer, Ivan Laptev and Cordelia Schmid. *ECCV 2018*
**One Sentence Summary**: Volumetric Inference with the supervision of 2d & 3d keypoints, segmentations and voxelized SMPL model.

* **\[I2L-MeshNet\]** [I2L-MeshNet: Image-to-Lixel Prediction Network for Accurate 3D Human Pose and Mesh Estimation from a Single RGB Image.](https://github.com/mks0601/I2L-MeshNet_RELEASE) Gyeongsik Moon and Kyoung Mu Lee. *ECCV 2020*
**One Sentence Summary**: Regressing 2d & 3d key points firstly, and then regressing the mesh directly.

## Applications

### Tracking
* **\[HMAR\]** [Tracking People with 3D Representations.](http://people.eecs.berkeley.edu/~jathushan/T3DP/) Jathushan Rajasegaran, Georgios Pavlakos, Angjoo Kanazawa, and Jitendra Malik *NeurIPS 2021*
**One Sentence Summary**: HMR + texture recovery (using appearance flow) for human tracking.

* **\[PHALP\]** [Tracking People by Predicting 3D Appearance, Location and Pose.](http://people.eecs.berkeley.edu/~jathushan/PHALP/) Rajasegaran, Jathushan and Pavlakos, Georgios and Kanazawa, Angjoo and Malik, Jitendra *CVPR 2022*
**One Sentence Summary**: Predicting and matching HMAR estimation for tracking in videos.

### Motion prediction
* **\[PHD\]** [Predicting 3D Human Dynamics from Video.](https://openaccess.thecvf.com/content_ICCV_2019/papers/Zhang_Predicting_3D_Human_Dynamics_From_Video_ICCV_2019_paper.pdf) Jason Y. Zhang, Panna Felsen, Angjoo Kanazawa, Jitendra Malik *ICCV 2019*
**One Sentence Summary**: "Learning 3D Human Dynamics from Video" for 3D motion prediction.

## Datasets

* [SURREAL](https://www.di.ens.fr/willow/research/surreal/)
* [AGORA](https://agora.is.tue.mpg.de/)
* [THUman](https://github.com/ZhengZerong/DeepHuman/tree/master/THUmanDataset)
* [THUman2.0](https://github.com/ytrock/THuman2.0-Dataset)
* [ZJU-Mocap](https://github.com/zju3dv/neuralbody/blob/master/INSTALL.md#zju-mocap-dataset)
* [Human3.6M](https://github.com/kotaro-inoue/human3.6m_downloader)
* [MultiHuman](https://github.com/y-zheng18/MultiHuman-Dataset/)
* [3DPW](https://virtualhumans.mpi-inf.mpg.de/3DPW/)
* [HUMBI](https://humbi-data.net)
* [EFT](https://github.com/facebookresearch/eft) (Fitting data)

## Related Topic
* [Detailed 3D Human Recovery \(Clothing\)](https://github.com/weihaox/awesome-clothed-human)
* [3D Face](https://paperswithcode.com/task/3d-face-reconstruction)
* [3D Hand](https://github.com/SeanChenxy/Hand3DResearch)
* [Human Pose Detection](https://github.com/cbsudux/awesome-human-pose-estimation)

## Related Research Groups
[Michael Black](http://ps.is.mpg.de) (Max Planck Institute for Intelligent Systems)
[Yebin Liu](http://www.liuyebin.com) (Tsinghua University)
[Kyoung Mu Lee](https://cv.snu.ac.kr/index.php/~kmlee/) (Seoul National University)
[Yaser Sheikh](https://scholar.google.com/citations?user=Yd4KvooAAAAJ&hl=en) (Carnegie Mellon University, Facebook Reality Labs)
[Angjoo Kanazawa](https://people.eecs.berkeley.edu/~kanazawa/) (University of California, Berkeley)
[Kostas Daniilidis](http://www.cis.upenn.edu/~kostas) (University of Pennsylvania)
[Xiaowei Zhou](https://xzhou.me) (Zhejiang University)
[Siyu Tang](https://vlg.inf.ethz.ch) (ETH Zürich)