Hand Models¶
Distribution-aware coordinate representation for human pose estimation¶
Introduction¶
@inproceedings{zhang2020distribution,
title={Distribution-aware coordinate representation for human pose estimation},
author={Zhang, Feng and Zhu, Xiatian and Dai, Hanbin and Ye, Mao and Zhu, Ce},
booktitle={Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition},
pages={7093--7102},
year={2020}
}
Results and models¶
2d Hand Keypoint Estimation¶
Results on OneHand10K val set¶
Arch | Input Size | PCK@0.2 | AUC | EPE | ckpt | log |
---|---|---|---|---|---|---|
pose_hrnetv2_w18_dark | 256x256 | 0.990 | 0.573 | 23.84 | ckpt | log |
Results on CMU Panoptic (MPII+NZSL val set)¶
Arch | Input Size | PCKh@0.7 | AUC | EPE | ckpt | log |
---|---|---|---|---|---|---|
pose_hrnetv2_w18_dark | 256x256 | 0.999 | 0.745 | 7.77 | ckpt | log |
Results on RHD test set¶
Arch | Input Size | PCK@0.2 | AUC | EPE | ckpt | log |
---|---|---|---|---|---|---|
pose_hrnetv2_w18_dark | 256x256 | 0.992 | 0.903 | 2.17 | ckpt | log |
Deeppose: Human pose estimation via deep neural networks¶
Introduction¶
@inproceedings{toshev2014deeppose,
title={Deeppose: Human pose estimation via deep neural networks},
author={Toshev, Alexander and Szegedy, Christian},
booktitle={Proceedings of the IEEE conference on computer vision and pattern recognition},
pages={1653--1660},
year={2014}
}
Results and models¶
2d Hand Keypoint Estimation¶
Results on OneHand10K val set¶
Arch | Input Size | PCK@0.2 | AUC | EPE | ckpt | log |
---|---|---|---|---|---|---|
deeppose_resnet_50 | 256x256 | 0.990 | 0.486 | 34.28 | ckpt | log |
Results on CMU Panoptic (MPII+NZSL val set)¶
Arch | Input Size | PCKh@0.7 | AUC | EPE | ckpt | log |
---|---|---|---|---|---|---|
deeppose_resnet_50 | 256x256 | 0.999 | 0.686 | 9.36 | ckpt | log |
Results on RHD test set¶
Arch | Input Size | PCK@0.2 | AUC | EPE | ckpt | log |
---|---|---|---|---|---|---|
deeppose_resnet_50 | 256x256 | 0.988 | 0.865 | 3.29 | ckpt | log |
Deep high-resolution representation learning for visual recognition¶
Introduction¶
@article{WangSCJDZLMTWLX19,
title={Deep High-Resolution Representation Learning for Visual Recognition},
author={Jingdong Wang and Ke Sun and Tianheng Cheng and
Borui Jiang and Chaorui Deng and Yang Zhao and Dong Liu and Yadong Mu and
Mingkui Tan and Xinggang Wang and Wenyu Liu and Bin Xiao},
journal = {TPAMI}
year={2019}
}
Results and models¶
2d Hand Keypoint Estimation¶
Results on OneHand10K val set¶
Arch | Input Size | PCK@0.2 | AUC | EPE | ckpt | log |
---|---|---|---|---|---|---|
pose_hrnetv2_w18 | 256x256 | 0.990 | 0.568 | 24.16 | ckpt | log |
Results on CMU Panoptic (MPII+NZSL val set)¶
Arch | Input Size | PCKh@0.7 | AUC | EPE | ckpt | log |
---|---|---|---|---|---|---|
pose_hrnetv2_w18 | 256x256 | 0.999 | 0.744 | 7.79 | ckpt | log |
Results on RHD test set¶
Arch | Input Size | PCK@0.2 | AUC | EPE | ckpt | log |
---|---|---|---|---|---|---|
pose_hrnetv2_w18 | 256x256 | 0.992 | 0.902 | 2.21 | ckpt | log |
Mobilenetv2: Inverted residuals and linear bottlenecks¶
Introduction¶
@inproceedings{sandler2018mobilenetv2,
title={Mobilenetv2: Inverted residuals and linear bottlenecks},
author={Sandler, Mark and Howard, Andrew and Zhu, Menglong and Zhmoginov, Andrey and Chen, Liang-Chieh},
booktitle={Proceedings of the IEEE conference on computer vision and pattern recognition},
pages={4510--4520},
year={2018}
}
Results and models¶
2d Hand Pose Estimation¶
Results on OneHand10K val set¶
Arch | Input Size | PCK@0.2 | AUC | EPE | ckpt | log |
---|---|---|---|---|---|---|
pose_mobilenet_v2 | 256x256 | 0.986 | 0.537 | 28.60 | ckpt | log |
Results on CMU Panoptic (MPII+NZSL val set)¶
Arch | Input Size | PCKh@0.7 | AUC | EPE | ckpt | log |
---|---|---|---|---|---|---|
pose_mobilenet_v2 | 256x256 | 0.998 | 0.694 | 9.70 | ckpt | log |
Results on RHD test set¶
Arch | Input Size | PCK@0.2 | AUC | EPE | ckpt | log |
---|---|---|---|---|---|---|
pose_mobilenet_v2 | 256x256 | 0.985 | 0.883 | 2.80 | ckpt | log |
Simple baselines for human pose estimation and tracking¶
Introduction¶
@inproceedings{xiao2018simple,
title={Simple baselines for human pose estimation and tracking},
author={Xiao, Bin and Wu, Haiping and Wei, Yichen},
booktitle={Proceedings of the European conference on computer vision (ECCV)},
pages={466--481},
year={2018}
}
Results and models¶
2d Hand Pose Estimation¶
Results on OneHand10K val set¶
Arch | Input Size | PCK@0.2 | AUC | EPE | ckpt | log |
---|---|---|---|---|---|---|
pose_resnet_50 | 256x256 | 0.989 | 0.555 | 25.19 | ckpt | log |
Results on FreiHand val & test set¶
Set | Arch | Input Size | PCK@0.2 | AUC | EPE | ckpt | log |
---|---|---|---|---|---|---|---|
val | pose_resnet_50 | 224x224 | 0.993 | 0.868 | 3.25 | ckpt | log |
test | pose_resnet_50 | 224x224 | 0.992 | 0.868 | 3.27 | ckpt | log |
Results on CMU Panoptic (MPII+NZSL val set)¶
Arch | Input Size | PCKh@0.7 | AUC | EPE | ckpt | log |
---|---|---|---|---|---|---|
pose_resnet_50 | 256x256 | 0.999 | 0.713 | 9.00 | ckpt | log |
Results on InterHand2.6M val & test set¶
Train Set | Set | Arch | Input Size | PCK@0.2 | AUC | EPE | ckpt | log |
---|---|---|---|---|---|---|---|---|
Human_annot | val(M) | pose_resnet_50 | 256x256 | 0.973 | 0.828 | 5.15 | ckpt | log |
Human_annot | test(H) | pose_resnet_50 | 256x256 | 0.973 | 0.826 | 5.27 | ckpt | log |
Human_annot | test(M) | pose_resnet_50 | 256x256 | 0.975 | 0.841 | 4.90 | ckpt | log |
Human_annot | test(H+M) | pose_resnet_50 | 256x256 | 0.975 | 0.839 | 4.97 | ckpt | log |
Machine_annot | val(M) | pose_resnet_50 | 256x256 | 0.970 | 0.824 | 5.39 | ckpt | log |
Machine_annot | test(H) | pose_resnet_50 | 256x256 | 0.969 | 0.821 | 5.52 | ckpt | log |
Machine_annot | test(M) | pose_resnet_50 | 256x256 | 0.972 | 0.838 | 5.03 | ckpt | log |
Machine_annot | test(H+M) | pose_resnet_50 | 256x256 | 0.972 | 0.837 | 5.11 | ckpt | log |
All | val(M) | pose_resnet_50 | 256x256 | 0.977 | 0.840 | 4.66 | ckpt | log |
All | test(H) | pose_resnet_50 | 256x256 | 0.979 | 0.839 | 4.65 | ckpt | log |
All | test(M) | pose_resnet_50 | 256x256 | 0.979 | 0.838 | 4.42 | ckpt | log |
All | test(H+M) | pose_resnet_50 | 256x256 | 0.979 | 0.851 | 4.46 | ckpt | log |
Results on RHD test set¶
Arch | Input Size | PCK@0.2 | AUC | EPE | ckpt | log |
---|---|---|---|---|---|---|
pose_resnet_50 | 224x224 | 0.992 | 0.896 | 2.38 | ckpt | log |
pose_resnet_50 | 256x256 | 0.991 | 0.898 | 2.33 | ckpt | log |
The Devil is in the Details: Delving into Unbiased Data Processing for Human Pose Estimation¶
Introduction¶
@InProceedings{Huang_2020_CVPR,
author = {Huang, Junjie and Zhu, Zheng and Guo, Feng and Huang, Guan},
title = {The Devil Is in the Details: Delving Into Unbiased Data Processing for Human Pose Estimation},
booktitle = {The IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)},
month = {June},
year = {2020}
}
Note that, UDP also adopts the unbiased encoding/decoding algorithm of DARK.
Results and models¶
2d Hand Keypoint Estimation¶
Results on OneHand10K val set¶
Arch | Input Size | PCK@0.2 | AUC | EPE | ckpt | log |
---|---|---|---|---|---|---|
pose_hrnetv2_w18_udp | 256x256 | 0.990 | 0.572 | 23.87 | ckpt | log |
Results on CMU Panoptic (MPII+NZSL val set)¶
Arch | Input Size | PCKh@0.7 | AUC | EPE | ckpt | log |
---|---|---|---|---|---|---|
pose_hrnetv2_w18_udp | 256x256 | 0.998 | 0.742 | 7.84 | ckpt | log |
Results on RHD test set¶
Arch | Input Size | PCK@0.2 | AUC | EPE | ckpt | log |
---|---|---|---|---|---|---|
pose_hrnetv2_w18_udp | 256x256 | 0.992 | 0.902 | 2.21 | ckpt | log |