TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers/Spatio-temporal Tendency Reasoning for Human Body Pose and...

Spatio-temporal Tendency Reasoning for Human Body Pose and Shape Estimation from Videos

Boyang Zhang, Suping Wu, Hu Cao, Kehua Ma, Pan Li, Lei Lin

2022-10-073D Human Pose EstimationTemporal Sequences
PaperPDF

Abstract

In this paper, we present a spatio-temporal tendency reasoning (STR) network for recovering human body pose and shape from videos. Previous approaches have focused on how to extend 3D human datasets and temporal-based learning to promote accuracy and temporal smoothing. Different from them, our STR aims to learn accurate and natural motion sequences in an unconstrained environment through temporal and spatial tendency and to fully excavate the spatio-temporal features of existing video data. To this end, our STR learns the representation of features in the temporal and spatial dimensions respectively, to concentrate on a more robust representation of spatio-temporal features. More specifically, for efficient temporal modeling, we first propose a temporal tendency reasoning (TTR) module. TTR constructs a time-dimensional hierarchical residual connection representation within a video sequence to effectively reason temporal sequences' tendencies and retain effective dissemination of human information. Meanwhile, for enhancing the spatial representation, we design a spatial tendency enhancing (STE) module to further learns to excite spatially time-frequency domain sensitive features in human motion information representations. Finally, we introduce integration strategies to integrate and refine the spatio-temporal feature representations. Extensive experimental findings on large-scale publically available datasets reveal that our STR remains competitive with the state-of-the-art on three datasets. Our code are available at https://github.com/Changboyang/STR.git.

Results

TaskDatasetMetricValueModel
3D Human Pose EstimationMPI-INF-3DHPAcceleration Error8.4STR
3D Human Pose EstimationMPI-INF-3DHPMPJPE95.3STR
3D Human Pose EstimationMPI-INF-3DHPPA-MPJPE61.6STR
3D Human Pose Estimation3DPWAcceleration Error6.9STR
3D Human Pose Estimation3DPWMPJPE85.2STR
3D Human Pose Estimation3DPWMPVPE101.2STR
3D Human Pose Estimation3DPWPA-MPJPE52.4STR
Pose EstimationMPI-INF-3DHPAcceleration Error8.4STR
Pose EstimationMPI-INF-3DHPMPJPE95.3STR
Pose EstimationMPI-INF-3DHPPA-MPJPE61.6STR
Pose Estimation3DPWAcceleration Error6.9STR
Pose Estimation3DPWMPJPE85.2STR
Pose Estimation3DPWMPVPE101.2STR
Pose Estimation3DPWPA-MPJPE52.4STR
3DMPI-INF-3DHPAcceleration Error8.4STR
3DMPI-INF-3DHPMPJPE95.3STR
3DMPI-INF-3DHPPA-MPJPE61.6STR
3D3DPWAcceleration Error6.9STR
3D3DPWMPJPE85.2STR
3D3DPWMPVPE101.2STR
3D3DPWPA-MPJPE52.4STR
1 Image, 2*2 StitchiMPI-INF-3DHPAcceleration Error8.4STR
1 Image, 2*2 StitchiMPI-INF-3DHPMPJPE95.3STR
1 Image, 2*2 StitchiMPI-INF-3DHPPA-MPJPE61.6STR
1 Image, 2*2 Stitchi3DPWAcceleration Error6.9STR
1 Image, 2*2 Stitchi3DPWMPJPE85.2STR
1 Image, 2*2 Stitchi3DPWMPVPE101.2STR
1 Image, 2*2 Stitchi3DPWPA-MPJPE52.4STR

Related Papers

Systematic Comparison of Projection Methods for Monocular 3D Human Pose Estimation on Fisheye Images2025-06-24Hunyuan-GameCraft: High-dynamic Interactive Game Video Generation with Hybrid History Condition2025-06-20TrajSceneLLM: A Multimodal Perspective on Semantic GPS Trajectory Analysis2025-06-19ExtPose: Robust and Coherent Pose Estimation by Extending ViTs2025-06-18PoseGRAF: Geometric-Reinforced Adaptive Fusion for Monocular 3D Human Pose Estimation2025-06-17APVR: Hour-Level Long Video Understanding with Adaptive Pivot Visual Information Retrieval2025-06-05Learning Pyramid-structured Long-range Dependencies for 3D Human Pose Estimation2025-06-03Time Blindness: Why Video-Language Models Can't See What Humans Can?2025-05-30