TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers/Refined Temporal Pyramidal Compression-and-Amplification T...

Refined Temporal Pyramidal Compression-and-Amplification Transformer for 3D Human Pose Estimation

Hanbing Liu, Wangmeng Xiang, Jun-Yan He, Zhi-Qi Cheng, Bin Luo, Yifeng Geng, Xuansong Xie

2023-09-043D Human Pose EstimationPose Estimation
PaperPDFCode(official)

Abstract

Accurately estimating the 3D pose of humans in video sequences requires both accuracy and a well-structured architecture. With the success of transformers, we introduce the Refined Temporal Pyramidal Compression-and-Amplification (RTPCA) transformer. Exploiting the temporal dimension, RTPCA extends intra-block temporal modeling via its Temporal Pyramidal Compression-and-Amplification (TPCA) structure and refines inter-block feature interaction with a Cross-Layer Refinement (XLR) module. In particular, TPCA block exploits a temporal pyramid paradigm, reinforcing key and value representation capabilities and seamlessly extracting spatial semantics from motion sequences. We stitch these TPCA blocks with XLR that promotes rich semantic representation through continuous interaction of queries, keys, and values. This strategy embodies early-stage information with current flows, addressing typical deficits in detail and stability seen in other transformer-based methods. We demonstrate the effectiveness of RTPCA by achieving state-of-the-art results on Human3.6M, HumanEva-I, and MPI-INF-3DHP benchmarks with minimal computational overhead. The source code is available at https://github.com/hbing-l/RTPCA.

Results

TaskDatasetMetricValueModel
3D Human Pose EstimationHumanEva-IMean Reconstruction Error (mm)19.1RTPCA
3D Human Pose EstimationMPI-INF-3DHPAUC74.2RTPCA
3D Human Pose EstimationMPI-INF-3DHPMPJPE40.5RTPCA
3D Human Pose EstimationMPI-INF-3DHPPCK98.8RTPCA
Pose EstimationHumanEva-IMean Reconstruction Error (mm)19.1RTPCA
Pose EstimationMPI-INF-3DHPAUC74.2RTPCA
Pose EstimationMPI-INF-3DHPMPJPE40.5RTPCA
Pose EstimationMPI-INF-3DHPPCK98.8RTPCA
3DHumanEva-IMean Reconstruction Error (mm)19.1RTPCA
3DMPI-INF-3DHPAUC74.2RTPCA
3DMPI-INF-3DHPMPJPE40.5RTPCA
3DMPI-INF-3DHPPCK98.8RTPCA
1 Image, 2*2 StitchiHumanEva-IMean Reconstruction Error (mm)19.1RTPCA
1 Image, 2*2 StitchiMPI-INF-3DHPAUC74.2RTPCA
1 Image, 2*2 StitchiMPI-INF-3DHPMPJPE40.5RTPCA
1 Image, 2*2 StitchiMPI-INF-3DHPPCK98.8RTPCA

Related Papers

$π^3$: Scalable Permutation-Equivariant Visual Geometry Learning2025-07-17Revisiting Reliability in the Reasoning-based Pose Estimation Benchmark2025-07-17DINO-VO: A Feature-based Visual Odometry Leveraging a Visual Foundation Model2025-07-17From Neck to Head: Bio-Impedance Sensing for Head Pose Estimation2025-07-17AthleticsPose: Authentic Sports Motion Dataset on Athletic Field and Evaluation of Monocular 3D Pose Estimation Ability2025-07-17SpatialTrackerV2: 3D Point Tracking Made Easy2025-07-16SGLoc: Semantic Localization System for Camera Pose Estimation from 3D Gaussian Splatting Representation2025-07-16Efficient Calisthenics Skills Classification through Foreground Instance Selection and Depth Estimation2025-07-16