TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers/Learning Unorthogonalized Matrices for Rotation Estimation

Learning Unorthogonalized Matrices for Rotation Estimation

Kerui Gu, Zhihao LI, Shiyong Liu, Jianzhuang Liu, Songcen Xu, Youliang Yan, Michael Bi Mi, Kenji Kawaguchi, Angela Yao

2023-12-013D Human Pose EstimationPose Estimation
PaperPDF

Abstract

Estimating 3D rotations is a common procedure for 3D computer vision. The accuracy depends heavily on the rotation representation. One form of representation -- rotation matrices -- is popular due to its continuity, especially for pose estimation tasks. The learning process usually incorporates orthogonalization to ensure orthonormal matrices. Our work reveals, through gradient analysis, that common orthogonalization procedures based on the Gram-Schmidt process and singular value decomposition will slow down training efficiency. To this end, we advocate removing orthogonalization from the learning process and learning unorthogonalized `Pseudo' Rotation Matrices (PRoM). An optimization analysis shows that PRoM converges faster and to a better solution. By replacing the orthogonalization incorporated representation with our proposed PRoM in various rotation-related tasks, we achieve state-of-the-art results on large-scale benchmarks for human pose estimation.

Results

TaskDatasetMetricValueModel
3D Human Pose Estimation3DPWMPJPE67.6PROM (CLIFF)
3D Human Pose Estimation3DPWMPVPE79.2PROM (CLIFF)
3D Human Pose Estimation3DPWPA-MPJPE42PROM (CLIFF)
Pose Estimation3DPWMPJPE67.6PROM (CLIFF)
Pose Estimation3DPWMPVPE79.2PROM (CLIFF)
Pose Estimation3DPWPA-MPJPE42PROM (CLIFF)
3D3DPWMPJPE67.6PROM (CLIFF)
3D3DPWMPVPE79.2PROM (CLIFF)
3D3DPWPA-MPJPE42PROM (CLIFF)
1 Image, 2*2 Stitchi3DPWMPJPE67.6PROM (CLIFF)
1 Image, 2*2 Stitchi3DPWMPVPE79.2PROM (CLIFF)
1 Image, 2*2 Stitchi3DPWPA-MPJPE42PROM (CLIFF)

Related Papers

$π^3$: Scalable Permutation-Equivariant Visual Geometry Learning2025-07-17Revisiting Reliability in the Reasoning-based Pose Estimation Benchmark2025-07-17DINO-VO: A Feature-based Visual Odometry Leveraging a Visual Foundation Model2025-07-17From Neck to Head: Bio-Impedance Sensing for Head Pose Estimation2025-07-17AthleticsPose: Authentic Sports Motion Dataset on Athletic Field and Evaluation of Monocular 3D Pose Estimation Ability2025-07-17SpatialTrackerV2: 3D Point Tracking Made Easy2025-07-16SGLoc: Semantic Localization System for Camera Pose Estimation from 3D Gaussian Splatting Representation2025-07-16Efficient Calisthenics Skills Classification through Foreground Instance Selection and Depth Estimation2025-07-16