TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers/DeepFuse: An IMU-Aware Network for Real-Time 3D Human Pose...

DeepFuse: An IMU-Aware Network for Real-Time 3D Human Pose Estimation from Multi-View Image

Fuyang Huang, Ailing Zeng, Minhao Liu, Qiuxia Lai, Qiang Xu

2019-12-093D Human Pose EstimationPose Estimation3D Pose Estimation
PaperPDF

Abstract

In this paper, we propose a two-stage fully 3D network, namely \textbf{DeepFuse}, to estimate human pose in 3D space by fusing body-worn Inertial Measurement Unit (IMU) data and multi-view images deeply. The first stage is designed for pure vision estimation. To preserve data primitiveness of multi-view inputs, the vision stage uses multi-channel volume as data representation and 3D soft-argmax as activation layer. The second one is the IMU refinement stage which introduces an IMU-bone layer to fuse the IMU and vision data earlier at data level. without requiring a given skeleton model a priori, we can achieve a mean joint error of $28.9$mm on TotalCapture dataset and $13.4$mm on Human3.6M dataset under protocol 1, improving the SOTA result by a large margin. Finally, we discuss the effectiveness of a fully 3D network for 3D pose estimation experimentally which may benefit future research.

Results

TaskDatasetMetricValueModel
3D Human Pose EstimationTotal CaptureAverage MPJPE (mm)28.9DeepFuse-IMU
3D Human Pose EstimationTotal CaptureAverage MPJPE (mm)32.7DeepFuse-Vision Only
Pose EstimationTotal CaptureAverage MPJPE (mm)28.9DeepFuse-IMU
Pose EstimationTotal CaptureAverage MPJPE (mm)32.7DeepFuse-Vision Only
3DTotal CaptureAverage MPJPE (mm)28.9DeepFuse-IMU
3DTotal CaptureAverage MPJPE (mm)32.7DeepFuse-Vision Only
1 Image, 2*2 StitchiTotal CaptureAverage MPJPE (mm)28.9DeepFuse-IMU
1 Image, 2*2 StitchiTotal CaptureAverage MPJPE (mm)32.7DeepFuse-Vision Only

Related Papers

$π^3$: Scalable Permutation-Equivariant Visual Geometry Learning2025-07-17Revisiting Reliability in the Reasoning-based Pose Estimation Benchmark2025-07-17DINO-VO: A Feature-based Visual Odometry Leveraging a Visual Foundation Model2025-07-17From Neck to Head: Bio-Impedance Sensing for Head Pose Estimation2025-07-17AthleticsPose: Authentic Sports Motion Dataset on Athletic Field and Evaluation of Monocular 3D Pose Estimation Ability2025-07-17SpatialTrackerV2: 3D Point Tracking Made Easy2025-07-16SGLoc: Semantic Localization System for Camera Pose Estimation from 3D Gaussian Splatting Representation2025-07-16Efficient Calisthenics Skills Classification through Foreground Instance Selection and Depth Estimation2025-07-16