TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers/Deep Two-Stream Video Inference for Human Body Pose and Sh...

Deep Two-Stream Video Inference for Human Body Pose and Shape Estimation

Ziwen Li, Bo Xu, Han Huang, Cheng Lu, Yandong Guo

2021-10-223D Human Pose EstimationOptical Flow Estimation
PaperPDF

Abstract

Several video-based 3D pose and shape estimation algorithms have been proposed to resolve the temporal inconsistency of single-image-based methods. However it still remains challenging to have stable and accurate reconstruction. In this paper, we propose a new framework Deep Two-Stream Video Inference for Human Body Pose and Shape Estimation (DTS-VIBE), to generate 3D human pose and mesh from RGB videos. We reformulate the task as a multi-modality problem that fuses RGB and optical flow for more reliable estimation. In order to fully utilize both sensory modalities (RGB or optical flow), we train a two-stream temporal network based on transformer to predict SMPL parameters. The supplementary modality, optical flow, helps to maintain temporal consistency by leveraging motion knowledge between two consecutive frames. The proposed algorithm is extensively evaluated on the Human3.6 and 3DPW datasets. The experimental results show that it outperforms other state-of-the-art methods by a significant margin.

Results

TaskDatasetMetricValueModel
3D Human Pose EstimationMPI-INF-3DHPAcceleration Error11.9DST-VIBE
3D Human Pose EstimationMPI-INF-3DHPMPJPE93.4DST-VIBE
3D Human Pose EstimationMPI-INF-3DHPPA-MPJPE62.2DST-VIBE
3D Human Pose Estimation3DPWAcceleration Error11DST-VIBE
3D Human Pose Estimation3DPWMPJPE76.7DST-VIBE
3D Human Pose Estimation3DPWMPVPE93.5DST-VIBE
3D Human Pose Estimation3DPWPA-MPJPE50.3DST-VIBE
Pose EstimationMPI-INF-3DHPAcceleration Error11.9DST-VIBE
Pose EstimationMPI-INF-3DHPMPJPE93.4DST-VIBE
Pose EstimationMPI-INF-3DHPPA-MPJPE62.2DST-VIBE
Pose Estimation3DPWAcceleration Error11DST-VIBE
Pose Estimation3DPWMPJPE76.7DST-VIBE
Pose Estimation3DPWMPVPE93.5DST-VIBE
Pose Estimation3DPWPA-MPJPE50.3DST-VIBE
3DMPI-INF-3DHPAcceleration Error11.9DST-VIBE
3DMPI-INF-3DHPMPJPE93.4DST-VIBE
3DMPI-INF-3DHPPA-MPJPE62.2DST-VIBE
3D3DPWAcceleration Error11DST-VIBE
3D3DPWMPJPE76.7DST-VIBE
3D3DPWMPVPE93.5DST-VIBE
3D3DPWPA-MPJPE50.3DST-VIBE
1 Image, 2*2 StitchiMPI-INF-3DHPAcceleration Error11.9DST-VIBE
1 Image, 2*2 StitchiMPI-INF-3DHPMPJPE93.4DST-VIBE
1 Image, 2*2 StitchiMPI-INF-3DHPPA-MPJPE62.2DST-VIBE
1 Image, 2*2 Stitchi3DPWAcceleration Error11DST-VIBE
1 Image, 2*2 Stitchi3DPWMPJPE76.7DST-VIBE
1 Image, 2*2 Stitchi3DPWMPVPE93.5DST-VIBE
1 Image, 2*2 Stitchi3DPWPA-MPJPE50.3DST-VIBE

Related Papers

Channel-wise Motion Features for Efficient Motion Segmentation2025-07-17An Efficient Approach for Muscle Segmentation and 3D Reconstruction Using Keypoint Tracking in MRI Scan2025-07-11Learning to Track Any Points from Human Motion2025-07-08TLB-VFI: Temporal-Aware Latent Brownian Bridge Diffusion for Video Frame Interpolation2025-07-07MEMFOF: High-Resolution Training for Memory-Efficient Multi-Frame Optical Flow Estimation2025-06-29EndoFlow-SLAM: Real-Time Endoscopic SLAM with Flow-Constrained Gaussian Splatting2025-06-26WAFT: Warping-Alone Field Transforms for Optical Flow2025-06-26Feature Hallucination for Self-supervised Action Recognition2025-06-25