TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers/Efficient Human Pose Estimation via 3D Event Point Cloud

Efficient Human Pose Estimation via 3D Event Point Cloud

Jiaan Chen, Hao Shi, Yaozu Ye, Kailun Yang, Lei Sun, Kaiwei Wang

2022-06-093D Human Pose EstimationPose EstimationPlaying the Game of 2048
PaperPDFCode(official)

Abstract

Human Pose Estimation (HPE) based on RGB images has experienced a rapid development benefiting from deep learning. However, event-based HPE has not been fully studied, which remains great potential for applications in extreme scenes and efficiency-critical conditions. In this paper, we are the first to estimate 2D human pose directly from 3D event point cloud. We propose a novel representation of events, the rasterized event point cloud, aggregating events on the same position of a small time slice. It maintains the 3D features from multiple statistical cues and significantly reduces memory consumption and computation complexity, proved to be efficient in our work. We then leverage the rasterized event point cloud as input to three different backbones, PointNet, DGCNN, and Point Transformer, with two linear layer decoders to predict the location of human keypoints. We find that based on our method, PointNet achieves promising results with much faster speed, whereas Point Transfomer reaches much higher accuracy, even close to previous event-frame-based methods. A comprehensive set of results demonstrates that our proposed method is consistently effective for these 3D backbone models in event-driven human pose estimation. Our method based on PointNet with 2048 points input achieves 82.46mm in MPJPE3D on the DHP19 dataset, while only has a latency of 12.29ms on an NVIDIA Jetson Xavier NX edge computing platform, which is ideally suitable for real-time detection with event cameras. Code is available at https://github.com/MasterHow/EventPointPose.

Results

TaskDatasetMetricValueModel
3D Human Pose EstimationDHP19GFLOPs10.06Point Transformer
3D Human Pose EstimationDHP19MPJPE2D6.46Point Transformer
3D Human Pose EstimationDHP19MPJPE3D73.37Point Transformer
3D Human Pose EstimationDHP19Params (M)3.65Point Transformer
Pose EstimationDHP19GFLOPs10.06Point Transformer
Pose EstimationDHP19MPJPE2D6.46Point Transformer
Pose EstimationDHP19MPJPE3D73.37Point Transformer
Pose EstimationDHP19Params (M)3.65Point Transformer
3DDHP19GFLOPs10.06Point Transformer
3DDHP19MPJPE2D6.46Point Transformer
3DDHP19MPJPE3D73.37Point Transformer
3DDHP19Params (M)3.65Point Transformer
1 Image, 2*2 StitchiDHP19GFLOPs10.06Point Transformer
1 Image, 2*2 StitchiDHP19MPJPE2D6.46Point Transformer
1 Image, 2*2 StitchiDHP19MPJPE3D73.37Point Transformer
1 Image, 2*2 StitchiDHP19Params (M)3.65Point Transformer

Related Papers

$π^3$: Scalable Permutation-Equivariant Visual Geometry Learning2025-07-17Revisiting Reliability in the Reasoning-based Pose Estimation Benchmark2025-07-17DINO-VO: A Feature-based Visual Odometry Leveraging a Visual Foundation Model2025-07-17From Neck to Head: Bio-Impedance Sensing for Head Pose Estimation2025-07-17AthleticsPose: Authentic Sports Motion Dataset on Athletic Field and Evaluation of Monocular 3D Pose Estimation Ability2025-07-17SpatialTrackerV2: 3D Point Tracking Made Easy2025-07-16SGLoc: Semantic Localization System for Camera Pose Estimation from 3D Gaussian Splatting Representation2025-07-16Efficient Calisthenics Skills Classification through Foreground Instance Selection and Depth Estimation2025-07-16