TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers/Differentiable Hierarchical Graph Grouping for Multi-Perso...

Differentiable Hierarchical Graph Grouping for Multi-Person Pose Estimation

Sheng Jin, Wentao Liu, Enze Xie, Wenhai Wang, Chen Qian, Wanli Ouyang, Ping Luo

2020-07-23ECCV 2020 8Graph ClusteringHuman Detection2D Human Pose EstimationPose EstimationMulti-Person Pose EstimationKeypoint DetectionClustering
PaperPDF

Abstract

Multi-person pose estimation is challenging because it localizes body keypoints for multiple persons simultaneously. Previous methods can be divided into two streams, i.e. top-down and bottom-up methods. The top-down methods localize keypoints after human detection, while the bottom-up methods localize keypoints directly and then cluster/group them for different persons, which are generally more efficient than top-down methods. However, in existing bottom-up methods, the keypoint grouping is usually solved independently from keypoint detection, making them not end-to-end trainable and have sub-optimal performance. In this paper, we investigate a new perspective of human part grouping and reformulate it as a graph clustering task. Especially, we propose a novel differentiable Hierarchical Graph Grouping (HGG) method to learn the graph grouping in bottom-up multi-person pose estimation task. Moreover, HGG is easily embedded into main-stream bottom-up methods. It takes human keypoint candidates as graph nodes and clusters keypoints in a multi-layer graph neural network model. The modules of HGG can be trained end-to-end with the keypoint detection network and is able to supervise the grouping process in a hierarchical manner. To improve the discrimination of the clustering, we add a set of edge discriminators and macro-node discriminators. Extensive experiments on both COCO and OCHuman datasets demonstrate that the proposed method improves the performance of bottom-up pose estimation methods.

Results

TaskDatasetMetricValueModel
Pose EstimationOCHumanTest AP36HGG (AE+)
Pose EstimationOCHumanValidation AP41.8HGG (AE+)
Pose EstimationOCHumanTest AP36HGG (AE+)
Pose EstimationOCHumanValidation AP41.8HGG (AE+)
3DOCHumanTest AP36HGG (AE+)
3DOCHumanValidation AP41.8HGG (AE+)
3DOCHumanTest AP36HGG (AE+)
3DOCHumanValidation AP41.8HGG (AE+)
2D Human Pose EstimationOCHumanTest AP36HGG (AE+)
2D Human Pose EstimationOCHumanValidation AP41.8HGG (AE+)
1 Image, 2*2 StitchiOCHumanTest AP36HGG (AE+)
1 Image, 2*2 StitchiOCHumanValidation AP41.8HGG (AE+)
1 Image, 2*2 StitchiOCHumanTest AP36HGG (AE+)
1 Image, 2*2 StitchiOCHumanValidation AP41.8HGG (AE+)

Related Papers

Tri-Learn Graph Fusion Network for Attributed Graph Clustering2025-07-18$π^3$: Scalable Permutation-Equivariant Visual Geometry Learning2025-07-17Revisiting Reliability in the Reasoning-based Pose Estimation Benchmark2025-07-17DINO-VO: A Feature-based Visual Odometry Leveraging a Visual Foundation Model2025-07-17From Neck to Head: Bio-Impedance Sensing for Head Pose Estimation2025-07-17AthleticsPose: Authentic Sports Motion Dataset on Athletic Field and Evaluation of Monocular 3D Pose Estimation Ability2025-07-17SpatialTrackerV2: 3D Point Tracking Made Easy2025-07-16SGLoc: Semantic Localization System for Camera Pose Estimation from 3D Gaussian Splatting Representation2025-07-16