SRNet: Improving Generalization in 3D Human Pose Estimation with a Split-and-Recombine Approach

Ailing Zeng, Xiao Sun, Fuyang Huang, Minhao Liu, Qiang Xu, Stephen Lin

2020-07-18
ECCV 2020 8
3D Human Pose Estimation, Monocular 3D Human Pose Estimation, Pose Estimation

Abstract

Human poses that are rare or unseen in a training set are challenging for a network to predict. Similar to the long-tailed distribution problem in visual recognition, the small number of examples for such poses limits the ability of networks to model them. Interestingly, local pose distributions suffer less from the long-tail problem, i.e., local joint configurations within a rare pose may appear within other poses in the training set, making them less rare. We propose to take advantage of this fact for better generalization to rare and unseen poses. To be specific, our method splits the body into local regions and processes them in separate network branches, utilizing the property that a joint position depends mainly on the joints within its local body region. Global coherence is maintained by recombining the global context from the rest of the body into each branch as a low-dimensional vector. With the reduced dimensionality of less relevant body areas, the training set distribution within network branches more closely reflects the statistics of local poses instead of global body poses, without sacrificing information important for joint inference. The proposed split-and-recombine approach, called SRNet, can be easily adapted to both single-image and temporal models, and it leads to appreciable improvements in the prediction of rare and unseen poses.
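The split-and-recombine idea described in the abstract can be illustrated with a minimal sketch. It is not the authors' architecture: the joint grouping, the 17-joint skeleton, the context size `CONTEXT_DIM`, the purely linear layers, and all names (`GROUPS`, `SplitRecombineSketch`, `linear`, `apply`) are assumptions made for illustration. Each branch sees the 2D joints of its own body region at full dimensionality, while the rest of the body reaches it only as a low-dimensional context vector.

```python
import random

# Hypothetical grouping of a 17-joint skeleton into local body regions.
# The indices and group names are assumptions, not taken from the paper.
GROUPS = {
    "torso": [0, 7, 8, 9, 10],
    "right_leg": [1, 2, 3],
    "left_leg": [4, 5, 6],
    "left_arm": [11, 12, 13],
    "right_arm": [14, 15, 16],
}
NUM_JOINTS = 17
CONTEXT_DIM = 1  # assumed size of the low-dimensional global context vector


def linear(in_dim, out_dim, rng):
    """Create a random weight matrix with out_dim rows and in_dim columns."""
    return [[rng.uniform(-0.1, 0.1) for _ in range(in_dim)] for _ in range(out_dim)]


def apply(weights, vec):
    """Multiply a weight matrix by a vector."""
    return [sum(w * x for w, x in zip(row, vec)) for row in weights]


class SplitRecombineSketch:
    """One linear branch per body region, plus a compressed global context."""

    def __init__(self, seed=0):
        rng = random.Random(seed)
        self.branches = {}
        for name, joints in GROUPS.items():
            local_dim = 2 * len(joints)                # 2D coords of the region
            rest_dim = 2 * (NUM_JOINTS - len(joints))  # 2D coords of all other joints
            self.branches[name] = {
                # Recombine: squeeze the rest of the body into CONTEXT_DIM values.
                "compress": linear(rest_dim, CONTEXT_DIM, rng),
                # Predict 3D coords of the region from local input plus context.
                "predict": linear(local_dim + CONTEXT_DIM, 3 * len(joints), rng),
            }

    def forward(self, pose_2d):
        """Map a list of 17 (x, y) joints to a list of 17 (x, y, z) joints."""
        pose_3d = [None] * NUM_JOINTS
        for name, joints in GROUPS.items():
            members = set(joints)
            # Split: the branch receives its own joints at full dimensionality.
            local = [c for j in joints for c in pose_2d[j]]
            rest = [c for j in range(NUM_JOINTS) if j not in members for c in pose_2d[j]]
            context = apply(self.branches[name]["compress"], rest)
            out = apply(self.branches[name]["predict"], local + context)
            for i, j in enumerate(joints):
                pose_3d[j] = tuple(out[3 * i:3 * i + 3])
        return pose_3d


model = SplitRecombineSketch(seed=0)
pose_2d = [(float(j), 0.5 * j) for j in range(NUM_JOINTS)]
pose_3d = model.forward(pose_2d)
```

Because each branch's input is dominated by its own region, a local joint configuration that appears in many different global poses looks similar to that branch every time, which is the property the abstract relies on for rare and unseen poses.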

Results

Task	Dataset	Metric	Value	Model
3D Human Pose Estimation	MPI-INF-3DHP	AUC	43.8	SRNet
3D Human Pose Estimation	MPI-INF-3DHP	PCK	77.6	SRNet
3D Human Pose Estimation	Human3.6M	Average MPJPE (mm)	44.8	SRNet (T=243)
3D Human Pose Estimation	Human3.6M	Average MPJPE (mm)	49.9	SRNet (T=1)
3D Human Pose Estimation	Human3.6M	Average MPJPE (mm)	49.9	SRNet
3D Human Pose Estimation	Human3.6M	Frames Needed	1	SRNet
