Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.


Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.


Collaborative Learning for Hand and Object Reconstruction with Attention-guided Graph Convolution

Tze Ho Elden Tse, Kwang In Kim, Ales Leonardis, Hyung Jin Chang

2022-04-27 · CVPR 2022
Tasks: 3D Hand Pose Estimation · Object Reconstruction · hand-object pose · Pose Estimation · 3D Pose Estimation
Paper · PDF

Abstract

Estimating the pose and shape of hands and objects under interaction has numerous applications, including augmented and virtual reality. Existing approaches to hand and object reconstruction require explicitly defined physical constraints and known objects, which limits their application domains. Our algorithm is agnostic to object models and learns the physical rules governing hand-object interaction. This requires automatically inferring the shapes and physical interaction of hands and (potentially unknown) objects. We approach this challenging problem with a collaborative learning strategy in which two branches of deep networks learn from each other: we transfer hand mesh information to the object branch, and object mesh information to the hand branch. The resulting optimisation (training) problem can be unstable, and we address this via two strategies: (i) an attention-guided graph convolution, which helps identify and focus on mutual occlusion, and (ii) an unsupervised associative loss, which facilitates the transfer of information between the branches. Experiments on four widely used benchmarks show that our framework surpasses state-of-the-art accuracy in 3D pose estimation and also recovers dense 3D hand and object shapes. Ablation studies show that each technical component contributes meaningfully.
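The attention-guided graph convolution described above operates on mesh-vertex features, weighting each neighbour's contribution by a learned attention score. A minimal, framework-free sketch of the idea follows; the function name, the dot-product attention, and the residual update are illustrative assumptions, not the authors' actual implementation:

```python
import math

def attention_graph_conv(features, edges):
    """One attention-guided graph-convolution step (illustrative sketch).

    features: list of per-vertex feature vectors (lists of floats)
    edges:    list of (i, j) index pairs; undirected mesh connectivity

    Each vertex aggregates its neighbours' features, weighted by a
    softmax over dot-product attention scores, then adds the result
    to its own feature vector (a residual update).
    """
    n = len(features)
    neighbours = [[] for _ in range(n)]
    for i, j in edges:
        neighbours[i].append(j)
        neighbours[j].append(i)

    updated = []
    for i, f_i in enumerate(features):
        if not neighbours[i]:
            # Isolated vertex: nothing to aggregate, keep as-is.
            updated.append(list(f_i))
            continue
        # Dot-product attention score against each neighbour.
        scores = [sum(a * b for a, b in zip(f_i, features[j]))
                  for j in neighbours[i]]
        # Numerically stable softmax over the neighbourhood.
        m = max(scores)
        weights = [math.exp(s - m) for s in scores]
        z = sum(weights)
        weights = [w / z for w in weights]
        # Attention-weighted aggregation plus residual connection.
        agg = [sum(w * features[j][d] for w, j in zip(weights, neighbours[i]))
               for d in range(len(f_i))]
        updated.append([a + b for a, b in zip(f_i, agg)])
    return updated
```

In the paper's two-branch setting, a step like this would run over the hand mesh with features transferred from the object branch (and vice versa), so the attention weights can emphasise vertices in mutually occluded regions.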

Results

Task                     Dataset    Metric               Value   Model
3D Hand Pose Estimation  HO-3D v2   F@15mm               0.943   Tse et al.
3D Hand Pose Estimation  HO-3D v2   F@5mm                0.485   Tse et al.
3D Hand Pose Estimation  HO-3D v2   PA-MPVPE (mm)        10.9    Tse et al.
3D Hand Pose Estimation  DexYCB     Average MPJPE (mm)   15.3    CLAGC
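The metrics in the table are standard for hand reconstruction: MPJPE is the mean Euclidean distance between corresponding predicted and ground-truth joints, and F@τ is the harmonic mean of precision and recall between two point sets at distance threshold τ. A minimal sketch of both, using toy 3D points in millimetres (all names and data here are illustrative):

```python
import math

def mpjpe(pred, gt):
    """Mean Per-Joint Position Error: average Euclidean distance
    between corresponding predicted and ground-truth joints."""
    assert len(pred) == len(gt)
    dists = [math.dist(p, g) for p, g in zip(pred, gt)]
    return sum(dists) / len(dists)

def f_score(pred, gt, tau):
    """F-score at threshold tau: harmonic mean of precision
    (fraction of predicted points within tau of some GT point)
    and recall (fraction of GT points within tau of some prediction)."""
    def coverage(src, dst):
        hits = sum(1 for s in src if min(math.dist(s, d) for d in dst) <= tau)
        return hits / len(src)
    p, r = coverage(pred, gt), coverage(gt, pred)
    return 0.0 if p + r == 0 else 2 * p * r / (p + r)

# Toy example with three 3D joints (coordinates in mm).
gt   = [(0.0, 0.0, 0.0), (10.0, 0.0, 0.0), (0.0, 10.0, 0.0)]
pred = [(1.0, 0.0, 0.0), (10.0, 2.0, 0.0), (0.0, 10.0, 3.0)]
print(round(mpjpe(pred, gt), 3))   # mean of per-joint errors 1, 2, 3 -> 2.0
print(f_score(pred, gt, tau=2.5))  # 2 of 3 points matched within 2.5 mm each way
```

Note the asymmetry in interpretation: MPJPE assumes a known joint correspondence, while F@τ matches each point to its nearest neighbour in the other set, which is why it also applies to dense shape evaluation.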

Related Papers

$π^3$: Scalable Permutation-Equivariant Visual Geometry Learning (2025-07-17)
Revisiting Reliability in the Reasoning-based Pose Estimation Benchmark (2025-07-17)
DINO-VO: A Feature-based Visual Odometry Leveraging a Visual Foundation Model (2025-07-17)
From Neck to Head: Bio-Impedance Sensing for Head Pose Estimation (2025-07-17)
AthleticsPose: Authentic Sports Motion Dataset on Athletic Field and Evaluation of Monocular 3D Pose Estimation Ability (2025-07-17)
SpatialTrackerV2: 3D Point Tracking Made Easy (2025-07-16)
SGLoc: Semantic Localization System for Camera Pose Estimation from 3D Gaussian Splatting Representation (2025-07-16)
Efficient Calisthenics Skills Classification through Foreground Instance Selection and Depth Estimation (2025-07-16)