Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.


Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.


HDBN: A Novel Hybrid Dual-branch Network for Robust Skeleton-based Action Recognition

Jinfu Liu, Baiqiao Yin, Jiaying Lin, Jiajun Wen, Yue Li, Mengyuan Liu

Published: 2024-04-24 · Tasks: Skeleton Based Action Recognition, Action Recognition
Links: Paper · PDF · Code (official)

Abstract

Skeleton-based action recognition has gained considerable traction thanks to its utilization of succinct and robust skeletal representations. Nonetheless, current methodologies often lean towards utilizing a solitary backbone to model skeleton modality, which can be limited by inherent flaws in the network backbone. To address this and fully leverage the complementary characteristics of various network architectures, we propose a novel Hybrid Dual-Branch Network (HDBN) for robust skeleton-based action recognition, which benefits from the graph convolutional network's proficiency in handling graph-structured data and the powerful modeling capabilities of Transformers for global information. In detail, our proposed HDBN is divided into two trunk branches: MixGCN and MixFormer. The two branches utilize GCNs and Transformers to model both 2D and 3D skeletal modalities respectively. Our proposed HDBN emerged as one of the top solutions in the Multi-Modal Video Reasoning and Analyzing Competition (MMVRAC) of 2024 ICME Grand Challenge, achieving accuracies of 47.95% and 75.36% on two benchmarks of the UAV-Human dataset by outperforming most existing methods. Our code will be publicly available at: https://github.com/liujf69/ICMEW2024-Track10.
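The abstract describes a two-branch design: a GCN branch (MixGCN) and a Transformer branch (MixFormer) each score the input skeleton sequence, and their outputs are combined. As a minimal sketch of how such score-level fusion of two branches can work (the exact fusion rule and weights here are assumptions, not the authors' published configuration), each branch's logits can be converted to class probabilities and averaged with a tunable weight:

```python
import numpy as np

def softmax(x):
    # Numerically stable softmax over the last axis.
    e = np.exp(x - x.max(axis=-1, keepdims=True))
    return e / e.sum(axis=-1, keepdims=True)

def fuse_branches(gcn_logits, transformer_logits, weight=0.5):
    # Weighted average of per-branch class probabilities (late fusion).
    # `weight` balances the GCN branch against the Transformer branch;
    # the value is a hypothetical hyperparameter, not from the paper.
    return weight * softmax(gcn_logits) + (1 - weight) * softmax(transformer_logits)

# Toy example with 3 classes: the two branches disagree,
# and fusion resolves the prediction.
gcn = np.array([2.0, 0.5, 0.1])
tr = np.array([0.2, 1.8, 0.1])
scores = fuse_branches(gcn, tr, weight=0.6)
pred = int(np.argmax(scores))  # fused class prediction
```

The fused scores remain a valid probability distribution, so the same rule extends to combining more than two streams (e.g. 2D and 3D skeletal modalities) by averaging additional softmaxed score vectors.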

Results

| Task | Dataset | Metric | Value | Model |
|---|---|---|---|---|
| Video | UAV-Human | CSv1 (%) | 47.96 | HDBN |
| Video | UAV-Human | CSv2 (%) | 75.36 | HDBN |
| Temporal Action Localization | UAV-Human | CSv1 (%) | 47.96 | HDBN |
| Temporal Action Localization | UAV-Human | CSv2 (%) | 75.36 | HDBN |
| Zero-Shot Learning | UAV-Human | CSv1 (%) | 47.96 | HDBN |
| Zero-Shot Learning | UAV-Human | CSv2 (%) | 75.36 | HDBN |
| Activity Recognition | UAV-Human | CSv1 (%) | 47.96 | HDBN |
| Activity Recognition | UAV-Human | CSv2 (%) | 75.36 | HDBN |
| Action Localization | UAV-Human | CSv1 (%) | 47.96 | HDBN |
| Action Localization | UAV-Human | CSv2 (%) | 75.36 | HDBN |
| Action Detection | UAV-Human | CSv1 (%) | 47.96 | HDBN |
| Action Detection | UAV-Human | CSv2 (%) | 75.36 | HDBN |
| 3D Action Recognition | UAV-Human | CSv1 (%) | 47.96 | HDBN |
| 3D Action Recognition | UAV-Human | CSv2 (%) | 75.36 | HDBN |
| Action Recognition | UAV-Human | CSv1 (%) | 47.96 | HDBN |
| Action Recognition | UAV-Human | CSv2 (%) | 75.36 | HDBN |

Related Papers

- A Real-Time System for Egocentric Hand-Object Interaction Detection in Industrial Domains (2025-07-17)
- Zero-shot Skeleton-based Action Recognition with Prototype-guided Feature Alignment (2025-07-01)
- EgoAdapt: Adaptive Multisensory Distillation and Policy Learning for Efficient Egocentric Perception (2025-06-26)
- Feature Hallucination for Self-supervised Action Recognition (2025-06-25)
- CARMA: Context-Aware Situational Grounding of Human-Robot Group Interactions by Combining Vision-Language Models with Object and Action Recognition (2025-06-25)
- Including Semantic Information via Word Embeddings for Skeleton-based Action Recognition (2025-06-23)
- Adapting Vision-Language Models for Evaluating World Models (2025-06-22)
- Active Multimodal Distillation for Few-shot Action Recognition (2025-06-16)