TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers/Real-Time Hand Gesture Recognition: Integrating Skeleton-B...

Real-Time Hand Gesture Recognition: Integrating Skeleton-Based Data Fusion and Multi-Stream CNN

Oluwaleke Yusuf, Maki Habib, Mohamed Moustafa

2024-06-21Image ClassificationSkeleton Based Action RecognitionGesture RecognitionHand Gesture RecognitionAction RecognitionHand-Gesture Recognition
PaperPDFCode(official)

Abstract

Hand Gesture Recognition (HGR) enables intuitive human-computer interactions in various real-world contexts. However, existing frameworks often struggle to meet the real-time requirements essential for practical HGR applications. This study introduces a robust, skeleton-based framework for dynamic HGR that simplifies the recognition of dynamic hand gestures into a static image classification task, effectively reducing both hardware and computational demands. Our framework utilizes a data-level fusion technique to encode 3D skeleton data from dynamic gestures into static RGB spatiotemporal images. It incorporates a specialized end-to-end Ensemble Tuner (e2eET) Multi-Stream CNN architecture that optimizes the semantic connections between data representations while minimizing computational needs. Tested across five benchmark datasets (SHREC'17, DHG-14/28, FPHA, LMDHG, and CNR), the framework showed competitive performance with the state-of-the-art. Its capability to support real-time HGR applications was also demonstrated through deployment on standard consumer PC hardware, showcasing low latency and minimal resource usage in real-world settings. The successful deployment of this framework underscores its potential to enhance real-time applications in fields such as virtual/augmented reality, ambient intelligence, and assistive technologies, providing a scalable and efficient solution for dynamic gesture recognition.

Results

TaskDatasetMetricValueModel
VideoSBU / SBU-RefineAccuracy93.96e2eET
VideoFirst-Person Hand Action Benchmark1:1 Accuracy91.83e2eET
Temporal Action LocalizationSBU / SBU-RefineAccuracy93.96e2eET
Temporal Action LocalizationFirst-Person Hand Action Benchmark1:1 Accuracy91.83e2eET
Zero-Shot LearningSBU / SBU-RefineAccuracy93.96e2eET
Zero-Shot LearningFirst-Person Hand Action Benchmark1:1 Accuracy91.83e2eET
Activity RecognitionSBU / SBU-RefineAccuracy93.96e2eET
Activity RecognitionFirst-Person Hand Action Benchmark1:1 Accuracy91.83e2eET
Action LocalizationSBU / SBU-RefineAccuracy93.96e2eET
Action LocalizationFirst-Person Hand Action Benchmark1:1 Accuracy91.83e2eET
HandDHG-28Accuracy92.38e2eET
HandSHREC 201714 Gestures Accuracy97.86e2eET
HandSHREC 201728 Gestures Accuracy95.36e2eET
HandDHG-14Accuracy95.83e2eET
Action DetectionSBU / SBU-RefineAccuracy93.96e2eET
Action DetectionFirst-Person Hand Action Benchmark1:1 Accuracy91.83e2eET
Gesture RecognitionDHG-28Accuracy92.38e2eET
Gesture RecognitionSHREC 201714 Gestures Accuracy97.86e2eET
Gesture RecognitionSHREC 201728 Gestures Accuracy95.36e2eET
Gesture RecognitionDHG-14Accuracy95.83e2eET
3D Action RecognitionSBU / SBU-RefineAccuracy93.96e2eET
3D Action RecognitionFirst-Person Hand Action Benchmark1:1 Accuracy91.83e2eET
Action RecognitionSBU / SBU-RefineAccuracy93.96e2eET
Action RecognitionFirst-Person Hand Action Benchmark1:1 Accuracy91.83e2eET

Related Papers

Efficient Deployment of Spiking Neural Networks on SpiNNaker2 for DVS Gesture Recognition Using Neuromorphic Intermediate Representation2025-09-04Automatic Classification and Segmentation of Tunnel Cracks Based on Deep Learning and Visual Explanations2025-07-18Adversarial attacks to image classification systems using evolutionary algorithms2025-07-17Efficient Adaptation of Pre-trained Vision Transformer underpinned by Approximately Orthogonal Fine-Tuning Strategy2025-07-17Federated Learning for Commercial Image Sources2025-07-17MUPAX: Multidimensional Problem Agnostic eXplainable AI2025-07-17A Real-Time System for Egocentric Hand-Object Interaction Detection in Industrial Domains2025-07-17Hashed Watermark as a Filter: Defeating Forging and Overwriting Attacks in Weight-based Neural Network Watermarking2025-07-15