TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers/Motion Fused Frames: Data Level Fusion Strategy for Hand G...

Motion Fused Frames: Data Level Fusion Strategy for Hand Gesture Recognition

Okan Köpüklü, Neslihan Köse, Gerhard Rigoll

2018-04-19Action ClassificationGesture RecognitionHand Gesture RecognitionGeneral ClassificationHand-Gesture Recognition
PaperPDFCode(official)

Abstract

Acquiring spatio-temporal states of an action is the most crucial step for action classification. In this paper, we propose a data level fusion strategy, Motion Fused Frames (MFFs), designed to fuse motion information into static images as better representatives of spatio-temporal states of an action. MFFs can be used as input to any deep learning architecture with very little modification on the network. We evaluate MFFs on hand gesture recognition tasks using three video datasets - Jester, ChaLearn LAP IsoGD and NVIDIA Dynamic Hand Gesture Datasets - which require capturing long-term temporal relations of hand movements. Our approach obtains very competitive performance on Jester and ChaLearn benchmarks with the classification accuracies of 96.28% and 57.4%, respectively, while achieving state-of-the-art performance with 84.7% accuracy on NVIDIA benchmark.

Results

TaskDatasetMetricValueModel
HandNVGestureAccuracy84.78-MFFs-3f1c
HandChaLearn valAccuracy57.48-MFFs-3f1c (5 crop)
HandJester testTop 1 Accuracy96.6DRX3D
HandJester valTop 1 Accuracy96.338-MFFs-3f1c (5 crop)
HandJester valTop 5 Accuracy99.868-MFFs-3f1c (5 crop)
HandChaLean testAccuracy56.78-MFFs-3f1c
Gesture RecognitionNVGestureAccuracy84.78-MFFs-3f1c
Gesture RecognitionChaLearn valAccuracy57.48-MFFs-3f1c (5 crop)
Gesture RecognitionJester testTop 1 Accuracy96.6DRX3D
Gesture RecognitionJester valTop 1 Accuracy96.338-MFFs-3f1c (5 crop)
Gesture RecognitionJester valTop 5 Accuracy99.868-MFFs-3f1c (5 crop)
Gesture RecognitionChaLean testAccuracy56.78-MFFs-3f1c

Related Papers

Efficient Deployment of Spiking Neural Networks on SpiNNaker2 for DVS Gesture Recognition Using Neuromorphic Intermediate Representation2025-09-04Visual Hand Gesture Recognition with Deep Learning: A Comprehensive Review of Methods, Datasets, Challenges and Future Research Directions2025-07-06How do Foundation Models Compare to Skeleton-Based Approaches for Gesture Recognition in Human-Robot Interaction?2025-06-25Wi-Fi Sensing Tool Release: Gathering 802.11ax Channel State Information from a Commercial Wi-Fi Access Point2025-06-20Accessible Gesture-Driven Augmented Reality Interaction System2025-06-18SLRNet: A Real-Time LSTM-Based Sign Language Recognition System2025-06-11Diver-Robot Communication Dataset for Underwater Hand Gesture Recognition2025-06-10SurgBench: A Unified Large-Scale Benchmark for Surgical Video Analysis2025-06-09