TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers/A Spatio-Temporal Multilayer Perceptron for Gesture Recogn...

A Spatio-Temporal Multilayer Perceptron for Gesture Recognition

Adrian Holzbock, Alexander Tsaregorodtsev, Youssef Dawoud, Klaus Dietmayer, Vasileios Belagiannis

2022-04-25Autonomous VehiclesSkeleton Based Action RecognitionGesture Recognition
PaperPDFCode(official)

Abstract

Gesture recognition is essential for the interaction of autonomous vehicles with humans. While the current approaches focus on combining several modalities like image features, keypoints and bone vectors, we present neural network architecture that delivers state-of-the-art results only with body skeleton input data. We propose the spatio-temporal multilayer perceptron for gesture recognition in the context of autonomous vehicles. Given 3D body poses over time, we define temporal and spatial mixing operations to extract features in both domains. Additionally, the importance of each time step is re-weighted with Squeeze-and-Excitation layers. An extensive evaluation of the TCG and Drive&Act datasets is provided to showcase the promising performance of our approach. Furthermore, we deploy our model to our autonomous vehicle to show its real-time capability and stable execution.

Results

TaskDatasetMetricValueModel
VideoTCG-datasetAcc85.99stMLP
VideoTCG-datasetF1-Score80.05stMLP
VideoTCG-datasetJaccard Index67.88stMLP
VideoDrive&Actmean per-class accuracy34.61stMLP
Temporal Action LocalizationTCG-datasetAcc85.99stMLP
Temporal Action LocalizationTCG-datasetF1-Score80.05stMLP
Temporal Action LocalizationTCG-datasetJaccard Index67.88stMLP
Temporal Action LocalizationDrive&Actmean per-class accuracy34.61stMLP
Zero-Shot LearningTCG-datasetAcc85.99stMLP
Zero-Shot LearningTCG-datasetF1-Score80.05stMLP
Zero-Shot LearningTCG-datasetJaccard Index67.88stMLP
Zero-Shot LearningDrive&Actmean per-class accuracy34.61stMLP
Activity RecognitionTCG-datasetAcc85.99stMLP
Activity RecognitionTCG-datasetF1-Score80.05stMLP
Activity RecognitionTCG-datasetJaccard Index67.88stMLP
Activity RecognitionDrive&Actmean per-class accuracy34.61stMLP
Action LocalizationTCG-datasetAcc85.99stMLP
Action LocalizationTCG-datasetF1-Score80.05stMLP
Action LocalizationTCG-datasetJaccard Index67.88stMLP
Action LocalizationDrive&Actmean per-class accuracy34.61stMLP
Action DetectionTCG-datasetAcc85.99stMLP
Action DetectionTCG-datasetF1-Score80.05stMLP
Action DetectionTCG-datasetJaccard Index67.88stMLP
Action DetectionDrive&Actmean per-class accuracy34.61stMLP
3D Action RecognitionTCG-datasetAcc85.99stMLP
3D Action RecognitionTCG-datasetF1-Score80.05stMLP
3D Action RecognitionTCG-datasetJaccard Index67.88stMLP
3D Action RecognitionDrive&Actmean per-class accuracy34.61stMLP
Action RecognitionTCG-datasetAcc85.99stMLP
Action RecognitionTCG-datasetF1-Score80.05stMLP
Action RecognitionTCG-datasetJaccard Index67.88stMLP
Action RecognitionDrive&Actmean per-class accuracy34.61stMLP

Related Papers

Efficient Deployment of Spiking Neural Networks on SpiNNaker2 for DVS Gesture Recognition Using Neuromorphic Intermediate Representation2025-09-04Vision-based Perception for Autonomous Vehicles in Obstacle Avoidance Scenarios2025-07-16Fast and Accurate Collision Probability Estimation for Autonomous Vehicles using Adaptive Sigma-Point Sampling2025-07-08Robustifying 3D Perception through Least-Squares Multi-Agent Graphs Object Tracking2025-07-07Visual Hand Gesture Recognition with Deep Learning: A Comprehensive Review of Methods, Datasets, Challenges and Future Research Directions2025-07-06LLM-based Realistic Safety-Critical Driving Video Generation2025-07-02Zero-shot Skeleton-based Action Recognition with Prototype-guided Feature Alignment2025-07-01A Survey on Vision-Language-Action Models for Autonomous Driving2025-06-30