TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers/Simple yet efficient real-time pose-based action recognition

Simple yet efficient real-time pose-based action recognition

Dennis Ludl, Thomas Gulde, Cristóbal Curio

2019-04-19Action DetectionSkeleton Based Action RecognitionAutonomous DrivingAction RecognitionTemporal Action Localization
PaperPDFCode(official)

Abstract

Recognizing human actions is a core challenge for autonomous systems as they directly share the same space with humans. Systems must be able to recognize and assess human actions in real-time. In order to train corresponding data-driven algorithms, a significant amount of annotated training data is required. We demonstrated a pipeline to detect humans, estimate their pose, track them over time and recognize their actions in real-time with standard monocular camera sensors. For action recognition, we encode the human pose into a new data format called Encoded Human Pose Image (EHPI) that can then be classified using standard methods from the computer vision community. With this simple procedure we achieve competitive state-of-the-art performance in pose-based action detection and can ensure real-time performance. In addition, we show a use case in the context of autonomous driving to demonstrate how such a system can be trained to recognize human actions using simulation data.

Results

TaskDatasetMetricValueModel
VideoJHMDB (2D poses only)Average accuracy of 3 splits65.5EHPI
VideoJ-HMDBAccuracy (pose)65.5EHPI
Temporal Action LocalizationJHMDB (2D poses only)Average accuracy of 3 splits65.5EHPI
Temporal Action LocalizationJ-HMDBAccuracy (pose)65.5EHPI
Zero-Shot LearningJHMDB (2D poses only)Average accuracy of 3 splits65.5EHPI
Zero-Shot LearningJ-HMDBAccuracy (pose)65.5EHPI
Activity RecognitionJHMDB (2D poses only)Average accuracy of 3 splits65.5EHPI
Activity RecognitionJ-HMDBAccuracy (pose)65.5EHPI
Action LocalizationJHMDB (2D poses only)Average accuracy of 3 splits65.5EHPI
Action LocalizationJ-HMDBAccuracy (pose)65.5EHPI
Action DetectionJHMDB (2D poses only)Average accuracy of 3 splits65.5EHPI
Action DetectionJ-HMDBAccuracy (pose)65.5EHPI
3D Action RecognitionJHMDB (2D poses only)Average accuracy of 3 splits65.5EHPI
3D Action RecognitionJ-HMDBAccuracy (pose)65.5EHPI
Action RecognitionJHMDB (2D poses only)Average accuracy of 3 splits65.5EHPI
Action RecognitionJ-HMDBAccuracy (pose)65.5EHPI

Related Papers

GEMINUS: Dual-aware Global and Scene-Adaptive Mixture-of-Experts for End-to-End Autonomous Driving2025-07-19AGENTS-LLM: Augmentative GENeration of Challenging Traffic Scenarios with an Agentic LLM Framework2025-07-18World Model-Based End-to-End Scene Generation for Accident Anticipation in Autonomous Driving2025-07-17Orbis: Overcoming Challenges of Long-Horizon Prediction in Driving World Models2025-07-17Channel-wise Motion Features for Efficient Motion Segmentation2025-07-17LaViPlan : Language-Guided Visual Path Planning with RLVR2025-07-17A Real-Time System for Egocentric Hand-Object Interaction Detection in Industrial Domains2025-07-17Safeguarding Federated Learning-based Road Condition Classification2025-07-16