Simple yet efficient real-time pose-based action recognition

Dennis Ludl, Thomas Gulde, Cristóbal Curio

2019-04-19Action Detection Skeleton Based Action Recognition Autonomous Driving Action Recognition Temporal Action Localization

Paper PDF Code(official)

Abstract

Recognizing human actions is a core challenge for autonomous systems as they directly share the same space with humans. Systems must be able to recognize and assess human actions in real-time. In order to train corresponding data-driven algorithms, a significant amount of annotated training data is required. We demonstrated a pipeline to detect humans, estimate their pose, track them over time and recognize their actions in real-time with standard monocular camera sensors. For action recognition, we encode the human pose into a new data format called Encoded Human Pose Image (EHPI) that can then be classified using standard methods from the computer vision community. With this simple procedure we achieve competitive state-of-the-art performance in pose-based action detection and can ensure real-time performance. In addition, we show a use case in the context of autonomous driving to demonstrate how such a system can be trained to recognize human actions using simulation data.

Results

Task	Dataset	Metric	Value	Model
Video	JHMDB (2D poses only)	Average accuracy of 3 splits	65.5	EHPI
Video	J-HMDB	Accuracy (pose)	65.5	EHPI
Temporal Action Localization	JHMDB (2D poses only)	Average accuracy of 3 splits	65.5	EHPI
Temporal Action Localization	J-HMDB	Accuracy (pose)	65.5	EHPI
Zero-Shot Learning	JHMDB (2D poses only)	Average accuracy of 3 splits	65.5	EHPI
Zero-Shot Learning	J-HMDB	Accuracy (pose)	65.5	EHPI
Activity Recognition	JHMDB (2D poses only)	Average accuracy of 3 splits	65.5	EHPI
Activity Recognition	J-HMDB	Accuracy (pose)	65.5	EHPI
Action Localization	JHMDB (2D poses only)	Average accuracy of 3 splits	65.5	EHPI
Action Localization	J-HMDB	Accuracy (pose)	65.5	EHPI
Action Detection	JHMDB (2D poses only)	Average accuracy of 3 splits	65.5	EHPI
Action Detection	J-HMDB	Accuracy (pose)	65.5	EHPI
3D Action Recognition	JHMDB (2D poses only)	Average accuracy of 3 splits	65.5	EHPI
3D Action Recognition	J-HMDB	Accuracy (pose)	65.5	EHPI
Action Recognition	JHMDB (2D poses only)	Average accuracy of 3 splits	65.5	EHPI
Action Recognition	J-HMDB	Accuracy (pose)	65.5	EHPI

Simple yet efficient real-time pose-based action recognition

Abstract

Results

Related Papers

Simple yet efficient real-time pose-based action recognition

Abstract

Results

Related Papers