TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers/Skeleton based action recognition using translation-scale ...

Skeleton based action recognition using translation-scale invariant image mapping and multi-scale deep cnn

Bo Li, Mingyi He, Xuelian Cheng, Yu-cheng Chen, Yuchao Dai

2017-04-19Image ClassificationSkeleton Based Action RecognitionTranslationAction RecognitionTemporal Action Localization
PaperPDF

Abstract

This paper presents an image classification based approach for skeleton-based video action recognition problem. Firstly, A dataset independent translation-scale invariant image mapping method is proposed, which transformes the skeleton videos to colour images, named skeleton-images. Secondly, A multi-scale deep convolutional neural network (CNN) architecture is proposed which could be built and fine-tuned on the powerful pre-trained CNNs, e.g., AlexNet, VGGNet, ResNet etal.. Even though the skeleton-images are very different from natural images, the fine-tune strategy still works well. At last, we prove that our method could also work well on 2D skeleton video data. We achieve the state-of-the-art results on the popular benchmard datasets e.g. NTU RGB+D, UTD-MHAD, MSRC-12, and G3D. Especially on the largest and challenge NTU RGB+D, UTD-MHAD, and MSRC-12 dataset, our method outperforms other methods by a large margion, which proves the efficacy of the proposed method.

Results

TaskDatasetMetricValueModel
VideoNTU RGB+DAccuracy (CS)853scale ResNet152
Temporal Action LocalizationNTU RGB+DAccuracy (CS)853scale ResNet152
Zero-Shot LearningNTU RGB+DAccuracy (CS)853scale ResNet152
Activity RecognitionNTU RGB+DAccuracy (CS)853scale ResNet152
Action LocalizationNTU RGB+DAccuracy (CS)853scale ResNet152
Action DetectionNTU RGB+DAccuracy (CS)853scale ResNet152
3D Action RecognitionNTU RGB+DAccuracy (CS)853scale ResNet152
Action RecognitionNTU RGB+DAccuracy (CS)853scale ResNet152

Related Papers

Automatic Classification and Segmentation of Tunnel Cracks Based on Deep Learning and Visual Explanations2025-07-18Adversarial attacks to image classification systems using evolutionary algorithms2025-07-17Efficient Adaptation of Pre-trained Vision Transformer underpinned by Approximately Orthogonal Fine-Tuning Strategy2025-07-17Federated Learning for Commercial Image Sources2025-07-17MUPAX: Multidimensional Problem Agnostic eXplainable AI2025-07-17A Translation of Probabilistic Event Calculus into Markov Decision Processes2025-07-17A Real-Time System for Egocentric Hand-Object Interaction Detection in Industrial Domains2025-07-17DVFL-Net: A Lightweight Distilled Video Focal Modulation Network for Spatio-Temporal Action Recognition2025-07-16