TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers/No Pain, Big Gain: Classify Dynamic Point Cloud Sequences ...

No Pain, Big Gain: Classify Dynamic Point Cloud Sequences with Static Models by Fitting Feature-level Space-time Surfaces

Jia-Xing Zhong, Kaichen Zhou, Qingyong Hu, Bing Wang, Niki Trigoni, Andrew Markham

2022-03-21CVPR 2022 13D Action RecognitionRolling Shutter CorrectionPoint Cloud Classification
PaperPDFCode(official)

Abstract

Scene flow is a powerful tool for capturing the motion field of 3D point clouds. However, it is difficult to directly apply flow-based models to dynamic point cloud classification since the unstructured points make it hard or even impossible to efficiently and effectively trace point-wise correspondences. To capture 3D motions without explicitly tracking correspondences, we propose a kinematics-inspired neural network (Kinet) by generalizing the kinematic concept of ST-surfaces to the feature space. By unrolling the normal solver of ST-surfaces in the feature space, Kinet implicitly encodes feature-level dynamics and gains advantages from the use of mature backbones for static point cloud processing. With only minor changes in network structures and low computing overhead, it is painless to jointly train and deploy our framework with a given static model. Experiments on NvGesture, SHREC'17, MSRAction-3D, and NTU-RGBD demonstrate its efficacy in performance, efficiency in both the number of parameters and computational complexity, as well as its versatility to various static backbones. Noticeably, Kinet achieves the accuracy of 93.27% on MSRAction-3D with only 3.20M parameters and 10.35G FLOPS.

Results

TaskDatasetMetricValueModel
VideoNTU RGB+DCross Subject Accuracy92.3Kinet
VideoNTU RGB+DCross View Accuracy96.4Kinet
Temporal Action LocalizationNTU RGB+DCross Subject Accuracy92.3Kinet
Temporal Action LocalizationNTU RGB+DCross View Accuracy96.4Kinet
Zero-Shot LearningNTU RGB+DCross Subject Accuracy92.3Kinet
Zero-Shot LearningNTU RGB+DCross View Accuracy96.4Kinet
Activity RecognitionNTU RGB+DCross Subject Accuracy92.3Kinet
Activity RecognitionNTU RGB+DCross View Accuracy96.4Kinet
Action LocalizationNTU RGB+DCross Subject Accuracy92.3Kinet
Action LocalizationNTU RGB+DCross View Accuracy96.4Kinet
3D Action RecognitionNTU RGB+DCross Subject Accuracy92.3Kinet
3D Action RecognitionNTU RGB+DCross View Accuracy96.4Kinet
Action RecognitionNTU RGB+DCross Subject Accuracy92.3Kinet
Action RecognitionNTU RGB+DCross View Accuracy96.4Kinet

Related Papers

BeyondRPC: A Contrastive and Augmentation-Driven Framework for Robust Point Cloud Understanding2025-06-15Rethinking Gradient-based Adversarial Attacks on Point Cloud Classification2025-05-28SMART-PC: Skeletal Model Adaptation for Robust Test-Time Training in Point Clouds2025-05-26Hybrid-Emba3D: Geometry-Aware and Cross-Path Feature Hybrid Enhanced State Space Model for Point Cloud Classification2025-05-16Optimal Control for Transformer Architectures: Enhancing Generalization, Robustness and Efficiency2025-05-16Streaming Sliced Optimal Transport2025-05-11StableMotion: Repurposing Diffusion-Based Image Priors for Motion Estimation2025-05-10FA-KPConv: Introducing Euclidean Symmetries to KPConv via Frame Averaging2025-05-07