TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers/HYperbolic Self-Paced Learning for Self-Supervised Skeleto...

HYperbolic Self-Paced Learning for Self-Supervised Skeleton-based Action Representations

Luca Franco, Paolo Mandica, Bharti Munjal, Fabio Galasso

2023-03-10Unsupervised Skeleton Based Action RecognitionSkeleton Based Action RecognitionAction RecognitionDomain Adaptation
PaperPDFCode(official)

Abstract

Self-paced learning has been beneficial for tasks where some initial knowledge is available, such as weakly supervised learning and domain adaptation, to select and order the training sample sequence, from easy to complex. However its applicability remains unexplored in unsupervised learning, whereby the knowledge of the task matures during training. We propose a novel HYperbolic Self-Paced model (HYSP) for learning skeleton-based action representations. HYSP adopts self-supervision: it uses data augmentations to generate two views of the same sample, and it learns by matching one (named online) to the other (the target). We propose to use hyperbolic uncertainty to determine the algorithmic learning pace, under the assumption that less uncertain samples should be more strongly driving the training, with a larger weight and pace. Hyperbolic uncertainty is a by-product of the adopted hyperbolic neural networks, it matures during training and it comes with no extra cost, compared to the established Euclidean SSL framework counterparts. When tested on three established skeleton-based action recognition datasets, HYSP outperforms the state-of-the-art on PKU-MMD I, as well as on 2 out of 3 downstream tasks on NTU-60 and NTU-120. Additionally, HYSP only uses positive pairs and bypasses therefore the complex and computationally-demanding mining procedures required for the negatives in contrastive techniques. Code is available at https://github.com/paolomandica/HYSP.

Results

TaskDatasetMetricValueModel
VideoNTU RGB+D 120Accuracy (Cross-Setup)86.33s-HYSP
VideoNTU RGB+D 120Accuracy (Cross-Subject)84.53s-HYSP
VideoNTU RGB+D 120Accuracy (Cross-Setup)82HYSP
VideoNTU RGB+D 120Accuracy (Cross-Subject)81.4HYSP
VideoPKU-MMDAccuracy (Cross-Subject)96.23s-HYSP
VideoNTU RGB+DAccuracy (CS)89.13s-HYSP
VideoNTU RGB+DAccuracy (CV)95.23s-HYSP
VideoNTU RGB+DAccuracy (CS)86.5HYSP
VideoNTU RGB+DAccuracy (CV)93.5HYSP
Temporal Action LocalizationNTU RGB+D 120Accuracy (Cross-Setup)86.33s-HYSP
Temporal Action LocalizationNTU RGB+D 120Accuracy (Cross-Subject)84.53s-HYSP
Temporal Action LocalizationNTU RGB+D 120Accuracy (Cross-Setup)82HYSP
Temporal Action LocalizationNTU RGB+D 120Accuracy (Cross-Subject)81.4HYSP
Temporal Action LocalizationPKU-MMDAccuracy (Cross-Subject)96.23s-HYSP
Temporal Action LocalizationNTU RGB+DAccuracy (CS)89.13s-HYSP
Temporal Action LocalizationNTU RGB+DAccuracy (CV)95.23s-HYSP
Temporal Action LocalizationNTU RGB+DAccuracy (CS)86.5HYSP
Temporal Action LocalizationNTU RGB+DAccuracy (CV)93.5HYSP
Zero-Shot LearningNTU RGB+D 120Accuracy (Cross-Setup)86.33s-HYSP
Zero-Shot LearningNTU RGB+D 120Accuracy (Cross-Subject)84.53s-HYSP
Zero-Shot LearningNTU RGB+D 120Accuracy (Cross-Setup)82HYSP
Zero-Shot LearningNTU RGB+D 120Accuracy (Cross-Subject)81.4HYSP
Zero-Shot LearningPKU-MMDAccuracy (Cross-Subject)96.23s-HYSP
Zero-Shot LearningNTU RGB+DAccuracy (CS)89.13s-HYSP
Zero-Shot LearningNTU RGB+DAccuracy (CV)95.23s-HYSP
Zero-Shot LearningNTU RGB+DAccuracy (CS)86.5HYSP
Zero-Shot LearningNTU RGB+DAccuracy (CV)93.5HYSP
Activity RecognitionNTU RGB+D 120Accuracy (Cross-Setup)86.33s-HYSP
Activity RecognitionNTU RGB+D 120Accuracy (Cross-Subject)84.53s-HYSP
Activity RecognitionNTU RGB+D 120Accuracy (Cross-Setup)82HYSP
Activity RecognitionNTU RGB+D 120Accuracy (Cross-Subject)81.4HYSP
Activity RecognitionPKU-MMDAccuracy (Cross-Subject)96.23s-HYSP
Activity RecognitionNTU RGB+DAccuracy (CS)89.13s-HYSP
Activity RecognitionNTU RGB+DAccuracy (CV)95.23s-HYSP
Activity RecognitionNTU RGB+DAccuracy (CS)86.5HYSP
Activity RecognitionNTU RGB+DAccuracy (CV)93.5HYSP
Action LocalizationNTU RGB+D 120Accuracy (Cross-Setup)86.33s-HYSP
Action LocalizationNTU RGB+D 120Accuracy (Cross-Subject)84.53s-HYSP
Action LocalizationNTU RGB+D 120Accuracy (Cross-Setup)82HYSP
Action LocalizationNTU RGB+D 120Accuracy (Cross-Subject)81.4HYSP
Action LocalizationPKU-MMDAccuracy (Cross-Subject)96.23s-HYSP
Action LocalizationNTU RGB+DAccuracy (CS)89.13s-HYSP
Action LocalizationNTU RGB+DAccuracy (CV)95.23s-HYSP
Action LocalizationNTU RGB+DAccuracy (CS)86.5HYSP
Action LocalizationNTU RGB+DAccuracy (CV)93.5HYSP
Action DetectionNTU RGB+D 120Accuracy (Cross-Setup)86.33s-HYSP
Action DetectionNTU RGB+D 120Accuracy (Cross-Subject)84.53s-HYSP
Action DetectionNTU RGB+D 120Accuracy (Cross-Setup)82HYSP
Action DetectionNTU RGB+D 120Accuracy (Cross-Subject)81.4HYSP
Action DetectionPKU-MMDAccuracy (Cross-Subject)96.23s-HYSP
Action DetectionNTU RGB+DAccuracy (CS)89.13s-HYSP
Action DetectionNTU RGB+DAccuracy (CV)95.23s-HYSP
Action DetectionNTU RGB+DAccuracy (CS)86.5HYSP
Action DetectionNTU RGB+DAccuracy (CV)93.5HYSP
3D Action RecognitionNTU RGB+D 120Accuracy (Cross-Setup)86.33s-HYSP
3D Action RecognitionNTU RGB+D 120Accuracy (Cross-Subject)84.53s-HYSP
3D Action RecognitionNTU RGB+D 120Accuracy (Cross-Setup)82HYSP
3D Action RecognitionNTU RGB+D 120Accuracy (Cross-Subject)81.4HYSP
3D Action RecognitionPKU-MMDAccuracy (Cross-Subject)96.23s-HYSP
3D Action RecognitionNTU RGB+DAccuracy (CS)89.13s-HYSP
3D Action RecognitionNTU RGB+DAccuracy (CV)95.23s-HYSP
3D Action RecognitionNTU RGB+DAccuracy (CS)86.5HYSP
3D Action RecognitionNTU RGB+DAccuracy (CV)93.5HYSP
Action RecognitionNTU RGB+D 120Accuracy (Cross-Setup)86.33s-HYSP
Action RecognitionNTU RGB+D 120Accuracy (Cross-Subject)84.53s-HYSP
Action RecognitionNTU RGB+D 120Accuracy (Cross-Setup)82HYSP
Action RecognitionNTU RGB+D 120Accuracy (Cross-Subject)81.4HYSP
Action RecognitionPKU-MMDAccuracy (Cross-Subject)96.23s-HYSP
Action RecognitionNTU RGB+DAccuracy (CS)89.13s-HYSP
Action RecognitionNTU RGB+DAccuracy (CV)95.23s-HYSP
Action RecognitionNTU RGB+DAccuracy (CS)86.5HYSP
Action RecognitionNTU RGB+DAccuracy (CV)93.5HYSP

Related Papers

A Real-Time System for Egocentric Hand-Object Interaction Detection in Industrial Domains2025-07-17A Privacy-Preserving Semantic-Segmentation Method Using Domain-Adaptation Technique2025-07-17Domain Borders Are There to Be Crossed With Federated Few-Shot Adaptation2025-07-14An Offline Mobile Conversational Agent for Mental Health Support: Learning from Emotional Dialogues and Psychological Texts with Student-Centered Evaluation2025-07-11The Bayesian Approach to Continual Learning: An Overview2025-07-11Doodle Your Keypoints: Sketch-Based Few-Shot Keypoint Detection2025-07-10YOLO-APD: Enhancing YOLOv8 for Robust Pedestrian Detection on Complex Road Geometries2025-07-07CORE-ReID V2: Advancing the Domain Adaptation for Object Re-Identification with Optimized Training and Ensemble Fusion2025-07-04