TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers/Action Segmentation with Joint Self-Supervised Temporal Do...

Action Segmentation with Joint Self-Supervised Temporal Domain Adaptation

Min-Hung Chen, Baopu Li, Yingze Bao, Ghassan AlRegib, Zsolt Kira

2020-03-05CVPR 2020 6Action SegmentationDomain Adaptation
PaperPDFCode(official)

Abstract

Despite the recent progress of fully-supervised action segmentation techniques, the performance is still not fully satisfactory. One main challenge is the problem of spatiotemporal variations (e.g. different people may perform the same activity in various ways). Therefore, we exploit unlabeled videos to address this problem by reformulating the action segmentation task as a cross-domain problem with domain discrepancy caused by spatio-temporal variations. To reduce the discrepancy, we propose Self-Supervised Temporal Domain Adaptation (SSTDA), which contains two self-supervised auxiliary tasks (binary and sequential domain prediction) to jointly align cross-domain feature spaces embedded with local and global temporal dynamics, achieving better performance than other Domain Adaptation (DA) approaches. On three challenging benchmark datasets (GTEA, 50Salads, and Breakfast), SSTDA outperforms the current state-of-the-art method by large margins (e.g. for the F1@25 score, from 59.6% to 69.1% on Breakfast, from 73.4% to 81.5% on 50Salads, and from 83.6% to 89.1% on GTEA), and requires only 65% of the labeled training data for comparable performance, demonstrating the usefulness of adapting to unlabeled target videos across variations. The source code is available at https://github.com/cmhungsteve/SSTDA.

Results

TaskDatasetMetricValueModel
Action Localization50 SaladsAcc83.2SSTDA
Action Localization50 SaladsEdit75.8SSTDA
Action Localization50 SaladsF1@10%83SSTDA
Action Localization50 SaladsF1@25%81.5SSTDA
Action Localization50 SaladsF1@50%73.8SSTDA
Action LocalizationGTEAAcc79.8SSTDA
Action LocalizationGTEAEdit86.2SSTDA
Action LocalizationGTEAF1@10%90SSTDA
Action LocalizationGTEAF1@25%89.1SSTDA
Action LocalizationGTEAF1@50%78SSTDA
Action LocalizationBreakfastAcc70.2SSTDA
Action LocalizationBreakfastAverage F166.4SSTDA
Action LocalizationBreakfastEdit73.7SSTDA
Action LocalizationBreakfastF1@10%75SSTDA
Action LocalizationBreakfastF1@25%69.1SSTDA
Action LocalizationBreakfastF1@50%55.2SSTDA
Action Segmentation50 SaladsAcc83.2SSTDA
Action Segmentation50 SaladsEdit75.8SSTDA
Action Segmentation50 SaladsF1@10%83SSTDA
Action Segmentation50 SaladsF1@25%81.5SSTDA
Action Segmentation50 SaladsF1@50%73.8SSTDA
Action SegmentationGTEAAcc79.8SSTDA
Action SegmentationGTEAEdit86.2SSTDA
Action SegmentationGTEAF1@10%90SSTDA
Action SegmentationGTEAF1@25%89.1SSTDA
Action SegmentationGTEAF1@50%78SSTDA
Action SegmentationBreakfastAcc70.2SSTDA
Action SegmentationBreakfastAverage F166.4SSTDA
Action SegmentationBreakfastEdit73.7SSTDA
Action SegmentationBreakfastF1@10%75SSTDA
Action SegmentationBreakfastF1@25%69.1SSTDA
Action SegmentationBreakfastF1@50%55.2SSTDA

Related Papers

A Privacy-Preserving Semantic-Segmentation Method Using Domain-Adaptation Technique2025-07-17Domain Borders Are There to Be Crossed With Federated Few-Shot Adaptation2025-07-14Self-supervised pretraining of vision transformers for animal behavioral analysis and neural encoding2025-07-13An Offline Mobile Conversational Agent for Mental Health Support: Learning from Emotional Dialogues and Psychological Texts with Student-Centered Evaluation2025-07-11The Bayesian Approach to Continual Learning: An Overview2025-07-11Doodle Your Keypoints: Sketch-Based Few-Shot Keypoint Detection2025-07-10YOLO-APD: Enhancing YOLOv8 for Robust Pedestrian Detection on Complex Road Geometries2025-07-07CORE-ReID V2: Advancing the Domain Adaptation for Object Re-Identification with Optimized Training and Ensemble Fusion2025-07-04