Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Temporally Consistent Unbalanced Optimal Transport for Unsupervised Action Segmentation

Ming Xu, Stephen Gould

2024-04-01 | CVPR 2024 | Action Segmentation, Unsupervised Action Segmentation, Segmentation

Abstract

We propose a novel approach to the action segmentation task for long, untrimmed videos, based on solving an optimal transport problem. By encoding a temporal consistency prior into a Gromov-Wasserstein problem, we are able to decode a temporally consistent segmentation from a noisy affinity/matching cost matrix between video frames and action classes. Unlike previous approaches, our method does not require knowing the action order for a video to attain temporal consistency. Furthermore, our resulting (fused) Gromov-Wasserstein problem can be efficiently solved on GPUs using a few iterations of projected mirror descent. We demonstrate the effectiveness of our method in an unsupervised learning setting, where our method is used to generate pseudo-labels for self-training. We evaluate our segmentation approach and unsupervised learning pipeline on the Breakfast, 50-Salads, YouTube Instructions and Desktop Assembly datasets, yielding state-of-the-art results for the unsupervised video action segmentation task.
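The decoding idea in the abstract can be illustrated with a small sketch. This is not the authors' implementation: it is a simplified stand-in that keeps only a linear matching cost plus a Gromov-Wasserstein-style quadratic term that penalises nearby frames for choosing different actions, and it omits the unbalanced relaxation of the action marginal. The function names `temporal_weights` and `decode_segmentation`, and the parameters `radius`, `alpha`, `step` and `n_iters`, are hypothetical choices made for this example.

```python
import numpy as np


def temporal_weights(n_frames: int, radius: int) -> np.ndarray:
    """W[i, j] = 1 when frames i and j are distinct and at most `radius` apart."""
    idx = np.arange(n_frames)
    dist = np.abs(idx[:, None] - idx[None, :])
    return ((dist > 0) & (dist <= radius)).astype(float)


def decode_segmentation(cost, radius=3, alpha=3.0, step=1.0, n_iters=20):
    """Projected mirror descent on <C, P> + alpha / (2 * radius) * <W P A, P>.

    Assumptions of this sketch (not taken from the paper):
    P is a soft frame-to-action assignment whose rows each sum to 1,
    and A = 1 - I charges nearby frames that pick different actions.
    """
    n_frames, n_actions = cost.shape
    W = temporal_weights(n_frames, radius)
    A = 1.0 - np.eye(n_actions)
    log_p = np.full((n_frames, n_actions), -np.log(n_actions))  # uniform start
    for _ in range(n_iters):
        P = np.exp(log_p)
        # Gradient of the objective; W and A are symmetric.
        grad = cost + (alpha / radius) * (W @ P @ A)
        # Entropic mirror step: multiplicative update, done in the log domain.
        log_p = log_p - step * grad
        # KL projection onto {rows sum to 1}: renormalise each row.
        log_p -= log_p.max(axis=1, keepdims=True)
        log_p -= np.log(np.exp(log_p).sum(axis=1, keepdims=True))
    return np.exp(log_p)


# Toy video: three actions of ten frames each, with two isolated noisy frames.
truth = np.repeat([0, 1, 2], 10)
noisy = truth.copy()
noisy[5], noisy[15] = 2, 0
cost = 1.0 - np.eye(3)[noisy]  # zero cost for the (noisy) observed action

P = decode_segmentation(cost)
labels = P.argmax(axis=1)      # decoded segmentation
raw_labels = cost.argmin(axis=1)  # per-frame decision without the prior

print("label changes without prior:", int((np.diff(raw_labels) != 0).sum()))
print("label changes with prior:   ", int((np.diff(labels) != 0).sum()))
```

Note that no action order is supplied anywhere in this sketch: the temporal prior only compares the assignments of neighbouring frames, which mirrors the abstract's point that consistency does not depend on knowing the order of actions.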

Results

Task | Dataset | Metric | Value | Model
Action Localization | IKEA ASM | Accuracy | 34 | ASOT
Action Localization | IKEA ASM | F1 | 27.9 | ASOT
Action Localization | IKEA ASM | JSD | 88.7 | ASOT
Action Localization | IKEA ASM | Precision | 21.1 | ASOT
Action Localization | IKEA ASM | Recall | 24 | ASOT
Action Localization | Youtube INRIA Instructional | Acc | 52.9 | ASOT
Action Localization | Youtube INRIA Instructional | F1 | 35.1 | ASOT
Action Localization | Youtube INRIA Instructional | Precision | 47.6 | ASOT
Action Localization | Youtube INRIA Instructional | Recall | 27.8 | ASOT
Action Localization | Youtube INRIA Instructional | mIoU | 24.7 | ASOT
Action Localization | Breakfast | Acc | 56.1 | ASOT
Action Localization | Breakfast | F1 | 38.3 | ASOT
Action Localization | Breakfast | JSD | 94.9 | ASOT
Action Localization | Breakfast | Precision | 36.7 | ASOT
Action Localization | Breakfast | Recall | 40.1 | ASOT
Action Localization | Breakfast | mIoU | 18.6 | ASOT
Action Segmentation | IKEA ASM | Accuracy | 34 | ASOT
Action Segmentation | IKEA ASM | F1 | 27.9 | ASOT
Action Segmentation | IKEA ASM | JSD | 88.7 | ASOT
Action Segmentation | IKEA ASM | Precision | 21.1 | ASOT
Action Segmentation | IKEA ASM | Recall | 24 | ASOT
Action Segmentation | Youtube INRIA Instructional | Acc | 52.9 | ASOT
Action Segmentation | Youtube INRIA Instructional | F1 | 35.1 | ASOT
Action Segmentation | Youtube INRIA Instructional | Precision | 47.6 | ASOT
Action Segmentation | Youtube INRIA Instructional | Recall | 27.8 | ASOT
Action Segmentation | Youtube INRIA Instructional | mIoU | 24.7 | ASOT
Action Segmentation | Breakfast | Acc | 56.1 | ASOT
Action Segmentation | Breakfast | F1 | 38.3 | ASOT
Action Segmentation | Breakfast | JSD | 94.9 | ASOT
Action Segmentation | Breakfast | Precision | 36.7 | ASOT
Action Segmentation | Breakfast | Recall | 40.1 | ASOT
Action Segmentation | Breakfast | mIoU | 18.6 | ASOT

Related Papers

SeC: Advancing Complex Video Object Segmentation via Progressive Concept Construction (2025-07-21)
Deep Learning-Based Fetal Lung Segmentation from Diffusion-weighted MRI Images and Lung Maturity Evaluation for Fetal Growth Restriction (2025-07-17)
DiffOSeg: Omni Medical Image Segmentation via Multi-Expert Collaboration Diffusion Model (2025-07-17)
From Variability To Accuracy: Conditional Bernoulli Diffusion Models with Consensus-Driven Correction for Thin Structure Segmentation (2025-07-17)
Unleashing Vision Foundation Models for Coronary Artery Segmentation: Parallel ViT-CNN Encoding and Variational Fusion (2025-07-17)
SCORE: Scene Context Matters in Open-Vocabulary Remote Sensing Instance Segmentation (2025-07-17)
Unified Medical Image Segmentation with State Space Modeling Snake (2025-07-17)
A Privacy-Preserving Semantic-Segmentation Method Using Domain-Adaptation Technique (2025-07-17)