TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers/Multigrid Predictive Filter Flow for Unsupervised Learning...

Multigrid Predictive Filter Flow for Unsupervised Learning on Videos

Shu Kong, Charless Fowlkes

2019-04-02Optical Flow EstimationSkeleton Based Action RecognitionSemantic SegmentationPose TrackingVideo Semantic Segmentation
PaperPDFCodeCode

Abstract

We introduce multigrid Predictive Filter Flow (mgPFF), a framework for unsupervised learning on videos. The mgPFF takes as input a pair of frames and outputs per-pixel filters to warp one frame to the other. Compared to optical flow used for warping frames, mgPFF is more powerful in modeling sub-pixel movement and dealing with corruption (e.g., motion blur). We develop a multigrid coarse-to-fine modeling strategy that avoids the requirement of learning large filters to capture large displacement. This allows us to train an extremely compact model (4.6MB) which operates in a progressive way over multiple resolutions with shared weights. We train mgPFF on unsupervised, free-form videos and show that mgPFF is able to not only estimate long-range flow for frame reconstruction and detect video shot transitions, but also readily amendable for video object segmentation and pose tracking, where it substantially outperforms the published state-of-the-art without bells and whistles. Moreover, owing to mgPFF's nature of per-pixel filter prediction, we have the unique opportunity to visualize how each pixel is evolving during solving these tasks, thus gaining better interpretability.

Results

TaskDatasetMetricValueModel
VideoJHMDB Pose TrackingPCK@0.158.4mgPFF+ft 1st
VideoJHMDB Pose TrackingPCK@0.278.1mgPFF+ft 1st
VideoJHMDB Pose TrackingPCK@0.385.9mgPFF+ft 1st
VideoJHMDB Pose TrackingPCK@0.489.8mgPFF+ft 1st
VideoJHMDB Pose TrackingPCK@0.592.4mgPFF+ft 1st
Temporal Action LocalizationJHMDB Pose TrackingPCK@0.158.4mgPFF+ft 1st
Temporal Action LocalizationJHMDB Pose TrackingPCK@0.278.1mgPFF+ft 1st
Temporal Action LocalizationJHMDB Pose TrackingPCK@0.385.9mgPFF+ft 1st
Temporal Action LocalizationJHMDB Pose TrackingPCK@0.489.8mgPFF+ft 1st
Temporal Action LocalizationJHMDB Pose TrackingPCK@0.592.4mgPFF+ft 1st
Zero-Shot LearningJHMDB Pose TrackingPCK@0.158.4mgPFF+ft 1st
Zero-Shot LearningJHMDB Pose TrackingPCK@0.278.1mgPFF+ft 1st
Zero-Shot LearningJHMDB Pose TrackingPCK@0.385.9mgPFF+ft 1st
Zero-Shot LearningJHMDB Pose TrackingPCK@0.489.8mgPFF+ft 1st
Zero-Shot LearningJHMDB Pose TrackingPCK@0.592.4mgPFF+ft 1st
Activity RecognitionJHMDB Pose TrackingPCK@0.158.4mgPFF+ft 1st
Activity RecognitionJHMDB Pose TrackingPCK@0.278.1mgPFF+ft 1st
Activity RecognitionJHMDB Pose TrackingPCK@0.385.9mgPFF+ft 1st
Activity RecognitionJHMDB Pose TrackingPCK@0.489.8mgPFF+ft 1st
Activity RecognitionJHMDB Pose TrackingPCK@0.592.4mgPFF+ft 1st
Action LocalizationJHMDB Pose TrackingPCK@0.158.4mgPFF+ft 1st
Action LocalizationJHMDB Pose TrackingPCK@0.278.1mgPFF+ft 1st
Action LocalizationJHMDB Pose TrackingPCK@0.385.9mgPFF+ft 1st
Action LocalizationJHMDB Pose TrackingPCK@0.489.8mgPFF+ft 1st
Action LocalizationJHMDB Pose TrackingPCK@0.592.4mgPFF+ft 1st
Action DetectionJHMDB Pose TrackingPCK@0.158.4mgPFF+ft 1st
Action DetectionJHMDB Pose TrackingPCK@0.278.1mgPFF+ft 1st
Action DetectionJHMDB Pose TrackingPCK@0.385.9mgPFF+ft 1st
Action DetectionJHMDB Pose TrackingPCK@0.489.8mgPFF+ft 1st
Action DetectionJHMDB Pose TrackingPCK@0.592.4mgPFF+ft 1st
3D Action RecognitionJHMDB Pose TrackingPCK@0.158.4mgPFF+ft 1st
3D Action RecognitionJHMDB Pose TrackingPCK@0.278.1mgPFF+ft 1st
3D Action RecognitionJHMDB Pose TrackingPCK@0.385.9mgPFF+ft 1st
3D Action RecognitionJHMDB Pose TrackingPCK@0.489.8mgPFF+ft 1st
3D Action RecognitionJHMDB Pose TrackingPCK@0.592.4mgPFF+ft 1st
Action RecognitionJHMDB Pose TrackingPCK@0.158.4mgPFF+ft 1st
Action RecognitionJHMDB Pose TrackingPCK@0.278.1mgPFF+ft 1st
Action RecognitionJHMDB Pose TrackingPCK@0.385.9mgPFF+ft 1st
Action RecognitionJHMDB Pose TrackingPCK@0.489.8mgPFF+ft 1st
Action RecognitionJHMDB Pose TrackingPCK@0.592.4mgPFF+ft 1st

Related Papers

SeC: Advancing Complex Video Object Segmentation via Progressive Concept Construction2025-07-21Channel-wise Motion Features for Efficient Motion Segmentation2025-07-17DiffOSeg: Omni Medical Image Segmentation via Multi-Expert Collaboration Diffusion Model2025-07-17SCORE: Scene Context Matters in Open-Vocabulary Remote Sensing Instance Segmentation2025-07-17Unified Medical Image Segmentation with State Space Modeling Snake2025-07-17A Privacy-Preserving Semantic-Segmentation Method Using Domain-Adaptation Technique2025-07-17From Neck to Head: Bio-Impedance Sensing for Head Pose Estimation2025-07-17SAMST: A Transformer framework based on SAM pseudo label filtering for remote sensing semi-supervised semantic segmentation2025-07-16