TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers/Unsupervised Flow-Aligned Sequence-to-Sequence Learning fo...

Unsupervised Flow-Aligned Sequence-to-Sequence Learning for Video Restoration

Jing Lin, Xiaowan Hu, Yuanhao Cai, Haoqian Wang, Youliang Yan, Xueyi Zou, Yulun Zhang, Luc van Gool

2022-05-20Super-ResolutionDeblurringOptical Flow EstimationVideo Super-ResolutionVideo DeblurringVideo EnhancementVideo Restoration
PaperPDFCode(official)

Abstract

How to properly model the inter-frame relation within the video sequence is an important but unsolved challenge for video restoration (VR). In this work, we propose an unsupervised flow-aligned sequence-to-sequence model (S2SVR) to address this problem. On the one hand, the sequence-to-sequence model, which has proven capable of sequence modeling in the field of natural language processing, is explored for the first time in VR. Optimized serialization modeling shows potential in capturing long-range dependencies among frames. On the other hand, we equip the sequence-to-sequence model with an unsupervised optical flow estimator to maximize its potential. The flow estimator is trained with our proposed unsupervised distillation loss, which can alleviate the data discrepancy and inaccurate degraded optical flow issues of previous flow-based methods. With reliable optical flow, we can establish accurate correspondence among multiple frames, narrowing the domain difference between 1D language and 2D misaligned frames and improving the potential of the sequence-to-sequence model. S2SVR shows superior performance in multiple VR tasks, including video deblurring, video super-resolution, and compressed video quality enhancement. Code and models are publicly available at https://github.com/linjing7/VR-Baseline

Results

TaskDatasetMetricValueModel
DeblurringGoProPSNR31.82S2SVR
DeblurringGoProSSIM0.923S2SVR
Super-ResolutionVimeo90KPSNR37.63S2SVR
3D Human Pose EstimationVimeo90KPSNR37.63S2SVR
VideoVimeo90KPSNR37.63S2SVR
Pose EstimationVimeo90KPSNR37.63S2SVR
3DVimeo90KPSNR37.63S2SVR
3D Face AnimationVimeo90KPSNR37.63S2SVR
Video EnhancementMFQE v2Incremental PSNR0.93S2SVR
2D Human Pose EstimationVimeo90KPSNR37.63S2SVR
3D Absolute Human Pose EstimationVimeo90KPSNR37.63S2SVR
2D ClassificationGoProPSNR31.82S2SVR
2D ClassificationGoProSSIM0.923S2SVR
Video Super-ResolutionVimeo90KPSNR37.63S2SVR
10-shot image generationGoProPSNR31.82S2SVR
10-shot image generationGoProSSIM0.923S2SVR
3D Object Super-ResolutionVimeo90KPSNR37.63S2SVR
1 Image, 2*2 StitchiVimeo90KPSNR37.63S2SVR
Blind Image DeblurringGoProPSNR31.82S2SVR
Blind Image DeblurringGoProSSIM0.923S2SVR

Related Papers

SpectraLift: Physics-Guided Spectral-Inversion Network for Self-Supervised Hyperspectral Image Super-Resolution2025-07-17Channel-wise Motion Features for Efficient Motion Segmentation2025-07-17IM-LUT: Interpolation Mixing Look-Up Tables for Image Super-Resolution2025-07-14PanoDiff-SR: Synthesizing Dental Panoramic Radiographs using Diffusion and Super-resolution2025-07-12Generative Latent Kernel Modeling for Blind Motion Deblurring2025-07-12An Efficient Approach for Muscle Segmentation and 3D Reconstruction Using Keypoint Tracking in MRI Scan2025-07-11Unsupervised Methods for Video Quality Improvement: A Survey of Restoration and Enhancement Techniques2025-07-11HNOSeg-XS: Extremely Small Hartley Neural Operator for Efficient and Resolution-Robust 3D Image Segmentation2025-07-10