TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers/BiFormer: Learning Bilateral Motion Estimation via Bilater...

BiFormer: Learning Bilateral Motion Estimation via Bilateral Transformer for 4K Video Frame Interpolation

Junheum Park, Jintae Kim, Chang-Su Kim

2023-04-05CVPR 2023 6Motion Estimation4kVideo Frame Interpolation
PaperPDFCode(official)

Abstract

A novel 4K video frame interpolator based on bilateral transformer (BiFormer) is proposed in this paper, which performs three steps: global motion estimation, local motion refinement, and frame synthesis. First, in global motion estimation, we predict symmetric bilateral motion fields at a coarse scale. To this end, we propose BiFormer, the first transformer-based bilateral motion estimator. Second, we refine the global motion fields efficiently using blockwise bilateral cost volumes (BBCVs). Third, we warp the input frames using the refined motion fields and blend them to synthesize an intermediate frame. Extensive experiments demonstrate that the proposed BiFormer algorithm achieves excellent interpolation performance on 4K datasets. The source codes are available at https://github.com/JunHeum/BiFormer.

Results

TaskDatasetMetricValueModel
VideoX4K1000FPSPSNR31.32BiFormer
VideoX4K1000FPSSSIM0.9212BiFormer
Video Frame InterpolationX4K1000FPSPSNR31.32BiFormer
Video Frame InterpolationX4K1000FPSSSIM0.9212BiFormer

Related Papers

DINO-VO: A Feature-based Visual Odometry Leveraging a Visual Foundation Model2025-07-17Dynamic Parameter Memory: Temporary LoRA-Enhanced LLM for Long-Sequence Emotion Recognition in Conversation2025-07-11HiM2SAM: Enhancing SAM2 with Hierarchical Motion Estimation and Memory Optimization towards Long-term Tracking2025-07-104KAgent: Agentic Any Image to 4K Super-Resolution2025-07-09TLB-VFI: Temporal-Aware Latent Brownian Bridge Diffusion for Video Frame Interpolation2025-07-07AUTOMATIC ROOM LIGHT CONTROLLER MANAGEMENT SYSTEM.2025-06-25EndoMUST: Monocular Depth Estimation for Robotic Endoscopy via End-to-end Multi-step Self-supervised Training2025-06-19Uncertainty-Driven Radar-Inertial Fusion for Instantaneous 3D Ego-Velocity Estimation2025-06-17