TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers/Boosting Flow-based Generative Super-Resolution Models via...

Boosting Flow-based Generative Super-Resolution Models via Learned Prior

Li-Yuan Tsao, Yi-Chen Lo, Chia-Che Chang, Hao-Wei Chen, Roy Tseng, Chien Feng, Chun-Yi Lee

2024-03-16CVPR 2024 1Super-ResolutionImage Super-ResolutionImage Generation
PaperPDFCode(official)

Abstract

Flow-based super-resolution (SR) models have demonstrated astonishing capabilities in generating high-quality images. However, these methods encounter several challenges during image generation, such as grid artifacts, exploding inverses, and suboptimal results due to a fixed sampling temperature. To overcome these issues, this work introduces a conditional learned prior to the inference phase of a flow-based SR model. This prior is a latent code predicted by our proposed latent module conditioned on the low-resolution image, which is then transformed by the flow model into an SR image. Our framework is designed to seamlessly integrate with any contemporary flow-based SR model without modifying its architecture or pre-trained weights. We evaluate the effectiveness of our proposed framework through extensive experiments and ablation analyses. The proposed framework successfully addresses all the inherent issues in flow-based SR models and enhances their performance in various SR scenarios. Our code is available at: https://github.com/liyuantsao/BFSR

Results

TaskDatasetMetricValueModel
Super-ResolutionDIV2K val - 4x upscalingLPIPS0.105LINF-LP
Super-ResolutionDIV2K val - 4x upscalingLRPSNR47.3LINF-LP
Super-ResolutionDIV2K val - 4x upscalingPSNR28LINF-LP
Super-ResolutionDIV2K val - 4x upscalingSSIM0.78LINF-LP
Super-ResolutionDIV2K val - 4x upscalingLPIPS0.109SRFlow-LP
Super-ResolutionDIV2K val - 4x upscalingLRPSNR51.51SRFlow-LP
Super-ResolutionDIV2K val - 4x upscalingPSNR27.51SRFlow-LP
Super-ResolutionDIV2K val - 4x upscalingSSIM0.78SRFlow-LP
Image Super-ResolutionDIV2K val - 4x upscalingLPIPS0.105LINF-LP
Image Super-ResolutionDIV2K val - 4x upscalingLRPSNR47.3LINF-LP
Image Super-ResolutionDIV2K val - 4x upscalingPSNR28LINF-LP
Image Super-ResolutionDIV2K val - 4x upscalingSSIM0.78LINF-LP
Image Super-ResolutionDIV2K val - 4x upscalingLPIPS0.109SRFlow-LP
Image Super-ResolutionDIV2K val - 4x upscalingLRPSNR51.51SRFlow-LP
Image Super-ResolutionDIV2K val - 4x upscalingPSNR27.51SRFlow-LP
Image Super-ResolutionDIV2K val - 4x upscalingSSIM0.78SRFlow-LP
3D Object Super-ResolutionDIV2K val - 4x upscalingLPIPS0.105LINF-LP
3D Object Super-ResolutionDIV2K val - 4x upscalingLRPSNR47.3LINF-LP
3D Object Super-ResolutionDIV2K val - 4x upscalingPSNR28LINF-LP
3D Object Super-ResolutionDIV2K val - 4x upscalingSSIM0.78LINF-LP
3D Object Super-ResolutionDIV2K val - 4x upscalingLPIPS0.109SRFlow-LP
3D Object Super-ResolutionDIV2K val - 4x upscalingLRPSNR51.51SRFlow-LP
3D Object Super-ResolutionDIV2K val - 4x upscalingPSNR27.51SRFlow-LP
3D Object Super-ResolutionDIV2K val - 4x upscalingSSIM0.78SRFlow-LP
16kDIV2K val - 4x upscalingLPIPS0.105LINF-LP
16kDIV2K val - 4x upscalingLRPSNR47.3LINF-LP
16kDIV2K val - 4x upscalingPSNR28LINF-LP
16kDIV2K val - 4x upscalingSSIM0.78LINF-LP
16kDIV2K val - 4x upscalingLPIPS0.109SRFlow-LP
16kDIV2K val - 4x upscalingLRPSNR51.51SRFlow-LP
16kDIV2K val - 4x upscalingPSNR27.51SRFlow-LP
16kDIV2K val - 4x upscalingSSIM0.78SRFlow-LP

Related Papers

SpectraLift: Physics-Guided Spectral-Inversion Network for Self-Supervised Hyperspectral Image Super-Resolution2025-07-17fastWDM3D: Fast and Accurate 3D Healthy Tissue Inpainting2025-07-17Synthesizing Reality: Leveraging the Generative AI-Powered Platform Midjourney for Construction Worker Detection2025-07-17FashionPose: Text to Pose to Relight Image Generation for Personalized Fashion Visualization2025-07-17A Distributed Generative AI Approach for Heterogeneous Multi-Domain Environments under Data Sharing constraints2025-07-17Pixel Perfect MegaMed: A Megapixel-Scale Vision-Language Foundation Model for Generating High Resolution Medical Images2025-07-17FADE: Adversarial Concept Erasure in Flow Models2025-07-16CharaConsist: Fine-Grained Consistent Character Generation2025-07-15