TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers/MUSIQ: Multi-scale Image Quality Transformer

MUSIQ: Multi-scale Image Quality Transformer

Junjie Ke, Qifei Wang, Yilin Wang, Peyman Milanfar, Feng Yang

2021-08-12ICCV 2021 10Video Quality AssessmentImage Quality Assessment
PaperPDFCode(official)Code

Abstract

Image quality assessment (IQA) is an important research topic for understanding and improving visual experience. The current state-of-the-art IQA methods are based on convolutional neural networks (CNNs). The performance of CNN-based models is often compromised by the fixed shape constraint in batch training. To accommodate this, the input images are usually resized and cropped to a fixed shape, causing image quality degradation. To address this, we design a multi-scale image quality Transformer (MUSIQ) to process native resolution images with varying sizes and aspect ratios. With a multi-scale image representation, our proposed method can capture image quality at different granularities. Furthermore, a novel hash-based 2D spatial embedding and a scale embedding is proposed to support the positional embedding in the multi-scale representation. Experimental results verify that our method can achieve state-of-the-art performance on multiple large scale IQA datasets such as PaQ-2-PiQ, SPAQ and KonIQ-10k.

Results

TaskDatasetMetricValueModel
Video UnderstandingMSU NR VQA DatabaseKLCC0.7433MUSIQ
Video UnderstandingMSU NR VQA DatabasePLCC0.9068MUSIQ
Video UnderstandingMSU NR VQA DatabaseSRCC0.9004MUSIQ
Video UnderstandingMSU SR-QA DatasetKLCC0.55312MUSIQ trained on PaQ-2-PiQ
Video UnderstandingMSU SR-QA DatasetPLCC0.66531MUSIQ trained on PaQ-2-PiQ
Video UnderstandingMSU SR-QA DatasetSROCC0.67746MUSIQ trained on PaQ-2-PiQ
Video UnderstandingMSU SR-QA DatasetKLCC0.52673MUSIQ trained on SPAQ
Video UnderstandingMSU SR-QA DatasetPLCC0.60216MUSIQ trained on SPAQ
Video UnderstandingMSU SR-QA DatasetSROCC0.64927MUSIQ trained on SPAQ
Video UnderstandingMSU SR-QA DatasetKLCC0.51897MUSIQ trained on KONIQ
Video UnderstandingMSU SR-QA DatasetPLCC0.59151MUSIQ trained on KONIQ
Video UnderstandingMSU SR-QA DatasetSROCC0.64589MUSIQ trained on KONIQ
Video UnderstandingMSU SR-QA DatasetKLCC0.44669MUSIQ trained on AVA
Video UnderstandingMSU SR-QA DatasetPLCC0.52404MUSIQ trained on AVA
Video UnderstandingMSU SR-QA DatasetSROCC0.56152MUSIQ trained on AVA
Video Quality AssessmentMSU NR VQA DatabaseKLCC0.7433MUSIQ
Video Quality AssessmentMSU NR VQA DatabasePLCC0.9068MUSIQ
Video Quality AssessmentMSU NR VQA DatabaseSRCC0.9004MUSIQ
Video Quality AssessmentMSU SR-QA DatasetKLCC0.55312MUSIQ trained on PaQ-2-PiQ
Video Quality AssessmentMSU SR-QA DatasetPLCC0.66531MUSIQ trained on PaQ-2-PiQ
Video Quality AssessmentMSU SR-QA DatasetSROCC0.67746MUSIQ trained on PaQ-2-PiQ
Video Quality AssessmentMSU SR-QA DatasetKLCC0.52673MUSIQ trained on SPAQ
Video Quality AssessmentMSU SR-QA DatasetPLCC0.60216MUSIQ trained on SPAQ
Video Quality AssessmentMSU SR-QA DatasetSROCC0.64927MUSIQ trained on SPAQ
Video Quality AssessmentMSU SR-QA DatasetKLCC0.51897MUSIQ trained on KONIQ
Video Quality AssessmentMSU SR-QA DatasetPLCC0.59151MUSIQ trained on KONIQ
Video Quality AssessmentMSU SR-QA DatasetSROCC0.64589MUSIQ trained on KONIQ
Video Quality AssessmentMSU SR-QA DatasetKLCC0.44669MUSIQ trained on AVA
Video Quality AssessmentMSU SR-QA DatasetPLCC0.52404MUSIQ trained on AVA
Video Quality AssessmentMSU SR-QA DatasetSROCC0.56152MUSIQ trained on AVA
Image Quality AssessmentMSU NR VQA DatabaseKLCC0.7433MUSIQ
Image Quality AssessmentMSU NR VQA DatabasePLCC0.9068MUSIQ
Image Quality AssessmentMSU NR VQA DatabaseSRCC0.9004MUSIQ
VideoMSU NR VQA DatabaseKLCC0.7433MUSIQ
VideoMSU NR VQA DatabasePLCC0.9068MUSIQ
VideoMSU NR VQA DatabaseSRCC0.9004MUSIQ
VideoMSU SR-QA DatasetKLCC0.55312MUSIQ trained on PaQ-2-PiQ
VideoMSU SR-QA DatasetPLCC0.66531MUSIQ trained on PaQ-2-PiQ
VideoMSU SR-QA DatasetSROCC0.67746MUSIQ trained on PaQ-2-PiQ
VideoMSU SR-QA DatasetKLCC0.52673MUSIQ trained on SPAQ
VideoMSU SR-QA DatasetPLCC0.60216MUSIQ trained on SPAQ
VideoMSU SR-QA DatasetSROCC0.64927MUSIQ trained on SPAQ
VideoMSU SR-QA DatasetKLCC0.51897MUSIQ trained on KONIQ
VideoMSU SR-QA DatasetPLCC0.59151MUSIQ trained on KONIQ
VideoMSU SR-QA DatasetSROCC0.64589MUSIQ trained on KONIQ
VideoMSU SR-QA DatasetKLCC0.44669MUSIQ trained on AVA
VideoMSU SR-QA DatasetPLCC0.52404MUSIQ trained on AVA
VideoMSU SR-QA DatasetSROCC0.56152MUSIQ trained on AVA

Related Papers

Visual-Language Model Knowledge Distillation Method for Image Quality Assessment2025-07-21DeQA-Doc: Adapting DeQA-Score to Document Image Quality Assessment2025-07-17Text-Visual Semantic Constrained AI-Generated Image Quality Assessment2025-07-144KAgent: Agentic Any Image to 4K Super-Resolution2025-07-09Bridging Video Quality Scoring and Justification via Large Multimodal Models2025-06-26FundaQ-8: A Clinically-Inspired Scoring Framework for Automated Fundus Image Quality Assessment2025-06-25MS-IQA: A Multi-Scale Feature Fusion Network for PET/CT Image Quality Assessment2025-06-25Enhanced Dermatology Image Quality Assessment via Cross-Domain Training2025-06-19