TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers/Blindly Assess Quality of In-the-Wild Videos via Quality-a...

Blindly Assess Quality of In-the-Wild Videos via Quality-aware Pre-training and Motion Perception

Bowen Li, Weixia Zhang, Meng Tian, Guangtao Zhai, Xianpei Wang

2021-08-19Transfer LearningVideo Quality AssessmentImage Quality AssessmentAction RecognitionVisual Question Answering (VQA)
PaperPDFCodeCode(official)

Abstract

Perceptual quality assessment of the videos acquired in the wilds is of vital importance for quality assurance of video services. The inaccessibility of reference videos with pristine quality and the complexity of authentic distortions pose great challenges for this kind of blind video quality assessment (BVQA) task. Although model-based transfer learning is an effective and efficient paradigm for the BVQA task, it remains to be a challenge to explore what and how to bridge the domain shifts for better video representation. In this work, we propose to transfer knowledge from image quality assessment (IQA) databases with authentic distortions and large-scale action recognition with rich motion patterns. We rely on both groups of data to learn the feature extractor. We train the proposed model on the target VQA databases using a mixed list-wise ranking loss function. Extensive experiments on six databases demonstrate that our method performs very competitively under both individual database and mixed database training settings. We also verify the rationality of each component of the proposed method and explore a simple manner for further improvement.

Results

TaskDatasetMetricValueModel
Video UnderstandingMSU NR VQA DatabaseKLCC0.764LI
Video UnderstandingMSU NR VQA DatabasePLCC0.927LI
Video UnderstandingMSU NR VQA DatabaseSRCC0.9131LI
Video UnderstandingLIVE-VQCPLCC0.839BVQA-2022
Video UnderstandingYouTube-UGCPLCC0.8178BVQA-2022
Video UnderstandingKoNViD-1kPLCC0.834BVQA-2022
Video UnderstandingLIVE-FB LSVQPLCC0.854BVQA-2022
Video Quality AssessmentMSU NR VQA DatabaseKLCC0.764LI
Video Quality AssessmentMSU NR VQA DatabasePLCC0.927LI
Video Quality AssessmentMSU NR VQA DatabaseSRCC0.9131LI
Video Quality AssessmentLIVE-VQCPLCC0.839BVQA-2022
Video Quality AssessmentYouTube-UGCPLCC0.8178BVQA-2022
Video Quality AssessmentKoNViD-1kPLCC0.834BVQA-2022
Video Quality AssessmentLIVE-FB LSVQPLCC0.854BVQA-2022
VideoMSU NR VQA DatabaseKLCC0.764LI
VideoMSU NR VQA DatabasePLCC0.927LI
VideoMSU NR VQA DatabaseSRCC0.9131LI
VideoLIVE-VQCPLCC0.839BVQA-2022
VideoYouTube-UGCPLCC0.8178BVQA-2022
VideoKoNViD-1kPLCC0.834BVQA-2022
VideoLIVE-FB LSVQPLCC0.854BVQA-2022

Related Papers

Visual-Language Model Knowledge Distillation Method for Image Quality Assessment2025-07-21RaMen: Multi-Strategy Multi-Modal Learning for Bundle Construction2025-07-18Disentangling coincident cell events using deep transfer learning and compressive sensing2025-07-17DeQA-Doc: Adapting DeQA-Score to Document Image Quality Assessment2025-07-17A Real-Time System for Egocentric Hand-Object Interaction Detection in Industrial Domains2025-07-17VisionThink: Smart and Efficient Vision Language Model via Reinforcement Learning2025-07-17Best Practices for Large-Scale, Pixel-Wise Crop Mapping and Transfer Learning Workflows2025-07-16MGFFD-VLM: Multi-Granularity Prompt Learning for Face Forgery Detection with VLM2025-07-16