
Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.


Deep Learning based Full-reference and No-reference Quality Assessment Models for Compressed UGC Videos

Wei Sun, Tao Wang, Xiongkuo Min, Fuwang Yi, Guangtao Zhai

2021-06-02 · regression · Video Quality Assessment

Abstract

In this paper, we propose a deep learning based video quality assessment (VQA) framework to evaluate the quality of compressed user-generated content (UGC) videos. The proposed VQA framework consists of three modules: the feature extraction module, the quality regression module, and the quality pooling module. For the feature extraction module, we fuse the features from intermediate layers of a convolutional neural network (CNN) into the final quality-aware feature representation, which enables the model to make full use of visual information from low-level to high-level. Specifically, the structure and texture similarities of the feature maps extracted from all intermediate layers are calculated as the feature representation for the full-reference (FR) VQA model, and the global mean and standard deviation of the final feature maps, fused from the intermediate feature maps, are calculated as the feature representation for the no-reference (NR) VQA model. For the quality regression module, we use a fully connected (FC) layer to regress the quality-aware features into frame-level scores. Finally, a subjectively-inspired temporal pooling strategy is adopted to pool the frame-level scores into a video-level score. The proposed model achieves the best performance among the state-of-the-art FR and NR VQA models on the Compressed UGC VQA database and also performs well on the in-the-wild UGC VQA databases.
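
As a rough illustration of the NR feature representation described in the abstract (global mean and standard deviation of the fused feature maps), the sketch below pools each channel of a feature map into two numbers. It is not the authors' implementation: the function name `global_mean_std`, the nested-list layout (channels, then rows, then columns), and the use of the population standard deviation are all assumptions made for the example.

```python
from statistics import fmean, pstdev


def global_mean_std(feature_maps):
    """Pool a C x H x W feature map (nested lists) into a 2C-dim vector.

    Hypothetical helper: returns the per-channel global means followed by
    the per-channel global (population) standard deviations.
    """
    means, stds = [], []
    for channel in feature_maps:
        # Flatten the H x W spatial grid of this channel.
        flat = [value for row in channel for value in row]
        means.append(fmean(flat))
        stds.append(pstdev(flat))
    return means + stds


# Toy input: two channels, each 2 x 2.
toy_maps = [
    [[1.0, 2.0], [3.0, 4.0]],
    [[5.0, 5.0], [5.0, 5.0]],
]
features = global_mean_std(toy_maps)
print(features)
```

In a full model, a vector like `features` would be what the fully connected layer regresses to a frame-level score. The CNN backbone and the fusion of intermediate layers are outside the scope of this sketch.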

Results

Task                      Dataset              Metric  Value   Model
Video Understanding       MSU NR VQA Database  KLCC    0.7037  GVSP-UGCVQA-NR (single_scale)
Video Understanding       MSU NR VQA Database  PLCC    0.8933  GVSP-UGCVQA-NR (single_scale)
Video Understanding       MSU NR VQA Database  SRCC    0.8742  GVSP-UGCVQA-NR (single_scale)
Video Understanding       MSU NR VQA Database  KLCC    0.6942  GVSP-UGCVQA-NR (multi_scale)
Video Understanding       MSU NR VQA Database  PLCC    0.8851  GVSP-UGCVQA-NR (multi_scale)
Video Understanding       MSU NR VQA Database  SRCC    0.8673  GVSP-UGCVQA-NR (multi_scale)
Video Understanding       MSU FR VQA Database  KLCC    0.695   FR GVSP-UGCVQA (single scale)
Video Understanding       MSU FR VQA Database  PLCC    0.893   FR GVSP-UGCVQA (single scale)
Video Understanding       MSU FR VQA Database  SRCC    0.864   FR GVSP-UGCVQA (single scale)
Video Quality Assessment  MSU NR VQA Database  KLCC    0.7037  GVSP-UGCVQA-NR (single_scale)
Video Quality Assessment  MSU NR VQA Database  PLCC    0.8933  GVSP-UGCVQA-NR (single_scale)
Video Quality Assessment  MSU NR VQA Database  SRCC    0.8742  GVSP-UGCVQA-NR (single_scale)
Video Quality Assessment  MSU NR VQA Database  KLCC    0.6942  GVSP-UGCVQA-NR (multi_scale)
Video Quality Assessment  MSU NR VQA Database  PLCC    0.8851  GVSP-UGCVQA-NR (multi_scale)
Video Quality Assessment  MSU NR VQA Database  SRCC    0.8673  GVSP-UGCVQA-NR (multi_scale)
Video Quality Assessment  MSU FR VQA Database  KLCC    0.695   FR GVSP-UGCVQA (single scale)
Video Quality Assessment  MSU FR VQA Database  PLCC    0.893   FR GVSP-UGCVQA (single scale)
Video Quality Assessment  MSU FR VQA Database  SRCC    0.864   FR GVSP-UGCVQA (single scale)
Video                     MSU NR VQA Database  KLCC    0.7037  GVSP-UGCVQA-NR (single_scale)
Video                     MSU NR VQA Database  PLCC    0.8933  GVSP-UGCVQA-NR (single_scale)
Video                     MSU NR VQA Database  SRCC    0.8742  GVSP-UGCVQA-NR (single_scale)
Video                     MSU NR VQA Database  KLCC    0.6942  GVSP-UGCVQA-NR (multi_scale)
Video                     MSU NR VQA Database  PLCC    0.8851  GVSP-UGCVQA-NR (multi_scale)
Video                     MSU NR VQA Database  SRCC    0.8673  GVSP-UGCVQA-NR (multi_scale)
Video                     MSU FR VQA Database  KLCC    0.695   FR GVSP-UGCVQA (single scale)
Video                     MSU FR VQA Database  PLCC    0.893   FR GVSP-UGCVQA (single scale)
Video                     MSU FR VQA Database  SRCC    0.864   FR GVSP-UGCVQA (single scale)
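
The metrics in the results are correlation coefficients between predicted and subjective quality scores. The sketch below shows how such values could be computed in plain Python, assuming PLCC is the Pearson linear correlation, SRCC is the Spearman rank correlation, and KLCC denotes the Kendall rank correlation (this reading of "KLCC" is an assumption; the page does not define it). The function names and toy scores are hypothetical, the Kendall variant shown is tau-a, and any nonlinear mapping that a benchmark may apply before computing PLCC is not included.

```python
from itertools import combinations
from math import sqrt


def pearson(x, y):
    """Pearson linear correlation coefficient (PLCC)."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    cov = sum((a - mx) * (b - my) for a, b in zip(x, y))
    vx = sum((a - mx) ** 2 for a in x)
    vy = sum((b - my) ** 2 for b in y)
    return cov / sqrt(vx * vy)


def average_ranks(values):
    """1-based ranks; tied values share the average of their positions."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        avg = (i + j) / 2 + 1
        for k in range(i, j + 1):
            ranks[order[k]] = avg
        i = j + 1
    return ranks


def spearman(x, y):
    """Spearman rank correlation (SRCC): Pearson correlation of the ranks."""
    return pearson(average_ranks(x), average_ranks(y))


def kendall_tau_a(x, y):
    """Kendall tau-a: (concordant - discordant) / total number of pairs."""
    concordant = discordant = 0
    for i, j in combinations(range(len(x)), 2):
        s = (x[i] - x[j]) * (y[i] - y[j])
        if s > 0:
            concordant += 1
        elif s < 0:
            discordant += 1
    return (concordant - discordant) / (len(x) * (len(x) - 1) // 2)


# Hypothetical predicted scores and subjective scores for five videos.
predicted = [1.0, 2.0, 3.0, 4.0, 5.0]
subjective = [1.0, 2.0, 3.0, 5.0, 4.0]
print(pearson(predicted, subjective))        # close to 0.9
print(spearman(predicted, subjective))       # close to 0.9
print(kendall_tau_a(predicted, subjective))  # 0.8
```

With one swapped pair out of ten, the Kendall value drops further than the Spearman value, which is consistent with KLCC being the lowest of the three metrics in every row above.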

Related Papers

Language Integration in Fine-Tuning Multimodal Large Language Models for Image-Based Regression (2025-07-20)
Neural Network-Guided Symbolic Regression for Interpretable Descriptor Discovery in Perovskite Catalysts (2025-07-16)
Imbalanced Regression Pipeline Recommendation (2025-07-16)
Second-Order Bounds for [0,1]-Valued Regression via Betting Loss (2025-07-16)
Sparse Regression Codes exploit Multi-User Diversity without CSI (2025-07-15)
Bradley-Terry and Multi-Objective Reward Modeling Are Complementary (2025-07-10)
Active Learning for Manifold Gaussian Process Regression (2025-06-26)
Bridging Video Quality Scoring and Justification via Large Multimodal Models (2025-06-26)