TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers/Shift-tolerant Perceptual Similarity Metric

Shift-tolerant Perceptual Similarity Metric

Abhijay Ghildyal, Feng Liu

2022-07-27Video Quality Assessment
PaperPDFCode(official)Code

Abstract

Existing perceptual similarity metrics assume an image and its reference are well aligned. As a result, these metrics are often sensitive to a small alignment error that is imperceptible to the human eyes. This paper studies the effect of small misalignment, specifically a small shift between the input and reference image, on existing metrics, and accordingly develops a shift-tolerant similarity metric. This paper builds upon LPIPS, a widely used learned perceptual similarity metric, and explores architectural design considerations to make it robust against imperceptible misalignment. Specifically, we study a wide spectrum of neural network elements, such as anti-aliasing filtering, pooling, striding, padding, and skip connection, and discuss their roles in making a robust metric. Based on our studies, we develop a new deep neural network-based perceptual similarity metric. Our experiments show that our metric is tolerant to imperceptible shifts while being consistent with the human similarity judgment.

Results

TaskDatasetMetricValueModel
Video UnderstandingMSU SR-QA DatasetKLCC0.45898ST-LPIPS (VGG)
Video UnderstandingMSU SR-QA DatasetPLCC0.56431ST-LPIPS (VGG)
Video UnderstandingMSU SR-QA DatasetSROCC0.57336ST-LPIPS (VGG)
Video UnderstandingMSU SR-QA DatasetKLCC0.42897ST-LPIPS (Alex)
Video UnderstandingMSU SR-QA DatasetPLCC0.5474ST-LPIPS (Alex)
Video UnderstandingMSU SR-QA DatasetSROCC0.53473ST-LPIPS (Alex)
Video Quality AssessmentMSU SR-QA DatasetKLCC0.45898ST-LPIPS (VGG)
Video Quality AssessmentMSU SR-QA DatasetPLCC0.56431ST-LPIPS (VGG)
Video Quality AssessmentMSU SR-QA DatasetSROCC0.57336ST-LPIPS (VGG)
Video Quality AssessmentMSU SR-QA DatasetKLCC0.42897ST-LPIPS (Alex)
Video Quality AssessmentMSU SR-QA DatasetPLCC0.5474ST-LPIPS (Alex)
Video Quality AssessmentMSU SR-QA DatasetSROCC0.53473ST-LPIPS (Alex)
VideoMSU SR-QA DatasetKLCC0.45898ST-LPIPS (VGG)
VideoMSU SR-QA DatasetPLCC0.56431ST-LPIPS (VGG)
VideoMSU SR-QA DatasetSROCC0.57336ST-LPIPS (VGG)
VideoMSU SR-QA DatasetKLCC0.42897ST-LPIPS (Alex)
VideoMSU SR-QA DatasetPLCC0.5474ST-LPIPS (Alex)
VideoMSU SR-QA DatasetSROCC0.53473ST-LPIPS (Alex)

Related Papers

Bridging Video Quality Scoring and Justification via Large Multimodal Models2025-06-26EyeSim-VQA: A Free-Energy-Guided Eye Simulation Framework for Video Quality Assessment2025-06-13TDVE-Assessor: Benchmarking and Evaluating the Quality of Text-Driven Video Editing with LMMs2025-05-26NTIRE 2025 Challenge on Video Quality Enhancement for Video Conferencing: Datasets, Methods and Results2025-05-25CP-LLM: Context and Pixel Aware Large Language Model for Video Quality Assessment2025-05-21Semantically-Aware Game Image Quality Assessment2025-05-16Breaking Annotation Barriers: Generalized Video Quality Assessment via Ranking-based Self-Supervision2025-05-06DiffVQA: Video Quality Assessment Using Diffusion Feature Extractor2025-05-06