Exploring Video Quality Assessment on User Generated Contents from Aesthetic and Technical Perspectives

HaoNing Wu, Erli Zhang, Liang Liao, Chaofeng Chen, Jingwen Hou, Annan Wang, Wenxiu Sun, Qiong Yan, Weisi Lin

2022-11-09 | ICCV 2023 | Disentanglement, Video Quality Assessment, Visual Question Answering (VQA), Video Generation


Abstract

The rapid increase in user-generated-content (UGC) videos calls for the development of effective video quality assessment (VQA) algorithms. However, the objective of the UGC-VQA problem is still ambiguous and can be viewed from two perspectives: the technical perspective, which measures the perception of distortions, and the aesthetic perspective, which relates to preference for and recommendation of content. To understand how these two perspectives affect overall subjective opinions in UGC-VQA, we conduct a large-scale subjective study to collect human opinions on the overall quality of videos as well as perceptions from the aesthetic and technical perspectives. The collected Disentangled Video Quality Database (DIVIDE-3k) confirms that human quality opinions on UGC videos are universally and inevitably affected by both aesthetic and technical perspectives. In light of this, we propose the Disentangled Objective Video Quality Evaluator (DOVER) to learn the quality of UGC videos based on the two perspectives. DOVER achieves state-of-the-art performance in UGC-VQA with very high efficiency. With the perspective opinions in DIVIDE-3k, we further propose DOVER++, the first approach to provide reliable, clear-cut quality evaluations from a single aesthetic or technical perspective. Code is available at https://github.com/VQAssessment/DOVER.

Results

Task	Dataset	Metric	Value	Model
Video Quality Assessment	MSU NR VQA Database	KLCC	0.7216	DOVER
Video Quality Assessment	MSU NR VQA Database	PLCC	0.9099	DOVER
Video Quality Assessment	MSU NR VQA Database	SRCC	0.8871	DOVER
Video Quality Assessment	LIVE-VQC	PLCC	0.874	DOVER (end-to-end)
Video Quality Assessment	LIVE-VQC	PLCC	0.863	DOVER (head-only)
Video Quality Assessment	YouTube-UGC	PLCC	0.874	DOVER (end-to-end)
Video Quality Assessment	YouTube-UGC	PLCC	0.862	DOVER (head-only)
Video Quality Assessment	KoNViD-1k	PLCC	0.905	DOVER (end-to-end)
Video Quality Assessment	KoNViD-1k	PLCC	0.894	DOVER (head-only)
Video Quality Assessment	LIVE-FB LSVQ	PLCC	0.889	DOVER
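
The metric abbreviations in the table are usually read as Pearson linear correlation (PLCC), Spearman rank correlation (SRCC), and Kendall rank correlation (KLCC), each computed between a model's predicted scores and human mean opinion scores. The following is a minimal pure-Python sketch of those three coefficients, not the evaluation code used for the numbers above. The function names and the `mos` / `pred` values are hypothetical, chosen only for illustration, and the sketch skips the nonlinear score mapping that some VQA evaluations apply before computing PLCC.

```python
import math


def pearson(x, y):
    """Pearson linear correlation between two equal-length sequences."""
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    var_x = sum((a - mean_x) ** 2 for a in x)
    var_y = sum((b - mean_y) ** 2 for b in y)
    return cov / math.sqrt(var_x * var_y)


def average_ranks(values):
    """1-based ranks; tied values share the mean of their positions."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        # Extend j over the run of values equal to the one at position i.
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        mean_rank = (i + j) / 2 + 1
        for k in range(i, j + 1):
            ranks[order[k]] = mean_rank
        i = j + 1
    return ranks


def spearman(x, y):
    """Spearman rank correlation: Pearson correlation of the ranks."""
    return pearson(average_ranks(x), average_ranks(y))


def kendall(x, y):
    """Kendall tau-b rank correlation, computed over all pairs (O(n^2))."""
    concordant = discordant = only_x_tied = only_y_tied = 0
    n = len(x)
    for i in range(n):
        for j in range(i + 1, n):
            dx, dy = x[i] - x[j], y[i] - y[j]
            if dx == 0 and dy == 0:
                continue  # tied in both variables: excluded from both terms
            if dx == 0:
                only_x_tied += 1
            elif dy == 0:
                only_y_tied += 1
            elif dx * dy > 0:
                concordant += 1
            else:
                discordant += 1
    untied = concordant + discordant
    denom = math.sqrt((untied + only_x_tied) * (untied + only_y_tied))
    return (concordant - discordant) / denom


# Hypothetical data: five videos with human scores and model predictions.
mos = [1.0, 2.0, 3.0, 4.0, 5.0]
pred = [1.2, 1.9, 3.5, 3.1, 4.8]

print(f"PLCC={pearson(mos, pred):.3f}")
print(f"SRCC={spearman(mos, pred):.3f}")
print(f"KLCC={kendall(mos, pred):.3f}")
```

SRCC and KLCC depend only on how the predictions order the videos, so they are unaffected by any monotonic rescaling of the model's output, whereas PLCC also rewards a linear relationship with the human scores.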
