Zhenqiang Ying, Haoran Niu, Praful Gupta, Dhruv Mahajan, Deepti Ghadiyaram, Alan Bovik
Blind or no-reference (NR) perceptual picture quality prediction is a difficult, unsolved problem of great consequence to the social and streaming media industries that impacts billions of viewers daily. Unfortunately, popular NR prediction models perform poorly on real-world distorted pictures. To advance progress on this problem, we introduce the largest (by far) subjective picture quality database, containing about 40000 real-world distorted pictures and 120000 patches, on which we collected about 4M human judgments of picture quality. Using these picture and patch quality labels, we built deep region-based architectures that learn to produce state-of-the-art global picture quality predictions as well as useful local picture quality maps. Our innovations include picture quality prediction architectures that produce global-to-local inferences as well as local-to-global inferences (via feedback).
| Task | Dataset | Metric | Value | Model |
|---|---|---|---|---|
| Video Understanding | MSU NR VQA Database | KLCC | 0.7079 | PaQ-2-PiQ |
| Video Understanding | MSU NR VQA Database | PLCC | 0.8549 | PaQ-2-PiQ |
| Video Understanding | MSU NR VQA Database | SRCC | 0.8705 | PaQ-2-PiQ |
| Video Understanding | MSU SR-QA Dataset | KLCC | 0.57753 | PaQ-2-PiQ |
| Video Understanding | MSU SR-QA Dataset | PLCC | 0.70988 | PaQ-2-PiQ |
| Video Understanding | MSU SR-QA Dataset | SROCC | 0.71167 | PaQ-2-PiQ |
| Video Quality Assessment | MSU NR VQA Database | KLCC | 0.7079 | PaQ-2-PiQ |
| Video Quality Assessment | MSU NR VQA Database | PLCC | 0.8549 | PaQ-2-PiQ |
| Video Quality Assessment | MSU NR VQA Database | SRCC | 0.8705 | PaQ-2-PiQ |
| Video Quality Assessment | MSU SR-QA Dataset | KLCC | 0.57753 | PaQ-2-PiQ |
| Video Quality Assessment | MSU SR-QA Dataset | PLCC | 0.70988 | PaQ-2-PiQ |
| Video Quality Assessment | MSU SR-QA Dataset | SROCC | 0.71167 | PaQ-2-PiQ |
| Image Quality Assessment | MSU NR VQA Database | KLCC | 0.7079 | PaQ-2-PiQ |
| Image Quality Assessment | MSU NR VQA Database | PLCC | 0.8549 | PaQ-2-PiQ |
| Image Quality Assessment | MSU NR VQA Database | SRCC | 0.8705 | PaQ-2-PiQ |
| Video | MSU NR VQA Database | KLCC | 0.7079 | PaQ-2-PiQ |
| Video | MSU NR VQA Database | PLCC | 0.8549 | PaQ-2-PiQ |
| Video | MSU NR VQA Database | SRCC | 0.8705 | PaQ-2-PiQ |
| Video | MSU SR-QA Dataset | KLCC | 0.57753 | PaQ-2-PiQ |
| Video | MSU SR-QA Dataset | PLCC | 0.70988 | PaQ-2-PiQ |
| Video | MSU SR-QA Dataset | SROCC | 0.71167 | PaQ-2-PiQ |