TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers/AIM 2024 Challenge on Video Saliency Prediction: Methods a...

AIM 2024 Challenge on Video Saliency Prediction: Methods and Results

Andrey Moskalenko, Alexey Bryncev, Dmitry Vatolin, Radu Timofte, Gen Zhan, Li Yang, Yunlong Tang, Yiting Liao, Jiongzhi Lin, Baitao Huang, Morteza Moradi, Mohammad Moradi, Francesco Rundo, Concetto Spampinato, Ali Borji, Simone Palazzo, Yuxin Zhu, Yinan Sun, Huiyu Duan, Yuqin Cao, Ziheng Jia, Qiang Hu, Xiongkuo Min, Guangtao Zhai, Hao Fang, Runmin Cong, Xiankai Lu, Xiaofei Zhou, Wei zhang, Chunyu Zhao, Wentao Mu, Tao Deng, Hamed R. Tavakoli

2024-09-23Video CompressionVideo Saliency PredictionSaliency PredictionVideo Saliency DetectionSaliency Detection
PaperPDFCode(official)

Abstract

This paper reviews the Challenge on Video Saliency Prediction at AIM 2024. The goal of the participants was to develop a method for predicting accurate saliency maps for the provided set of video sequences. Saliency maps are widely exploited in various applications, including video compression, quality assessment, visual perception studies, the advertising industry, etc. For this competition, a previously unused large-scale audio-visual mouse saliency (AViMoS) dataset of 1500 videos with more than 70 observers per video was collected using crowdsourced mouse tracking. The dataset collection methodology has been validated using conventional eye-tracking data and has shown high consistency. Over 30 teams registered in the challenge, and there are 7 teams that submitted the results in the final phase. The final phase solutions were tested and ranked by commonly used quality metrics on a private test subset. The results of this evaluation and the descriptions of the solutions are presented in this report. All data, including the private test subset, is made publicly available on the challenge homepage - https://challenges.videoprocessing.ai/challenges/video-saliency-prediction.html.

Related Papers

GSVR: 2D Gaussian-based Video Representation for 800+ FPS with Hybrid Deformation Field2025-07-08Feature Hallucination for Self-supervised Action Recognition2025-06-25Video Compression for Spatiotemporal Earth System Data2025-06-24MSNeRV: Neural Video Representation with Multi-Scale Feature Fusion2025-06-18Audio-Visual Driven Compression for Low-Bitrate Talking Head Videos2025-06-16Rethinking Generative Human Video Coding with Implicit Motion Transformation2025-06-12Generalized Gaussian Entropy Model for Point Cloud Attribute Compression with Dynamic Likelihood Intervals2025-06-11Generative Models at the Frontier of Compression: A Survey on Generative Face Video Coding2025-06-09